"Fine-Tuning is a nightmare in practice..." (advice from X).

I agree. I have built ML and AI models for clients for 20 years: eHow, eBay, Walmart, BlackRock, even Google (well, Aardvark). It's hard. But the payoff is there, if you can get it right. Best example: eHow was the first $1B IPO since Google.

So if you are fine-tuning your own models, how can you know if you are on the right path? WeightWatcher can help. I invented WeightWatcher to help my clients who are training and/or fine-tuning their own AI models. And it's open-source.

How can it help? Here are over a dozen examples of how to interpret WeightWatcher results for instruction fine-tuned models. Generally speaking, if you get the fine-tuning right, your model will follow the predictions of the WeightWatcher HTSR theory. And when it doesn't, that special case can be useful too, if you know what you are looking at.

If you have a fine-tuned model, and you have the base model, this is all you do:

pip install weightwatcher

import weightwatcher as ww
watcher = ww.WeightWatcher()
details = watcher.analyze(model=model, base_model=base_model)

WeightWatcher will separate the instruction fine-tuned components from the base model and analyze them for you.

Want to learn more? Check out the examples: https://lnkd.in/gS8bS3tM

Have questions? Join our Community Discord: https://lnkd.in/gZQF64Bw

Or ping me here. And if there are cases you think we should add, please let us know.

WeightWatcher is a one-of-a-kind must-have tool for anyone training, deploying, or monitoring Deep Neural Networks (DNNs).

#talkToChuck #theAIguy
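For anyone wondering what to do with the `details` that the analyze call in the post returns, here is a minimal sketch of a first pass over the per-layer results. It assumes `details` is a pandas DataFrame with `layer_id` and `alpha` columns, and it uses the commonly quoted HTSR rule of thumb that well-trained layers have alpha roughly between 2 and 6. The helper name `summarize_alphas` and the exact thresholds are illustrative assumptions, not part of the WeightWatcher API.

```python
import math

# Assumed rule of thumb from HTSR: alpha roughly in [2, 6] for well-trained layers.
ALPHA_MIN, ALPHA_MAX = 2.0, 6.0


def summarize_alphas(rows, alpha_min=ALPHA_MIN, alpha_max=ALPHA_MAX):
    """Summarize per-layer alpha values (hypothetical helper).

    rows: iterable of dicts with 'layer_id' and 'alpha' keys,
    for example details.to_dict("records") on the analyze() output.
    """
    # Keep only layers that have a usable alpha (skip None and NaN).
    fitted = [
        (r["layer_id"], float(r["alpha"]))
        for r in rows
        if r.get("alpha") is not None and not math.isnan(float(r["alpha"]))
    ]
    if not fitted:
        return {"n_layers": 0, "mean_alpha": None,
                "below_range": [], "above_range": []}

    alphas = [a for _, a in fitted]
    return {
        "n_layers": len(fitted),
        "mean_alpha": sum(alphas) / len(alphas),
        # alpha below the range is often read as a sign of over-fitting
        "below_range": [lid for lid, a in fitted if a < alpha_min],
        # alpha above the range is often read as a sign of under-training
        "above_range": [lid for lid, a in fitted if a > alpha_max],
    }


# Usage with the post's snippet (requires weightwatcher and your models):
#   summary = summarize_alphas(details.to_dict("records"))
#   print(summary)
```

The layers listed in `below_range` and `above_range` are the ones worth a closer look; how to read them for a specific fine-tune is what the linked examples cover.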
Playing in progress... :)
Thank you Charles for Bielik evaluation 👍👏🙏
Shoutout for sharing this incredible resource! It's a gift to the world. Your selflessness is inspiring, and we're all supporting your endeavors! 🚀👏
Oh wow, that sounds actually interesting. I was looking for something to get good evals for QLoRA adapters. I will check it out.