Charles H. Martin, PhD’s Post

View profile for Charles H. Martin, PhD, graphic

AI Specialist and Distinguished Engineer (NLP & Search). Inventor of weightwatcher.ai . TEDx Speaker. Need help with AI ? #talkToChuck

"Fine-Tuning is a nightmare in practice..." (advice from X). I agree. I have built ML and AI models for clients for 20 years. eHow. eBay. Walmart. Blackrock. Even Google (well, Aardvark). It's hard. But the payoff is there --if you can get it right. Best example: eHow was the first $1B IPO since Google. So if you are Fine-Tuning your own models, how can you know if you are on the right path ? Weightwatcher can help. I invented weightwatcher to help my clients who are training and/or fine-tuning their own AI models. And it's open-source. How can it help ? Here are over a dozen examples of how to interpret weighwatcher results for Instruction Fine-Tuned models. Generally speaking, if you get the Fine-Tuning right, your model will follow the predictions of the weightwatcher HTSR theory. And it when it doesn't, that special case can be useful too. If you know what you are looking at. If you have a fine tuned model, and you have the base model, this is all you do: pip install weightwatcher import weightwatcher as ww watcher = ww.WeightWatcher() details = watcher.analyze(model=model, base_model=base_model) Weightwatcher will remove the instruction fine-tuned components from the base model and analyze them for you Want to learn more ? Check out the examples: https://lnkd.in/gS8bS3tM Have questions? Join our Community Discord: https://lnkd.in/gZQF64Bw Or ping me here. And if there are cases you think we should add, please let us know. WeightWatcher is a one-of-a-kind must-have tool for anyone training, deploying, or monitoring Deep Neural Networks (DNNs). #talkToChuck. #theAIguy

  • No alternative text description for this image
Marco Herzog 🤖

AI Insights from the Trenches | Translating Complex AI Concepts into Actionable Knowledge

4w

Oh wow, that sounds actually interesting. I was looking for something to get good evals for QLoRa adapters. I will check it out.

Playing in progress... :)

Remigiusz Kinas

Head of AI @ Grupa NEUCA | AI, iRPA | 2xKaggle GM | Speakleash core team

4w

Thank you Charles for Bielik evaluation 👍👏🙏

Like
Reply

Shoutout for sharing this incredible resource! It's a gift to the world. Your selflessness is inspiring, and we're all supporting your endeavors! 🚀👏

Igor Couto

CEO of Sofya | Founder & Chief Technologist at 1STI | Deep Tech Amplified Organization Author

2w
See more comments

To view or add a comment, sign in

Explore topics