Charles H. Martin, PhD’s Post

View profile for Charles H. Martin, PhD, graphic

AI Specialist and Distinguished Engineer (NLP & Search). Inventor of weightwatcher.ai . TEDx Speaker. Need help with AI ? #talkToChuck

I have been building production machine learning models for clients on Wall Street, Main Street, and in Silicon Valley for over 20 years. I believe that the next generation of AI models will use large scale instruction fine tuning. And to achieve this, you need to know if the fine-tuning is working. To solve this problem, I invented weightwatcher. If you want to get into the space and understand how to fine-tune models at scale, please check out these examples of what you should expect from a good model

View profile for Charles H. Martin, PhD, graphic

AI Specialist and Distinguished Engineer (NLP & Search). Inventor of weightwatcher.ai . TEDx Speaker. Need help with AI ? #talkToChuck

"Fine-Tuning is a nightmare in practice..." (advice from X). I agree. I have built ML and AI models for clients for 20 years. eHow. eBay. Walmart. Blackrock. Even Google (well, Aardvark). It's hard. But the payoff is there --if you can get it right. Best example: eHow was the first $1B IPO since Google. So if you are Fine-Tuning your own models, how can you know if you are on the right path ? Weightwatcher can help. I invented weightwatcher to help my clients who are training and/or fine-tuning their own AI models. And it's open-source. How can it help ? Here are over a dozen examples of how to interpret weighwatcher results for Instruction Fine-Tuned models. Generally speaking, if you get the Fine-Tuning right, your model will follow the predictions of the weightwatcher HTSR theory. And it when it doesn't, that special case can be useful too. If you know what you are looking at. If you have a fine tuned model, and you have the base model, this is all you do: pip install weightwatcher import weightwatcher as ww watcher = ww.WeightWatcher() details = watcher.analyze(model=model, base_model=base_model) Weightwatcher will remove the instruction fine-tuned components from the base model and analyze them for you Want to learn more ? Check out the examples: https://lnkd.in/gS8bS3tM Have questions? Join our Community Discord: https://lnkd.in/gZQF64Bw Or ping me here. And if there are cases you think we should add, please let us know. WeightWatcher is a one-of-a-kind must-have tool for anyone training, deploying, or monitoring Deep Neural Networks (DNNs). #talkToChuck. #theAIguy

  • No alternative text description for this image
📊 Alastair Muir, PhD, BSc, BEd, MBB

Data Science Consultant | @alastairmuir.bsky.social | Risk Analysis and Optimization

2w

*with fine tuning on carefully selected training data

Like
Reply

To view or add a comment, sign in

Explore topics