Trevor Chow’s Post

View profile for Trevor Chow, graphic

Co-founder at Moonglow (YC S24)

what should pretraining compute be spent on? 4 years ago: mostly parameters 2 years ago: an even split of parameters and data now: mostly on data you might be wondering why it has changed so much; here's a brief intellectual history of pretraining recipes and why it has evolved over the past 4 years https://lnkd.in/gSSQse8X

Three Kuhnian Revolutions in ML Training

Three Kuhnian Revolutions in ML Training

blog.moonglow.ai

Ben Warren

Co-Founder @ Snowpilot (YC S24), Ex-Microsoft

2mo

good stuff!!💡

Like
Reply

To view or add a comment, sign in

Explore topics