Roshan Sumbaly’s Post

Director of AI at Meta - Llama

Llama 3.3 🦙 I'm back with the last release of the year - an early Christmas gift for the community. We're releasing an updated 70B model, which we know is the workhorse of the OSS community, but now as capable as our 405B. We’ve been refining our post-training recipe, introducing new online RL techniques that pushed on domains like math and reasoning. The minute we saw this model almost reach parity with 405B, we thought it would be great to share with everyone. This can now act as a powerful synthetic data generator or teacher for all your distillation needs. On a personal note, this has been an exciting year for the Llama organization. 5 community moments (4 Llama and Movie Gen), combined with lots of product updates, have kept us busy through 2024. 2025 will be the year of Llama 4. I’ll end by saying we’re hiring - Directors and Research Scientists. Ping me and join us on this journey! Download from Meta ➡️ https://lnkd.in/gPK9QzxM

6 Comments

Reza Pirayesh, PhD

Artificial Intelligence and Robotics Scientist

Is there a reference for the "new online RL techniques"?

2 Reactions

Ashish Patel 🇮🇳

Roshan Sumbaly You mentioned the introduction of new online RL techniques that enhanced domains like math and reasoning. Can you elaborate on the specific changes in the RL approach and how they differ from traditional methods in improving model performance?

Eddie Villa

Mail Carrier at United States Postal Service

Can you help me get my hacked Facebook Account Back? I was hacked 11-26-2024

Riccardo Santi

Well done, Duccio! You're really great!


More Relevant Posts

Mark Kovarski

Responsible AI | Co-Founder | CTO | Enterprise | Automation
1w
Llama 3.3 70B is Here!
🌍 A 70B multilingual model supporting 8 languages: English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai.
🔑 128K token context window.
📚 Trained on over 15 trillion tokens with knowledge updated to December 2023.
💻 88.4% on HumanEval for code generation.
🧮 91.1% on MGSM (Multilingual Grade School Math).
🚀 Significant improvements over Llama 3.1.
If you need the extra performance, Llama 3.3 runs on Groq at 2,300 tokens/second: https://lnkd.in/gjT6UvYK
Park Chansung

Researcher @ETRI, Google Developers Expert for {ML, GCP}, Fellow @Hugging Face
9mo
Vid2Persona: talk to a person from a video clip. A fun project over the last week with Sayak Paul. It has a simple pipeline, from extracting the traits of video characters to chatting with them. Under the hood, this project leverages the power of both commercial and open source models. We used Google's Gemini 1.0 Pro Vision model to understand the video content directly, then we used the HuggingFaceH4/zephyr-7b-beta model to hold the conversation! Try it on the Hugging Face Space and let us know what you think. https://lnkd.in/d5qw5jmx The Space application is a dedicated implementation for the ZeroGPU environment + Hugging Face Inference API with a PRO account. If you wish to host it in your own environment, consider duplicating the Space or running it locally from the project repository: https://lnkd.in/dWagsZdy
8 Comments
Pontus Abrahamsson

Engineer & Co-founder at Midday
4mo Edited
Inbox v1 is ready in Midday 🎉 Drag-and-drop files or use your unique email to reconcile everything, saving time and improving accuracy.
* Email provided by Postmark 📥
* Uploaded to Supabase storage 🗄️
* Background job via Trigger.dev 🔄
* Vercel AI SDK 🤖
* OCR extraction 👀
An in-depth tech blog is coming soon. Share any questions you may have.

2 Comments
Matt Ryan

Director, Social Listening & Intelligence at CMI Media
5mo
Folks, I am sorry to say Dronc is dead. I had to put my local LLM on oncology into storage as I stay with my family for the summer. Unfortunately for my family, they are going to have to deal with me puttering around the office during my free time creating another LLM on my local laptop. I don't know what your process is, but my kids won't stop commenting on my pacing and making jazz scat noises (link here for reference, and of course I don't sound this good when I do it: https://shorturl.at/3oUaX). I don't have a pithy name for it yet, but what I want to see is the efficacy of different models with the same data set for a given task. What I expect to see is a seesaw effect, where swapping in different models improves some aspects of completing the task while falling short in others. I'm probably going to end up just proving the point that fine-tuning, RAG, and data segmented through machine learning are necessary, but I want to tinker. If anyone else out there has melted their laptop trying to tinker with running local LLMs, please let me know if you have any other fun or novel projects to attempt! On the other hand, if you're looking to experiment yourself, I highly recommend the below Reddit threads: https://lnkd.in/d_esypdt https://lnkd.in/dY36-b-z
Lambda

21,282 followers
4mo Edited
One Llama, two Llamas, three Llamas… 405B Llamas?! Meta’s Llama 3.1 brings top-tier performance to the open-source community, a game-changer for innovation. With Llama 3.1 405B, a multi-node cluster is a must for full-precision inferencing. Read vLLM's complete deployment guide using Lambda’s 1-Click Clusters: https://lnkd.in/g3qrQh4i Get your on-demand cluster today: https://lnkd.in/eu9xieKe
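As a rough illustration of why the post calls a multi-node cluster a must, here is a back-of-the-envelope sketch of the weight memory alone. It assumes "full precision" means 16-bit (BF16) weights at 2 bytes per parameter, and it assumes a hypothetical node of 8 GPUs with 80 GB each; neither figure comes from the post, and the estimate ignores KV cache and activation memory, which only push the requirement higher.

```python
import math


def weight_memory_gb(num_params: float, bytes_per_param: int) -> float:
    """Memory needed to hold the model weights only, in decimal gigabytes."""
    return num_params * bytes_per_param / 1e9


def nodes_needed(required_gb: float, gpus_per_node: int, gb_per_gpu: int) -> int:
    """Smallest number of nodes whose combined GPU memory covers required_gb."""
    node_capacity_gb = gpus_per_node * gb_per_gpu
    return math.ceil(required_gb / node_capacity_gb)


# Assumption: 405e9 parameters stored as 16-bit values (2 bytes each).
required = weight_memory_gb(405e9, 2)

# Assumption: a hypothetical node with 8 GPUs of 80 GB each (640 GB total).
nodes = nodes_needed(required, gpus_per_node=8, gb_per_gpu=80)

print(f"weights: {required:.0f} GB, nodes needed: {nodes}")
```

Under these assumptions the weights alone exceed the memory of a single such node, which is consistent with the post's point about going multi-node.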
Kirk Marple

Technical Founder, CEO at Graphlit
3mo
Day 5 of the '30 Days of Graphlit' is here! In this example, we show how to use Graphlit for competitive intelligence and analyzing mentions on Reddit. By ingesting posts from r/Anthropic and enabling entity extraction, we can then filter for any Reddit posts that mention Google. Notebook: https://lnkd.in/gce-Af5i Colab: https://lnkd.in/ghKRSrad
Thomas Bordes

Connecting ML engineers & researchers with the world's easiest and most efficient AI infrastructure
4mo Edited
You're not finished counting Llamas! 🦙 3.1 series brings improved reasoning capabilities, 128K token context... and a 405B version. vLLM produced a guide for inferencing the 405B heavy-hitter - leveraging Lambda 1-Click Clusters. Multi-node is the way to go if you want full precision!
Manish Kumar

Student at Rungta College of Engineering & Technology Kohka-Kurud Bhilai.
7mo
🌟 Day 12: Array Adventures in DSA Land! 🚀 Today, I dove deeper into the world of Data Structures and Algorithms by conquering four intriguing array challenges. Here's a snapshot of my journey:
1. Zeroes to the End: I arranged all zeroes at the end, keeping the rest of the numbers in their original order.
2. Element Hunt: I quickly found a specific number in the array.
3. Array Merge: I combined two sorted arrays into one, avoiding any duplicates.
4. Common Elements: I discovered which numbers appear in both arrays.
I’m thrilled to see how these challenges improve my DSA skills! #100DaysOfCode #DSAJourney #SimpleCoding 🌟
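The four array exercises described in this post could be sketched in Python roughly as follows. The post does not show its own solutions or name a language, so the function names and the specific approaches here (a linear scan for the search, a two-pointer merge, a set intersection) are illustrative assumptions rather than the author's code.

```python
from typing import List


def move_zeroes_to_end(nums: List[int]) -> List[int]:
    """Return a new list with zeroes at the end; other values keep their order."""
    non_zero = [n for n in nums if n != 0]
    return non_zero + [0] * (len(nums) - len(non_zero))


def find_index(nums: List[int], target: int) -> int:
    """Linear scan: index of the first occurrence of target, or -1 if absent."""
    for i, n in enumerate(nums):
        if n == target:
            return i
    return -1


def merge_sorted_unique(a: List[int], b: List[int]) -> List[int]:
    """Two-pointer merge of two sorted lists, skipping duplicate values."""
    merged: List[int] = []
    i = j = 0
    while i < len(a) or j < len(b):
        # Take from a when b is exhausted or a's current value is not larger.
        if j >= len(b) or (i < len(a) and a[i] <= b[j]):
            value = a[i]
            i += 1
        else:
            value = b[j]
            j += 1
        # Inputs are sorted, so a duplicate always equals the last value kept.
        if not merged or merged[-1] != value:
            merged.append(value)
    return merged


def common_elements(a: List[int], b: List[int]) -> List[int]:
    """Values present in both lists, returned once each in ascending order."""
    return sorted(set(a) & set(b))


print(move_zeroes_to_end([0, 1, 0, 3, 12]))
print(find_index([4, 8, 15, 16], 15))
print(merge_sorted_unique([1, 2, 2, 4], [2, 3, 4, 5]))
print(common_elements([1, 2, 2, 4], [2, 3, 4, 5]))
```

If the inputs to the search were known to be sorted, a binary search would be the usual faster choice; the linear scan is used here because the post does not say.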
Jake Colling

Software Developer
6mo
Athena has first-class support for research-intensive workflows. Think product research, competitive analysis, sentiment analysis. It's accessible via chat or full-scale research reports, or, when you need even more control, you can use Jupyter notebooks and the Athena SDK to get exactly what you need. Here's a demo of that third option and a line-by-line walkthrough of the code to do it. Athena is in private beta. DM me if you want early access!
Karyna Naminas

CEO of Label Your Data. Helping AI teams deploy their ML models faster.
3w
🎉 The wait is over – we’re live on Product Hunt! 🎉 Introducing our Data Labeling Platform, built to simplify your annotation workflow: ✅ Computer vision focus (for now 😉) ✅ API access ✅ Free trial ✅ Cost calculator Perfect for ML engineers, AI-driven businesses, and academic researchers. 🥂 You're invited to our comment party there: (link in the comment)
36 Comments
