Initial benchmarks of providers of Meta's new Llama 3.3 70B model 📊
In our independent evaluations, AI at Meta's Llama 3.3 70B model demonstrates intelligence comparable to OpenAI's GPT-4o and Mistral Large 2, and approaches the capabilities of Claude 3.5 Sonnet and Gemini 1.5 Pro.
Llama 3.3 70B provides a clear upgrade path for users of 3.1 70B, currently the most popular open-source model. It is also a potential opportunity for users of Llama 3.1 405B to access comparable intelligence at significantly faster speeds and lower cost, though we recommend extensive testing of your specific use-case before doing so.
Llama 3.3 70B sets itself apart with its permissive open-source license and now with the launch of these APIs, the speed and cost at which this intelligence can be accessed.
Congratulations to Cerebras Systems , SambaNova Systems , Groq , Fireworks AI , Together AI , Deep Infra Inc. and Hyperbolic on being fast to launch endpoints!
In particular, we are seeing Cerebras Systems , SambaNova Systems and Groq set new records for the speed at which this level of intelligence can be accessed with their AI-focused custom chips. Congratulations to Cerebras Systems for being the fastest endpoint we benchmark with their blazing 2,237 output tokens/s.
All endpoints are priced at below $1/M tokens (blended 3:1, input:output price), well below proprietary model endpoints of comparable intelligence (GPT-4o is $4.3 on the same basis). Congratulations to Deep Infra Inc. and Hyperbolic on offering the lowest price endpoints.
See the attached article for further analysis