Today, we shared evals for an early version of the next model in our o-model reasoning series: OpenAI o3 and o3-mini. On several of the most challenging frontier evals, OpenAI o3 sets new milestones for what’s possible in coding, math, and scientific reasoning. It also makes significant progress on the ARC-AGI evaluation for the first time. We plan to deploy these models early next year, but we’re opening up early access applications for safety and security researchers to test these frontier models starting today: https://lnkd.in/ghJWP_ui