The biggest technical challenge for GenAI companies is building models to meet their lofty promises and big dreams.
OpenAI o1, née "Strawberry," is an example of where I think we're headed next: more complex models built to simulate advanced "reasoning" through multi-step processes and more compute.
Read between the lines and this starts to sound a lot like OpenAI serving a multi-step, fine-tuned chain-of-thought (CoT) system, heavily filtered and indexed through RLHF, as a model.
That helps explain:
- limited access (enterprise, edu, and tier 5 only for the API; 30 prompts per week for ChatGPT Plus and Team users),
- 128k context window, some of which is consumed by new "reasoning tokens",
- significantly slower responses ("anywhere from a few seconds to several minutes" according to documentation),
- limited abilities (text only: no multimodal, no tools or function calling, no streaming),
- specialized use cases: OpenAI recommends using o1-preview and o1-mini for tasks that require "more advanced reasoning," specifically coding, math, and science problems,
- anticipated higher cost due to compute demand.
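To make the "reasoning tokens" point concrete, here's a rough back-of-the-envelope sketch in Python. It assumes the prompt and the hidden reasoning tokens both count against the same 128k window; the function name and the example token counts are mine, purely for illustration, and are not from OpenAI's documentation.

```python
# Illustrative arithmetic only: the 128k figure comes from the post above;
# the prompt and reasoning token counts below are made-up example values.
CONTEXT_WINDOW = 128_000


def remaining_for_answer(prompt_tokens: int, reasoning_tokens: int,
                         context_window: int = CONTEXT_WINDOW) -> int:
    """Return how many tokens are left for the visible answer."""
    if prompt_tokens < 0 or reasoning_tokens < 0:
        raise ValueError("token counts must be non-negative")
    used = prompt_tokens + reasoning_tokens
    # Clamp at zero: an over-budget request leaves nothing for the answer.
    return max(context_window - used, 0)


if __name__ == "__main__":
    # Hypothetical request: 20k-token prompt, 30k hidden reasoning tokens.
    print(remaining_for_answer(20_000, 30_000))  # 78000
```

The takeaway: the more the model "thinks," the less room is left for the answer you actually see, so long prompts plus hard problems can eat the window quickly.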
This model just dropped, and it's not in my hands yet, so I can't say anything meaningful about its actual performance. Once I have access, I'll do comprehensive testing and give you my take on what this model is for and how to build solutions with it.
More to come!
#openai #strawberry #gpt #gpt4 #gpt4o #gpt4o1 #4o1
We're releasing a preview of OpenAI o1—a new series of AI models designed to spend more time thinking before they respond.
This new series of AI models can reason through complex tasks and solve harder problems than previous models in science, coding, and math.
Rolling out today in ChatGPT to all Plus and Team users, and in the API for developers on tier 5.
More here: www.openai.com/o1