The biggest technical challenge for GenAI companies is building models to meet their lofty promises and big dreams. GPT-o1, nee "Strawberry" is an example of where I think we're headed next: More complex models built to simulate advanced "reasoning" through multi-step processes and more compute. Read between the lines and this starts to sound a lot like OpenAI serving a multi-step fine-tuned CoT system, heavily filtered and indexed through RLHF, as a model. That helps explain - limited access (enterprise, edu, and tier 5 only for the API, 30 prompts per week for ChatGPT Plus and Teams users), - 128k context window, some of which is consumed by new "reasoning tokens", - significantly slower responses ("anywhere from a few seconds to several minutes" according to documentation) - limited abilities (text only - no multimodal, no tools and function calling, no streaming) - specialized use cases - OpenAI recommends using 4o1 and 4o1-mini for tasks that require "more advanced reasoning" - specifically coding, math, and science problems, - anticipated higher cost due to compute demand This model just dropped, and it's not in my hands yet so I can't say anything meaningful about the actual performance. Once I have access I'll do comprehensive testing and give you my take on what this model is for and how to build solutions with it. More to come! #openai #strawberry #gpt #gpt4 #gpt4o #gpt4o1 #4o1
We're releasing a preview of OpenAI o1—a new series of AI models designed to spend more time thinking before they respond. This new series of AI models can reason through complex tasks and solve harder problems than previous models in science, coding, and math. Rolling out today in ChatGPT to all Plus and Team users, and in the API for developers on tier 5. More here: www.openai.com/o1
So I got access and used it over the weekend on an interesting task. Just a spot of devops on a Linux VPS I run to take off-cloud-backups. Problem is, I'm not devops, I can't afford to pay for devops to deal with this, and what devops I do is limited to task based work - so I miss some knowledge. I'm educated enough to do things and be useful on the command line... and to be very dangerous as well, and restoring a backup server from another backup is a slow and painful process that I wished to avoid! o1 walked me through the process, answered my questions, and provided contextful responses after I pasted in error messages or returns and explained patiently, like a good teacher, what had happened and why it mattered. It was perfect and turned a one day stressful job into about an hour of low stress work where I felt supported. Lovely.
Insightful
I architect intuitive and joyful digital experiences. My work isn't just maintainable; it's clear, standard-driven, and built for collaboration.
3moThis new model got me wondering so many things. One being, is an uncensored model involved? Because that would explain why some AI safety people were leaving OpenAI.