OpenAI Unveils ‘Project Strawberry’ as o1, Enhancing Reasoning Capabilities

OpenAI has officially launched o1, the much-anticipated model previously known as “Project Strawberry.”

This new AI model aims to advance the capabilities of reasoning and problem-solving, marking a significant leap in AI development. o1, including its initial versions — o1-preview and o1-mini — are now available in a preview phase to ChatGPT Plus and Teams subscribers.

We're releasing a preview of OpenAI o1—a new series of AI models designed to spend more time thinking before they respond.

These models can reason through complex tasks and solve harder problems than previous models in science, coding, and math. https://t.co/peKzzKX1bu

— OpenAI (@OpenAI) September 12, 2024

The o1-preview model allows users up to 30 messages, while the o1-mini version is limited to 50 messages.

OpenAI explained:

“We trained these models to spend more time thinking through problems before they respond, much like a person would.”

This approach involves the model refining its thinking process, trying various strategies, and recognising its mistakes.

This advancement is particularly critical because traditional AI models often suffer from “hallucinations,” where they generate inaccurate or nonsensical outputs due to a lack of real reasoning.

By enhancing the model’s ability to deliberate and analyse, OpenAI aims to produce more accurate and reliable responses.

O1’s Advanced Training Techniques

OpenAI’s new o1 model is built on a fundamentally different training methodology compared to its predecessors.

Bob McGrew, OpenAI’s chief research officer shared:

“We have been spending many months working on reasoning because we think this is actually the critical breakthrough.”

Some of our researchers behind OpenAI o1 🍓 pic.twitter.com/XnMx9vY2J2

— OpenAI (@OpenAI) September 12, 2024

Unlike earlier models that focused on predicting the next word or sentence based on patterns, o1 utilises a new optimisation algorithm and a specially tailored training dataset.

This model applies reinforcement learning, where it learns through rewards and penalties, enabling it to solve problems in a step-by-step manner akin to human thought processes.

The result is a model that can handle complex tasks with greater efficiency.

For instance, in a demonstration, the new model was able to solve intricate puzzles and mathematical problems with notable accuracy.

During a test, o1 achieved an impressive 83 percent on the International Mathematics Olympiad qualifying exam, a significant improvement over GPT-4o’s 13 percent.

In online programming contests, o1 reached the 89th percentile, showcasing its advanced problem-solving abilities.

here is o1, a series of our most capable and aligned models yet:https://t.co/yzZGNN8HvD

o1 is still flawed, still limited, and it still seems more impressive on first use than it does after you spend more time with it. pic.twitter.com/Qs1HoSDOz1

— Sam Altman (@sama) September 12, 2024

Enhanced Problem-Solving and Reasoning

OpenAI’s o1 model represents a significant advancement in handling complex queries.

It excels in areas such as coding and mathematical reasoning, performing better than previous models in generating and debugging complex code and solving intricate mathematical formulas.

OpenAI just released its new AI model, OpenAI o1 (internally known as Project Strawberry/Q*)

The wait is finally over.

Here's the model coding an entire video game from a prompt 🤯

The model can think before it answers and is better at math and programming challenges.

The… pic.twitter.com/hIokkdow2T

— Rowan Cheung (@rowancheung) September 12, 2024

Jakub Pachocki, OpenAI’s chief scientist, noted that the new model could take its time to solve problems, which is a departure from the rapid, pattern-based responses of earlier versions.

He stated:

“This model can take its time. It can think through the problem — in English — and try to break it down and look for angles in an effort to provide the best answer.”

OpenAI o1 solves a complex logic puzzle. pic.twitter.com/rpJbh8FkAg

— OpenAI (@OpenAI) September 12, 2024

However, this advanced capability comes with a higher cost.

For developers using the API, the pricing is notably steeper: o1-preview costs $15 per 1 million input tokens and $60 per 1 million output tokens, compared to GPT-4o’s $5 and $15, respectively.

o1-preview is 100x more expensive than GPT-4o mini, costing $15 per million input tokens compared to GPT-4o mini's $0.15. pic.twitter.com/vOlJX6TFNY

— Nicolas Bustamante (@nicbstme) September 12, 2024

Despite its strengths, o1 does have limitations, such as not being as effective in factual knowledge and lacking functionalities like web browsing or image processing.

Human-Like Processing and Future Directions

The introduction of o1 also includes innovative features designed to mimic human-like reasoning.

During demonstrations, the model exhibited phrases like “I’m curious about,” “I’m thinking through,” and “Ok, let me see,” which create an illusion of deliberate thought processes.

Although the model is not truly sentient, these features enhance the user experience by making the reasoning steps more transparent.

OpenAI’s future plans involve expanding o1’s capabilities further.

The company aims to introduce features such as browsing, file, and image uploading to increase the model’s utility.

As OpenAI continues to seek funding and aims for a $150 billion valuation, advancements like o1 are crucial in the broader quest for developing autonomous systems and achieving human-like intelligence.