• Cryptocurrency
  • Earnings
  • Enterprise
  • About TechBooky
  • Submit Article
  • Advertise Here
  • Contact Us
TechBooky
  • African
  • AI
  • Metaverse
  • Gadgets
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
  • African
  • AI
  • Metaverse
  • Gadgets
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
TechBooky
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Home Enterprise

OpenAI o3 & o4 Mini Models Feature Visual Reasoning

Akinola Ajibola by Akinola Ajibola
April 18, 2025
in Enterprise
Share on FacebookShare on Twitter

The business’s latest reasoning-focused models with evident chain-of-thought (CoT) are called o3 and o4-mini. The San Francisco-based AI company said that these models have visual reasoning capability, meaning they can analyze and “think” about an image to respond to more complex user queries. The models are the successors to the o1 and o3-mini, and they will be available to ChatGPT’s paid subscribers at the moment. Notably, the business also put out the GPT-4.1 series of AI models earlier this week.

The announcement of the latest large language models (LLMs) was made via OpenAI’s official handle on X, formerly known as Twitter. The AI company referred to these models as the “smartest and most capable models” and said that they now had the capacity to reason visually.

In essence, visual reasoning implies that these AI models are better able to analyse images and extract implicit and contextual information from them. According to OpenAI’s website, these are the company’s first models capable of combining and using all of ChatGPT’s tools in an agentic manner. These consist of image analysis, file interpretation, online search, Python, and picture creation.

The reasoning models, according to Open’s, can now agentically use and combine all of ChatGPT’s tools, including web searches, Python-based file and data analysis, deep reasoning about visual inputs, and even image generation. Importantly, these models are trained to reason about when and how to use tools to produce thoughtful and detailed answers in the right output formats, usually in less than a minute, to solve more complex problems. Today, releasing OpenAI o3 and o4-mini, the latest in our o-series of models trained to think for longer before responding: these models are the smartest models the business have released to date, representing a step change in ChatGPT’s capabilities for everyone from curious users to advanced researchers. As a result, they are better equipped to handle complex queries, which is a step toward ChatGPT becoming more agentic and capable of carrying out activities on your behalf. Setting a new threshold for intelligence and utility, the combination of cutting-edge reasoning with complete tool access results in noticeably better performance on real-world activities and academic benchmarks.

This implies that the o3 and o4-mini AI models are able to search for the picture online, alter it by flipping, cropping, zooming, and improving it, and even execute a Python code to retrieve data. According to OpenAI, this would enable the models to extract information from photos that aren’t ideal.

These models are now capable of reading handwriting from an upside-down notebook, reading a far sign with hardly visible lettering, identifying a specific query from a long list, determining a bus timetable from a bus image, solving puzzles, and more.

Image Source: OpenAI

In terms of performance, OpenAI asserted that the o3 and o4-mini AI models beat the GPT-4o and o1 models on the CharXiv, MathVista, MMMU, and VLMs are blind benchmarks. There were no performance comparisons with external AI models disclosed by the company.

OpenAI also pointed out a number of these models’ drawbacks. Overly lengthy thinking chains might result from the AI models doing pointless picture editing processes and tool calls. Additionally prone to perception problems, the o3 and o4-mini may provide inaccurate answers by misinterpreting visual cues. The AI company also pointed out that there may be reliability-related problems with the models.

ChatGPT Plus, Pro, and Team users will be able to access both o3 and o4-mini AI models, which will take the place of the o1, o3-mini, and o3-mini-high models in the model selector. Next week, Enterprise and Edu users will be able to access the models through the Chat Completions and Responses application programming interfaces (APIs).

Combining the precise reasoning features of the o-series with more of the natural conversational skills and tool use of the GPT-series, which reflects the direction their models are going in. By combining these strengths, our future models will support advanced problem-solving and proactive tool use in addition to smooth, natural conversations.

More interesting features regarding the introduction of the o3 and the o4 mini, can be known can be read more on the blog site.

Related Posts:

  • Microsoft-datacenter-cold-aisle-server-racks-for-the-AMD-MI300X
    Microsoft Prepares for OpenAI's GPT-5 Launch
  • GettyImages-1778706504
    Rumour: Microsoft Developing AI Models to Rival OpenAI
  • LIVESNS6IVOAJL44LHJMGKDVZI
    Open AI's GPT-4.5 is Here for Pro Users
  • assets_task_01jryqpar7fd1vr3zjb9wj416t_img_0
    OpenAI Unveils GPT-4.1, Its Flagship AI Model
  • openai_o3-2
    OpenAI Launches Free o3-Mini Reasoning Model on ChatGPT
  • deepseek2-1024×640
    DeepSeek Launches R1 Reasoning Model on Hugging Face
  • 1738537437848
    ChatGPT Deep Research Now Links to GitHub Repos
  • W7BnebUnSW8Mxsq8EwkTs3-1200-80
    OpenAI Upgrades Operator Agent's AI Model

Discover more from TechBooky

Subscribe to get the latest posts sent to your email.

Tags: ChatGPTo3 minio4 mini model
Akinola Ajibola

Akinola Ajibola

BROWSE BY CATEGORIES

Select Category

    Receive top tech news directly in your inbox

    subscription from
    Loading

    Freshly Squeezed

    • Meta AI Reaches 1 Billion Monthly Users May 31, 2025
    • XChat, X’s New DM Feature, Available in Beta Testing May 31, 2025
    • Gmail Adds Gemini AI Summary Cards in May Update May 31, 2025
    • Nigeria Shines at Huawei ICT Competition May 31, 2025
    • 22 Nigerian Banks Join PAPSS Cross-Border Payment System May 31, 2025
    • Nintendo’s Hardware Finally Matches Switch Ambitions May 31, 2025

    Browse Archives

    June 2025
    MTWTFSS
     1
    2345678
    9101112131415
    16171819202122
    23242526272829
    30 
    « May    

    Quick Links

    • About TechBooky
    • Advertise Here
    • Contact us
    • Submit Article
    • Privacy Policy
    • Login

    © 2021 Design By Tech Booky Elite

    Generic selectors
    Exact matches only
    Search in title
    Search in content
    Post Type Selectors
    • African
    • Artificial Intelligence
    • Gadgets
    • Metaverse
    • Tips
    • About TechBooky
    • Advertise Here
    • Submit Article
    • Contact us

    © 2021 Design By Tech Booky Elite

    Discover more from TechBooky

    Subscribe now to keep reading and get access to the full archive.

    Continue reading

    We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.Ok