• Cryptocurrency
  • Earnings
  • Enterprise
  • About TechBooky
  • Submit Article
  • Advertise Here
  • Contact Us
TechBooky
  • African
  • AI
  • Metaverse
  • Gadgets
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
  • African
  • AI
  • Metaverse
  • Gadgets
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
TechBooky
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Home Enterprise

OpenAI o3 & o4 Mini Models Feature Visual Reasoning

Akinola Ajibola by Akinola Ajibola
April 18, 2025
in Enterprise
Share on FacebookShare on Twitter

The business’s latest reasoning-focused models with evident chain-of-thought (CoT) are called o3 and o4-mini. The San Francisco-based AI company said that these models have visual reasoning capability, meaning they can analyze and “think” about an image to respond to more complex user queries. The models are the successors to the o1 and o3-mini, and they will be available to ChatGPT’s paid subscribers at the moment. Notably, the business also put out the GPT-4.1 series of AI models earlier this week.

The announcement of the latest large language models (LLMs) was made via OpenAI’s official handle on X, formerly known as Twitter. The AI company referred to these models as the “smartest and most capable models” and said that they now had the capacity to reason visually.

In essence, visual reasoning implies that these AI models are better able to analyse images and extract implicit and contextual information from them. According to OpenAI’s website, these are the company’s first models capable of combining and using all of ChatGPT’s tools in an agentic manner. These consist of image analysis, file interpretation, online search, Python, and picture creation.

The reasoning models, according to Open’s, can now agentically use and combine all of ChatGPT’s tools, including web searches, Python-based file and data analysis, deep reasoning about visual inputs, and even image generation. Importantly, these models are trained to reason about when and how to use tools to produce thoughtful and detailed answers in the right output formats, usually in less than a minute, to solve more complex problems. Today, releasing OpenAI o3 and o4-mini, the latest in our o-series of models trained to think for longer before responding: these models are the smartest models the business have released to date, representing a step change in ChatGPT’s capabilities for everyone from curious users to advanced researchers. As a result, they are better equipped to handle complex queries, which is a step toward ChatGPT becoming more agentic and capable of carrying out activities on your behalf. Setting a new threshold for intelligence and utility, the combination of cutting-edge reasoning with complete tool access results in noticeably better performance on real-world activities and academic benchmarks.

This implies that the o3 and o4-mini AI models are able to search for the picture online, alter it by flipping, cropping, zooming, and improving it, and even execute a Python code to retrieve data. According to OpenAI, this would enable the models to extract information from photos that aren’t ideal.

These models are now capable of reading handwriting from an upside-down notebook, reading a far sign with hardly visible lettering, identifying a specific query from a long list, determining a bus timetable from a bus image, solving puzzles, and more.

Image Source: OpenAI

In terms of performance, OpenAI asserted that the o3 and o4-mini AI models beat the GPT-4o and o1 models on the CharXiv, MathVista, MMMU, and VLMs are blind benchmarks. There were no performance comparisons with external AI models disclosed by the company.

OpenAI also pointed out a number of these models’ drawbacks. Overly lengthy thinking chains might result from the AI models doing pointless picture editing processes and tool calls. Additionally prone to perception problems, the o3 and o4-mini may provide inaccurate answers by misinterpreting visual cues. The AI company also pointed out that there may be reliability-related problems with the models.

ChatGPT Plus, Pro, and Team users will be able to access both o3 and o4-mini AI models, which will take the place of the o1, o3-mini, and o3-mini-high models in the model selector. Next week, Enterprise and Edu users will be able to access the models through the Chat Completions and Responses application programming interfaces (APIs).

Combining the precise reasoning features of the o-series with more of the natural conversational skills and tool use of the GPT-series, which reflects the direction their models are going in. By combining these strengths, our future models will support advanced problem-solving and proactive tool use in addition to smooth, natural conversations.

More interesting features regarding the introduction of the o3 and the o4 mini, can be known can be read more on the blog site.

Related Posts:

  • Microsoft-datacenter-cold-aisle-server-racks-for-the-AMD-MI300X
    Microsoft Prepares for OpenAI's GPT-5 Launch
  • GettyImages-1778706504
    Rumour: Microsoft Developing AI Models to Rival OpenAI
  • LIVESNS6IVOAJL44LHJMGKDVZI
    Open AI's GPT-4.5 is Here for Pro Users
  • assets_task_01jryqpar7fd1vr3zjb9wj416t_img_0
    OpenAI Unveils GPT-4.1, Its Flagship AI Model
  • openai_o3-2
    OpenAI Launches Free o3-Mini Reasoning Model on ChatGPT
  • 1738537437848
    ChatGPT Deep Research Now Links to GitHub Repos
  • openai-stack-overflow
    OpenAI & Stack Overflow Partner to Revamp…
  • chatgpt_openai_reuters_1675831938432
    OpenAI Strikes Licensing Deal with People Magazine Publisher

Discover more from TechBooky

Subscribe to get the latest posts sent to your email.

Tags: ChatGPTo3 minio4 mini model
Akinola Ajibola

Akinola Ajibola

BROWSE BY CATEGORIES

Select Category

    Receive top tech news directly in your inbox

    subscription from
    Loading

    Freshly Squeezed

    • Truecaller Filters Verified Business Messages May 12, 2025
    • ChatGPT Deep Research Now Links to GitHub Repos May 12, 2025
    • Microsoft Offers Guide to Fix Windows Blue Screen Errors May 12, 2025
    • We’ve Invested $10b in Nigeria so Far – MTN May 12, 2025
    • Tech Hype vs. Reality – When Big Tech Missed the Mark Pt. 3 May 11, 2025
    • Google’s Antitrust Showdown, AI vs. Search, and other Headlines May 11, 2025

    Browse Archives

    May 2025
    MTWTFSS
     1234
    567891011
    12131415161718
    19202122232425
    262728293031 
    « Apr    

    Quick Links

    • About TechBooky
    • Advertise Here
    • Contact us
    • Submit Article
    • Privacy Policy

    Recent News

    Truecaller Filters Verified Business Messages

    Truecaller Filters Verified Business Messages

    May 12, 2025
    ChatGPT Deep Research Now Links to GitHub Repos

    ChatGPT Deep Research Now Links to GitHub Repos

    May 12, 2025
    Microsoft Offers Guide to Fix Windows Blue Screen Errors

    Microsoft Offers Guide to Fix Windows Blue Screen Errors

    May 12, 2025
    The NCC Commissioned MTNN To Lease Spectrums From NTEL And Renew Its 3G Spectrum

    We’ve Invested $10b in Nigeria so Far – MTN

    May 12, 2025
    Tech Hype vs. Reality – When Big Tech Missed the Mark Pt. 1

    Tech Hype vs. Reality – When Big Tech Missed the Mark Pt. 3

    May 11, 2025
    Google’s Antitrust Showdown, AI vs. Search, and other Headlines

    Google’s Antitrust Showdown, AI vs. Search, and other Headlines

    May 11, 2025
    • Login

    © 2021 Design By Tech Booky Elite

    Generic selectors
    Exact matches only
    Search in title
    Search in content
    Post Type Selectors
    • African
    • Artificial Intelligence
    • Gadgets
    • Metaverse
    • Tips
    • About TechBooky
    • Advertise Here
    • Submit Article
    • Contact us

    © 2021 Design By Tech Booky Elite

    Discover more from TechBooky

    Subscribe now to keep reading and get access to the full archive.

    Continue reading

    We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.Ok