OpenAI's new models let ChatGPT think with images for advanced reasoning

By PulseAugur Editorial · [1 sources] · 2025-04-16 10:00

OpenAI has introduced its latest visual reasoning models, o3 and o4-mini, which allow AI to "think with images" as part of its internal reasoning process. These models can perform image manipulations like cropping and zooming natively, enhancing ChatGPT's ability to analyze complex visual data. This advancement leads to state-of-the-art performance on multimodal benchmarks, particularly in STEM question-answering and visual search, marking a significant step towards more capable multimodal AI agents. AI

RANK_REASON Frontier-lab model release with system card.

Read on OpenAI News →

CharXiv
ChatGPT
MathVista
o3
OpenAI
VLMs are Blind
o4-mini

model release
paper

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

OpenAI's new models let ChatGPT think with images for advanced reasoning

COVERAGE [1]

OpenAI News TIER_1 English(EN) · 2025-04-16 10:00

Thinking with images

COVERAGE [1]

Thinking with images

RELATED ENTITIES

RELATED TOPICS