PulseAugur
实时 23:34:28

OpenAI's new models let ChatGPT think with images for advanced reasoning

OpenAI has introduced its latest visual reasoning models, o3 and o4-mini, which allow AI to "think with images" as part of its internal reasoning process. These models can perform image manipulations like cropping and zooming natively, enhancing ChatGPT's ability to analyze complex visual data. This advancement leads to state-of-the-art performance on multimodal benchmarks, particularly in STEM question-answering and visual search, marking a significant step towards more capable multimodal AI agents. AI

排序理由 Frontier-lab model release with system card.

在 OpenAI News 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →

OpenAI's new models let ChatGPT think with images for advanced reasoning

报道来源 [1]

  1. OpenAI News TIER_1 English(EN) ·

    Thinking with images