GPT-5.5 Pro
PulseAugur coverage of GPT-5.5 Pro — every cluster mentioning GPT-5.5 Pro across labs, papers, and developer communities, ranked by signal.
- 2026-05-11 research_milestone GPT-5.5 Pro independently solved open problems in number theory and generated research preprints. source
6 day(s) with sentiment data
-
GPT-5.5 Pro enhances academic paper with new data and arguments
Ethan Mollick shared an experience using GPT-5.5 Pro to analyze a past academic paper. The AI model was able to identify new data, perform analysis, create reproducible files, and even extend the paper's core argument i…
-
Satya Nadella champions AI "frontier ecosystems" over models
Microsoft CEO Satya Nadella has articulated a new AI strategy focused on building "frontier ecosystems" rather than solely on frontier models. This approach emphasizes creating "learning loops" where human and token cap…
-
Claude Fable 5 leads AI coding benchmarks, surpasses GPT-5.5
Anthropic's Claude Fable 5 has emerged as a leading AI model, significantly outperforming competitors like OpenAI's GPT-5.5 and Google's Gemini 3.1 Pro in coding benchmarks. Fable 5 achieved an 80.3% success rate on SWE…
-
AI agent Moonshine generates mathematical conjectures with GPT-5.5
A new autonomous agent named Moonshine has been developed to generate mathematical conjectures and make progress on them. Moonshine explores complex problems by distilling new concepts and building theoretical framework…
-
DeepSeek undercuts GPT-5.5 Pro on cost, Microsoft revises Copilot pricing
DeepSeek's new model offers a significantly lower cost per task compared to GPT-5.5 Pro, with DeepSeek charging only one dollar versus GPT-5.5 Pro's twenty-two dollars. This pricing shift comes as Microsoft transitions …
-
DeepSeek V4 Pro outperforms GPT-5.5 Pro in precision tests
DeepSeek's V4 Pro model has reportedly surpassed OpenAI's GPT-5.5 Pro in precision benchmarks. This achievement marks a significant step for DeepSeek in the competitive landscape of large language models. The performanc…
-
GLM 4.7 generates images for fraction of GPT-5.5 Pro cost
A comparison shared on Mastodon highlights that GLM 4.7 generated a unicorn image for just $0.0032. This cost is reportedly 448 times cheaper than the most expensive attempt using GPT-5.5 Pro. The image generation quali…
-
Claude Opus 4.8 drafts academic paper, GPT-5.5 Pro spots errors
Ethan Mollick utilized Anthropic's Claude 3 Opus 4.8 in its Code environment to generate an academic paper from a large dataset of de-identified research files. He then employed OpenAI's GPT-5.5 Pro as a reviewer, which…
-
GPT-5.5 Pro excels as a fact-checker, notes Ethan Mollick
Ethan Mollick has found GPT-5.5 Pro to be an effective tool for fact-checking large amounts of text, accurately identifying key references within chapters. He notes that the model's tendency to provide nuanced responses…
-
GPT-5.5 Pro attempts humor generation in academic challenge
AI researcher Ethan Mollick has tasked GPT-5.5 Pro with a unique academic challenge: to analyze humor in word pairs and generate its own funny combinations. The model successfully produced phrases like "scrotum snorkel"…
-
Tiny models outperform frontier AI in agent coding benchmark
A recent agent coding benchmark revealed that smaller, more efficient models are outperforming larger, frontier models. The SmolLM3 3B model, capable of running on a laptop, achieved a score of 93.3, significantly surpa…
-
Ten new LLMs including DeepSeek V4, Grok 4.20, GPT-5.5 Pro to be benchmarked
A new benchmark test is scheduled to evaluate ten previously untested large language models, including DeepSeek V4 Pro, Grok 4.20, and GPT-5.5 Pro. The tests will focus on real-world agent coding tasks using a consisten…
-
GPT-5.5 Pro Solves Number Theory Problems, Generates Research Papers
OpenAI's GPT-5.5 Pro model has independently solved open problems in number theory, generating complete research preprints without human assistance. A notable mathematician described the output as being of solid doctora…
-
Google DeepMind AI assists mathematicians, tops FrontierMath benchmark
Google DeepMind has released an AI system called "AI Co-Mathematician" designed to collaborate with human mathematicians on complex problems. This system, built on Gemini 3.1 Pro, achieved a new state-of-the-art score o…
-
AI research lags frontier models, misrepresenting capabilities, study finds
A new paper reveals a significant gap between the capabilities of AI models evaluated in academic research and the actual frontier models available at the time. The study found that the median research paper evaluates m…
-
GPT-5.5 Pro excels on benchmarks; Microsoft Playwright aids web agents
OpenAI's GPT-5.5 Pro has reportedly achieved significant gains on the Epoch benchmark, with its base version outperforming the previous Pro model. This suggests substantial efficiency improvements in OpenAI's latest ite…
-
GPT-5.5 Pro shows sustained bug-fixing performance over 2-hour coding sessions
A user reported that GPT-5.5 Pro demonstrated sustained performance during a two-hour bug-fixing session. This suggests the model may offer improved reliability for extended coding tasks. The specific details of the ses…
-
OpenAI's GPT-5.5 Pro achieves 145 visual IQ, nearing Mensa threshold
OpenAI has reportedly developed a prototype smartphone, with mass production slated for 2028. The device is expected to feature advanced AI capabilities, with GPT-5.5 Pro achieving a visual IQ of 145, potentially meetin…
-
OpenAI unveils GPT-5.5 Pro with 145 visual IQ, nearing Mensa level
OpenAI has reportedly developed a smartphone, with mass production slated for 2028. The device is rumored to feature GPT-5.5 Pro, an AI model with a claimed visual IQ of 145, potentially meeting Mensa-level standards. T…
-
DeepSeek V4 release sparks surge in Chinese semiconductor stocks, boosting domestic AI computing power
DeepSeek V4's release has significantly boosted China's A-share semiconductor market, with sectors like GPU and semiconductor equipment experiencing a surge. This rally is attributed to V4's compatibility with Huawei's …