PulseAugur
实时 21:57:31
实体 GPT-5.5 Pro

GPT-5.5 Pro

PulseAugur coverage of GPT-5.5 Pro — every cluster mentioning GPT-5.5 Pro across labs, papers, and developer communities, ranked by signal.

Show in brief
总计 · 30天
12
90 天内 12
发布 · 30天
0
90 天内 0
论文 · 30天
5
90 天内 5
层级分布 · 90 天
关系
时间线
  1. 2026-05-11 research_milestone GPT-5.5 Pro independently solved open problems in number theory and generated research preprints. 来源
情绪 · 30 天

5 天有情绪数据

最近 · 第 1/1 页 · 共 12 条
  1. COMMENTARY · CL_48081 ·

    GPT-5.5 Pro excels as a fact-checker, notes Ethan Mollick

    Ethan Mollick has found GPT-5.5 Pro to be an effective tool for fact-checking large amounts of text, accurately identifying key references within chapters. He notes that the model's tendency to provide nuanced responses…

  2. TOOL · CL_48093 ·

    GPT-5.5 Pro attempts humor generation in academic challenge

    AI researcher Ethan Mollick has tasked GPT-5.5 Pro with a unique academic challenge: to analyze humor in word pairs and generate its own funny combinations. The model successfully produced phrases like "scrotum snorkel"…

  3. TOOL · CL_29136 ·

    Tiny models outperform frontier AI in agent coding benchmark

    A recent agent coding benchmark revealed that smaller, more efficient models are outperforming larger, frontier models. The SmolLM3 3B model, capable of running on a laptop, achieved a score of 93.3, significantly surpa…

  4. TOOL · CL_27087 ·

    Ten new LLMs including DeepSeek V4, Grok 4.20, GPT-5.5 Pro to be benchmarked

    A new benchmark test is scheduled to evaluate ten previously untested large language models, including DeepSeek V4 Pro, Grok 4.20, and GPT-5.5 Pro. The tests will focus on real-world agent coding tasks using a consisten…

  5. SIGNIFICANT · CL_26142 ·

    GPT-5.5 Pro Solves Number Theory Problems, Generates Research Papers

    OpenAI's GPT-5.5 Pro model has independently solved open problems in number theory, generating complete research preprints without human assistance. A notable mathematician described the output as being of solid doctora…

  6. RESEARCH · CL_23974 ·

    Google DeepMind AI assists mathematicians, tops FrontierMath benchmark

    Google DeepMind has released an AI system called "AI Co-Mathematician" designed to collaborate with human mathematicians on complex problems. This system, built on Gemini 3.1 Pro, achieved a new state-of-the-art score o…

  7. RESEARCH · CL_20620 ·

    AI research lags frontier models, misrepresenting capabilities, study finds

    A new paper reveals a significant gap between the capabilities of AI models evaluated in academic research and the actual frontier models available at the time. The study found that the median research paper evaluates m…

  8. FRONTIER RELEASE · CL_09563 ·

    GPT-5.5 Pro excels on benchmarks; Microsoft Playwright aids web agents

    OpenAI's GPT-5.5 Pro has reportedly achieved significant gains on the Epoch benchmark, with its base version outperforming the previous Pro model. This suggests substantial efficiency improvements in OpenAI's latest ite…

  9. TOOL · CL_06055 ·

    GPT-5.5 Pro shows sustained bug-fixing performance over 2-hour coding sessions

    A user reported that GPT-5.5 Pro demonstrated sustained performance during a two-hour bug-fixing session. This suggests the model may offer improved reliability for extended coding tasks. The specific details of the ses…

  10. TOOL · CL_05966 ·

    OpenAI's GPT-5.5 Pro achieves 145 visual IQ, nearing Mensa threshold

    OpenAI has reportedly developed a prototype smartphone, with mass production slated for 2028. The device is expected to feature advanced AI capabilities, with GPT-5.5 Pro achieving a visual IQ of 145, potentially meetin…

  11. TOOL · CL_05969 ·

    OpenAI unveils GPT-5.5 Pro with 145 visual IQ, nearing Mensa level

    OpenAI has reportedly developed a smartphone, with mass production slated for 2028. The device is rumored to feature GPT-5.5 Pro, an AI model with a claimed visual IQ of 145, potentially meeting Mensa-level standards. T…

  12. RESEARCH · CL_05974 ·

    DeepSeek V4 release sparks surge in Chinese semiconductor stocks, boosting domestic AI computing power

    DeepSeek V4's release has significantly boosted China's A-share semiconductor market, with sectors like GPU and semiconductor equipment experiencing a surge. This rally is attributed to V4's compatibility with Huawei's …