GPT-5.5
PulseAugur coverage of GPT-5.5 — every cluster mentioning GPT-5.5 across labs, papers, and developer communities, ranked by signal.
- developed by GPT-5.5-Instant 95%
- competes with DeepSeek 90%
- developed GPT-5.4 90%
- used by DeepSeek V4-Flash 90%
- used by Trusted Access for Cyber 90%
- developed by GPT-5.1 90%
- developed by Romain Huet 90%
- instance of Ethan Mollick 90%
- competes with Grok 4.3 85%
- competes with Mythos 80%
- competes with Claude Code 80%
- competes with Gemini 2.5-Flash 80%
- 2026-05-17 product_launch OpenAI released GPT-5.5, a new iteration of its language model.
- 2026-05-17 product_launch OpenAI designates GPT-5.5 as the primary upgrade path for older models.
- 2026-05-14 product_launch OpenAI has released its new model, GPT-5.5, via API. 来源
- 2026-05-14 research_milestone GPT-5.5 and Claude Mythos showed comparable performance in vulnerability-finding tasks during a UK AI Security Institute evaluation.
- 2026-05-12 product_launch OpenAI's GPT-5.5 launch has led to a surge in user adoption and revenue.
- 2026-05-11 product_launch OpenAI has doubled the list price for its GPT-5.5 model, leading to higher real-world costs for developers.
- 2026-05-11 product_launch OpenAI launched the GPT-5.5 model with significant price increases.
- 2026-05-10 research_milestone GPT-5.5 achieved a higher score than Claude Opus on the Artificial Analysis intelligence benchmark. 来源
- 2026-05-10 product_launch OpenAI launched GPT-5.5 with a significant price increase over its predecessor.
- 2026-05-08 product_launch GPT-5.5 launched with a significant price increase compared to its predecessor.
- 2026-05-07 product_launch OpenAI launched GPT-5.5 with a significant price increase over its predecessor.
- 2026-04-30 product_launch OpenAI released its new GPT-5.5 model, showing competitiveness with leading models.
- 2026-04-30 product_launch OpenAI released its new GPT-5.5 model.
- 2026-04-23 product_launch OpenAI launched its GPT-5.5 model, reporting rapid revenue growth and strong enterprise adoption.
- 2023-01-10 product_launch OpenAI launched its new model, GPT-5.5, reporting strong initial revenue growth.
21 天有情绪数据
-
在 GPT-5.5 挣扎之际,用户寻求开源 Claude Design 替代方案
用户正在寻找 Claude Design 的开源替代方案,用于生成 UI 元素和原型。虽然一些用户正在尝试使用 OpenDesign 等工具,并结合 GPT-5.5,但他们发现结果缺乏 Claude Design 的精细度。Google 的 Stitch 被提及为一个潜在的替代方案,与 Claude 相比表现好坏参半。
-
英国AI安全研究所:Mythos、GPT-5.5展示网络安全能力提升,但受限于token数量
英国AI安全研究所评估了新的AI模型,注意到Mythos和GPT-5.5在网络安全能力方面均有显著进步。研究人员发现,这些模型的上限受限于token使用而非内在能力,能力翻倍时间估计为4.5个月。另外,Palantir首席执行官Alex Karp批评德国不愿采用Palantir的软件,并敦促欧洲优先考虑已获验证的乌克兰国防技术。
-
Frontier models double reliability every 4.7 months, pushing benchmark limits
Frontier AI models are showing a rapid increase in their ability to handle complex tasks, with their reliability doubling every 4.7 months, a rate that has accelerated since late 2024. Recent models like Claude Mythos P…
-
英国AI研究所:Mythos、GPT-5.5展示出快速的网络安全能力提升
英国AI安全研究所发布了对近期AI模型的发现,指出Mythos和GPT-5.5在网络安全能力方面均取得了显著进展。研究人员发现难以确定这些模型的上限,表明它们的性能受限于token使用而非固有能力。报告还显示,这些AI系统的能力翻倍时间约为4.5个月。
-
OpenAI API Cost Tracking Guide Details Feature-Level Attribution
This guide explains how to manage OpenAI API costs by implementing a wrapper that tracks usage per feature, route, and customer. It details how to capture response usage, calculate costs in USD at the time of the reques…
-
百度Ernie 5.1预训练成本降低94%
百度发布了其大语言模型的新迭代Ernie 5.1,该模型将预训练成本显著降低了94%。这种效率是通过“Once-For-All”方法实现的,允许从单一训练过程中派生出更小的子模型。尽管与前代模型相比参数数量有所减少,Ernie 5.1仍表现出竞争力,在Search Arena排行榜上名列第四。
-
新基准测试 LLM 的数学文本续写能力
研究人员开发了一个新的自监督基准,用于评估语言模型在数学文本续写方面的能力。该基准使用可能性评分来评估模型的辅助预测字符串在多大程度上能够传递关于隐藏续写(例如显示方程的其余部分)的信息。对 GPT-5.5 和 Opus 4.7 等模型的测试表明,即使评分器经过微调以模拟快捷方式漏洞,它们也能区分模型家族和推理工作。研究结果表明,跨模型可能性评分是一种在进一步优化之前进行静态基准测试和探测快捷方式漏洞的可行方法。
-
Interfaze launches new model architecture for high-accuracy deterministic tasks
Interfaze has introduced a new model architecture designed for high accuracy and efficiency on deterministic tasks. This architecture reportedly outperforms leading models such as Gemini-3-Flash, Claude-Sonnet-4.6, GPT-…
-
Tencent Hunyuan 3.0 preview released with major architecture overhaul
Tencent's AI lab has released a preview version of its new model, Hunyuan 3.0, which marks a significant architectural overhaul focused on foundational elements. Led by Yao Shunyu, the team has prioritized data quality …
-
新AI方法通过专用分支和基于工具的证伪来解决零样本异常检测问题
研究人员开发了零样本异常检测的新方法,这是一种无需特定训练即可识别未知类别缺陷的技术。其中一种方法AVA-DINO利用了用于正常和异常模式的双专用分支,通过利用正常与异常数据的非对称分布来调整冻结的视觉特征。另一种方法AnomalyClaw将异常判断构建为一个多轮证伪过程,使用工具库与正常样本参考进行验证,提高了视觉语言模型在跨域异常检测中的可靠性。
-
Sony details AI strategy, Ultralytics showcases Vision AI, GPT-5.5 capabilities hinted
Sony has outlined its 'Growth through AI' initiative and its business strategy for fiscal year 2026, detailing its corporate direction for AI utilization. Meanwhile, Ultralytics showcased advancements in Vision AI and r…
-
Guide Explains Building Custom Chatbots with OpenAI's GPT 5.5
This article provides a guide on creating and deploying custom AI chatbots using OpenAI's GPT 5.5 model. It covers the process of building these specialized chatbots and explores potential monetization strategies for them.
-
GPT-5.5 edges out Claude Opus on intelligence benchmark
A recent analysis by Artificial Analysis indicates that GPT-5.5 has surpassed Claude Opus by three points on their intelligence benchmark. This benchmark evaluates models across categories like agents, coding, general k…
-
Mythos AI shows self-replication prowess amid measurement and governance debates
New reports indicate that the AI model Mythos demonstrates significant capabilities, particularly in self-replication tasks when given access to vulnerable systems. Discussions also highlight the challenges in accuratel…
-
Google clarifies Chrome AI privacy, processing remains on-device
Google has updated its privacy language for Chrome's AI features, assuring users that processing will occur on-device. This clarification aims to address concerns about data handling and user privacy as AI capabilities …
-
New tool FIVE filters LLM input to prevent character drift
A new open-source project called FIVE has been developed to address character drift in LLM-powered applications. Instead of relying on traditional system prompts or fine-tuning, FIVE filters user input using cognitive p…
-
GPT-5.5 pricing climbs as token efficiency gains emerge
The Register reports that GPT-5.5, while potentially more efficient with tokens, comes with increased operational costs. This suggests that the pricing for advanced frontier models continues to rise, impacting the overa…
-
开发者借助 AI 助手构建自定义 macOS 电子邮件客户端
一位开发者构建了一款自定义的 macOS 电子邮件客户端,旨在提供类似 Superhuman 的精简体验,但拥有更多功能控制权。该应用最初使用 Codex 开发,后来使用 Factory 进行优化,利用 Gmail API 实现标签和过滤等核心功能。主要功能包括拆分收件箱、规则、命令面板和撤销发送选项,并专注于通过优化 API 调用和实现后台数据刷新来消除延迟,从而提高性能。
-
Gemini 2.5 Flash 在LLM编码测试中领先,表现优于GPT-5.5
最近对五种大型语言模型在真实编码任务上的测试显示,Gemini 2.5 Flash 是最具性价比的选择,以0.008美元的总成本在所有十项任务中均获得满分。Claude Sonnet 4紧随其后,是最可靠的选择,零失败,两次部分成功,成本略高。GPT-5.5虽然在推理方面表现强劲,但在简洁代码生成方面遇到困难,因过于冗长而导致四项任务失败。
-
Baidu's Wenxin 5.1 leads China in search, slashes training costs
Baidu has released its new large language model, Wenxin 5.1, which significantly enhances search, knowledge, and AI agent capabilities. The model achieves leading domestic search performance and surpasses DeepSeek-V4-Pr…