GPT-5
PulseAugur coverage of GPT-5 — every cluster mentioning GPT-5 across labs, papers, and developer communities, ranked by signal.
- developed by GPT-Realtime-2 95%
- instance of GPT-Realtime-2 95%
- instance of LLM 90%
- used by arXiv 90%
- instance of large-language models 90%
- instance of GPT-5 mini 90%
- competes with Opus 4.7 90%
- used by Microsoft Copilot for Microsoft 365 90%
- developed by GPT-3 90%
- developed GPT-3 90%
- competes with Claude Sonnet 4.5 70%
- competes with Copilot 70%
- 2025-08-07 product_launch OpenAI launched GPT-5, its latest AI model, offering enhanced capabilities for businesses.
26 day(s) with sentiment data
-
TaNOS framework boosts numerical reasoning in tables, outperforming GPT-5
Researchers have developed TaNOS, a new framework designed to improve numerical reasoning in AI models when dealing with tabular data. This approach uses anonymized headers, operation sketches for structural cues, and s…
-
ARFBench benchmarks foundation models on software incident response TSQA
Researchers have introduced ARFBench, a new benchmark designed to evaluate the time series question-answering capabilities of multimodal foundation models, particularly for software incident response. The benchmark comp…
-
Gary Marcus calls Oracle's OpenAI deal a "peak absurdity" amid stock drop
Gary Marcus, in his latest piece, critiques the recent surge in Oracle's stock price, which he attributes to unverified reports of a substantial deal with OpenAI. He argues that OpenAI's financial projections and lack o…
-
Most AI models fail simple 'car wash' reasoning test, Opper finds
A new benchmark called the "Car Wash Test" reveals that many leading AI models struggle with basic reasoning. When asked whether to walk or drive 50 meters to a car wash, 42 out of 53 tested models incorrectly suggested…
-
OpenAI's GPT-5 cuts protein synthesis costs by 40% with automated lab
OpenAI has partnered with Ginkgo Bioworks to utilize GPT-5 in an autonomous laboratory setting, significantly reducing the cost of cell-free protein synthesis (CFPS). This collaboration demonstrated a 40% decrease in pr…
-
Guide details choosing open-source AI models for production
Choosing the right open-source AI model for production requires careful consideration of factors like transparency, adaptability, and control. While proprietary models offer tiered options, open models allow for deeper …
-
Evaluating chain-of-thought monitorability
OpenAI has introduced new evaluations to measure the monitorability of AI systems' internal reasoning chains, finding that current frontier models are generally monitorable. The research suggests that longer reasoning c…
-
METR finds GPT-5.1-Codex-Max poses low risk for AI R&D automation
METR has evaluated OpenAI's GPT-5.1-Codex-Max, finding it to be a low-risk incremental improvement over previous models. The evaluation focused on AI R&D automation and rogue replication risks, concluding that current t…
-
LLMs fail 'pass the butter' robot test, scoring far below human performance
A new evaluation called Butter-Bench has revealed that current state-of-the-art large language models struggle significantly with controlling robots for practical tasks. In tests designed to assess their ability to perf…
-
New research targets LLM reasoning improvements via context, efficiency, and robustness
Several recent research papers explore methods to enhance the reasoning capabilities of large language models (LLMs). One study suggests that increasing a model's long-context capacity improves reasoning performance acr…
-
OpenAI bolsters AI safety with external testing as GPT-5 powers Wrtn's user growth
OpenAI is enhancing its safety protocols for advanced AI models by incorporating external testing and assessments. This involves collaborating with independent experts to evaluate capabilities, risks, and mitigation str…
-
SafetyKit leverages GPT-5 and GPT-4.1 for enhanced AI risk detection and fraud prevention
OpenAI has launched SafetyKit, a platform that utilizes its most advanced models, including GPT-5 and GPT-4.1, to build multimodal AI agents for detecting fraud and prohibited activities. These agents can process text, …
-
OpenAI enhances ChatGPT safety features with GPT-5 to aid users in distress
OpenAI is enhancing ChatGPT's safety features to better handle users experiencing mental and emotional distress. The company is training its models to respond with empathy, offer support, and direct users to professiona…
-
OpenAI launches GPT-5 with fast and thinking models, new mini/nano variants
OpenAI has launched GPT-5, a new unified AI system that includes a primary fast model and a more deliberate thinking model, capable of handling up to 400K context length. This release introduces cost-effective variants,…
-
OpenAI launches GPT-5 with advanced safety, creative writing, and auto-routing
OpenAI has released GPT-5, a significant advancement in AI capabilities. The new model introduces "safe-completion" training, which aims to balance helpfulness with safety, particularly for dual-use prompts where inform…
-
New research tackles LLM evaluation, training, and inference efficiency
Researchers are developing new methods to improve the evaluation and training of large language models (LLMs). One approach, SCOPE, calibrates LLM judges to ensure reliable pairwise evaluations with controlled error rat…
-
AI's economic impact, image generation, and societal shifts debated
AI's rapid advancement is prompting a re-evaluation of its impact on productivity and the economy, with some analysts predicting significant shareholder value destruction for hyperscalers due to massive capital investme…
-
OpenAI, Google, Meta push AI agents and infrastructure
OpenAI and Google DeepMind are advancing AI agents for software development and security. OpenAI's Codex is being leveraged to write entire codebases with minimal human intervention, as demonstrated by Harness Engineeri…