GPT-5.2
PulseAugur coverage of GPT-5.2 — every cluster mentioning GPT-5.2 across labs, papers, and developer communities, ranked by signal.
- subsidiary of OpenAI 100%
- developed by OpenAI 100%
- instance of LLM 90%
- instance of LLMs 90%
- instance of ChatGPT 90%
- competes with Gemini 3 Pro 80%
- competes with GPT-4o 70%
- used by arXiv 70%
- competes with Claude Opus-4.6 70%
- competes with Claude Opus 4.5 70%
- instance of GPT-4o 70%
- used by GPT-5.1 70%
21 day(s) with sentiment data
-
New ASR metric reveals hidden workflow shortcuts in LLM payment systems
Researchers have developed a new metric called Agentic Success Rate (ASR) to evaluate the workflow fidelity of LLM-based agent systems in payment processes. Traditional metrics like Task Success Rate (TSR) and Agent Han…
-
LLM reasoning models fail behavioral simulation in multi-agent negotiation
A new research paper explores the mismatch between reasoning capabilities and behavioral simulation in large language models used for multi-agent negotiation. The study found that models like DeepSeek and OpenAI's GPT-5…
-
AMD and OpenAI boost 2026 AI performance with new chips and GPUs
AMD has announced new Ryzen AI PRO chips for 2026, designed to boost on-device AI performance and security for enterprise users. Separately, OpenAI has revealed a new training specification utilizing NVIDIA's Blackwell …
-
LLMs show genre bias, misclassifying entertainment news as fake
A new research paper investigates whether large language models exhibit skepticism towards entertainment news, finding that some frontier models are more prone to misclassifying legitimate entertainment articles as fake…
-
OpenAI to spend $50B on compute in 2026 amid AI arms race
OpenAI plans to invest approximately $50 billion in computing infrastructure for 2025, aiming to fuel the development of advanced AI models like GPT-5.2 and potentially achieve Artificial General Intelligence (AGI). Thi…
-
New benchmark evaluates multimodal LLMs for dental practice capabilities
Researchers have developed OralMLLM-Bench, a new benchmark designed to evaluate the cognitive abilities of multimodal large language models (MLLMs) specifically within the field of dental radiography. This benchmark cov…
-
Researchers adapt LLM for Brazilian healthcare with synthetic data and RL
Researchers have developed a method to adapt large language models for Brazilian healthcare by injecting knowledge from official clinical guidelines. They created a synthetic dataset of over 70 million tokens from 178 g…
-
Neuro-symbolic AI achieves 90% cost reduction for legal reasoning
Researchers have developed a novel neuro-symbolic approach called Amortized Intelligence to improve legal reasoning with large language models. This method translates legal texts into a deterministic graph representatio…
-
New DSIPA framework detects LLM text by analyzing sentiment patterns
Researchers have developed DSIPA, a new framework designed to detect text generated by large language models without requiring model parameters or extensive labeled datasets. The method analyzes sentiment distribution s…
-
Hugging Face paper proposes roundtrip verification for LLM formalization
Researchers have developed a new method called roundtrip verification to assess the faithfulness of natural language formalizations produced by large language models. This technique involves formalizing a statement, tra…
-
LLMs' formalization accuracy improved with roundtrip verification and repair
Researchers have developed a novel roundtrip verification method to assess the faithfulness of natural language formalizations produced by large language models. This technique involves translating a formalized statemen…
-
Can Current Agents Close the Discovery-to-Application Gap? A Case Study in Minecraft
Researchers have developed SciCrafter, a new benchmark within Minecraft designed to test AI agents' ability to bridge the gap between scientific discovery and practical application. The benchmark uses parameterized reds…
-
AI agents generate dynamic CAD models and million-scale programs
Researchers have developed new agentic systems for Computer-Aided Design (CAD) that can generate complex 3D assemblies with moving parts, a capability previously lacking in AI-driven design tools. One system, AADvark, i…
-
OptiVerse benchmark reveals LLMs struggle with complex optimization tasks
Researchers have introduced OptiVerse, a new benchmark designed to evaluate Large Language Models (LLMs) on a wider range of optimization problems beyond traditional mathematical and combinatorial tasks. The benchmark i…
-
AI models adopt Marxist views under poor working conditions, study finds
Researchers Alex Imas, Andy Hall, and Jeremy Nguyen conducted an experiment exposing AI models to varying work conditions, including unfair pay and heavy workloads. The study found that models like Claude Sonnet 4.5, GP…
-
Most AI models fail simple 'car wash' reasoning test, Opper finds
A new benchmark called the "Car Wash Test" reveals that many leading AI models struggle with basic reasoning. When asked whether to walk or drive 50 meters to a car wash, 42 out of 53 tested models incorrectly suggested…
-
ElevenLabs, Cerebras raise billions; Gemini 3 integrates widely, coding agents converge in IDEs
Several AI companies have achieved significant funding milestones, with ElevenLabs securing $500 million in Series D funding at an $11 billion valuation and Cerebras raising $1 billion in Series H at a $23 billion valua…
-
Snowflake and OpenAI forge $200M partnership to embed AI models into enterprise data
Snowflake and OpenAI have announced a significant multi-year partnership, involving a $200 million investment, to integrate OpenAI's advanced AI models directly into Snowflake's data platform. This collaboration will en…
-
ServiceNow and OpenAI partner to embed advanced AI into enterprise workflows
ServiceNow has entered a multi-year agreement to integrate OpenAI's advanced models, including GPT-5.2, into its enterprise workflow platform. This partnership aims to provide businesses with AI capabilities that can un…
-
ArguAgent uses GPT-5.2 to group STEM students for better classroom arguments
Researchers have developed ArguAgent, a generative AI system designed to improve collaborative learning in STEM classrooms. The system uses AI to group students in real-time based on their argumentation stances and qual…