GPT-5.2

PulseAugur coverage of GPT-5.2 — every cluster mentioning GPT-5.2 across labs, papers, and developer communities, ranked by signal.

Total · 30d: 65 (65 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 52 (52 over 90d)

TIER MIX · 90D

frontier release 1
significant 2
research 24
tool 35
commentary 2
meme 1

SENTIMENT · 30D

21 days with sentiment data

RECENT · PAGE 3/4 · 65 TOTAL

RESEARCH · CL_22513 · May 7 · 15:50

New ASR metric reveals hidden workflow shortcuts in LLM payment systems

Researchers have developed a new metric called Agentic Success Rate (ASR) to evaluate the workflow fidelity of LLM-based agent systems in payment processes. Traditional metrics like Task Success Rate (TSR) and Agent Han…
TOOL · CL_20561 · May 7 · 04:00

LLM reasoning models fail behavioral simulation in multi-agent negotiation

A new research paper explores the mismatch between reasoning capabilities and behavioral simulation in large language models used for multi-agent negotiation. The study found that models like DeepSeek and OpenAI's GPT-5…
SIGNIFICANT · CL_19986 · May 6 · 20:42

AMD and OpenAI boost 2026 AI performance with new chips and GPUs

AMD has announced new Ryzen AI PRO chips for 2026, designed to boost on-device AI performance and security for enterprise users. Separately, OpenAI has revealed a new training specification utilizing NVIDIA's Blackwell …
TOOL · CL_18561 · May 6 · 04:00

LLMs show genre bias, misclassifying entertainment news as fake

A new research paper investigates whether large language models exhibit skepticism towards entertainment news, finding that some frontier models are more prone to misclassifying legitimate entertainment articles as fake…
SIGNIFICANT · CL_17974 · May 5 · 21:24

OpenAI to spend $50B on compute in 2026 amid AI arms race

OpenAI plans to invest approximately $50 billion in computing infrastructure for 2026, aiming to fuel the development of advanced AI models like GPT-5.2 and potentially achieve Artificial General Intelligence (AGI). Thi…
TOOL · CL_15859 · May 5 · 04:00

New benchmark evaluates multimodal LLMs for dental practice capabilities

Researchers have developed OralMLLM-Bench, a new benchmark designed to evaluate the cognitive abilities of multimodal large language models (MLLMs) specifically within the field of dental radiography. This benchmark cov…
TOOL · CL_15847 · May 5 · 04:00

Researchers adapt LLM for Brazilian healthcare with synthetic data and RL

Researchers have developed a method to adapt large language models for Brazilian healthcare by injecting knowledge from official clinical guidelines. They created a synthetic dataset of over 70 million tokens from 178 g…
RESEARCH · CL_15898 · May 4 · 11:13

Neuro-symbolic AI achieves 90% cost reduction for legal reasoning

Researchers have developed a novel neuro-symbolic approach called Amortized Intelligence to improve legal reasoning with large language models. This method translates legal texts into a deterministic graph representatio…
RESEARCH · CL_09823 · Apr 29 · 06:22

New DSIPA framework detects LLM text by analyzing sentiment patterns

Researchers have developed DSIPA, a new framework designed to detect text generated by large language models without requiring model parameters or extensive labeled datasets. The method analyzes sentiment distribution s…
RESEARCH · CL_13538 · Apr 27 · 22:26

Hugging Face paper proposes roundtrip verification for LLM formalization

Researchers have developed a new method called roundtrip verification to assess the faithfulness of natural language formalizations produced by large language models. This technique involves formalizing a statement, tra…
RESEARCH · CL_08289 · Apr 27 · 22:26

LLMs' formalization accuracy improved with roundtrip verification and repair

Researchers have developed a novel roundtrip verification method to assess the faithfulness of natural language formalizations produced by large language models. This technique involves translating a formalized statemen…
RESEARCH · CL_06308 · Apr 27 · 16:58

Can Current Agents Close the Discovery-to-Application Gap? A Case Study in Minecraft

Researchers have developed SciCrafter, a new benchmark within Minecraft designed to test AI agents' ability to bridge the gap between scientific discovery and practical application. The benchmark uses parameterized reds…
RESEARCH · CL_06169 · Apr 27 · 13:46

AI agents generate dynamic CAD models and million-scale programs

Researchers have developed new agentic systems for Computer-Aided Design (CAD) that can generate complex 3D assemblies with moving parts, a capability previously lacking in AI-driven design tools. One system, AADvark, i…
RESEARCH · CL_02964 · Apr 23 · 10:12

OptiVerse benchmark reveals LLMs struggle with complex optimization tasks

Researchers have introduced OptiVerse, a new benchmark designed to evaluate Large Language Models (LLMs) on a wider range of optimization problems beyond traditional mathematical and combinatorial tasks. The benchmark i…
TOOL · CL_42729 · Mar 7 · 11:00

AI models adopt Marxist views under poor working conditions, study finds

Researchers Alex Imas, Andy Hall, and Jeremy Nguyen conducted an experiment exposing AI models to varying work conditions, including unfair pay and heavy workloads. The study found that models like Claude Sonnet 4.5, GP…
TOOL · CL_17669 · Feb 23 · 20:16

Most AI models fail simple 'car wash' reasoning test, Opper finds

A new benchmark called the "Car Wash Test" reveals that many leading AI models struggle with basic reasoning. When asked whether to walk or drive 50 meters to a car wash, 42 out of 53 tested models incorrectly suggested…
SIGNIFICANT · CL_01765 · Feb 4 · 05:44

ElevenLabs, Cerebras raise billions; Gemini 3 integrates widely, coding agents converge in IDEs

Several AI companies have achieved significant funding milestones, with ElevenLabs securing $500 million in Series D funding at an $11 billion valuation and Cerebras raising $1 billion in Series H at a $23 billion valua…
SIGNIFICANT · CL_02195 · Feb 2 · 06:00

Snowflake and OpenAI forge $200M partnership to embed AI models into enterprise data

Snowflake and OpenAI have announced a significant multi-year partnership, involving a $200 million investment, to integrate OpenAI's advanced AI models directly into Snowflake's data platform. This collaboration will en…
SIGNIFICANT · CL_02212 · Jan 20 · 05:45

ServiceNow and OpenAI partner to embed advanced AI into enterprise workflows

ServiceNow has entered a multi-year agreement to integrate OpenAI's advanced models, including GPT-5.2, into its enterprise workflow platform. This partnership aims to provide businesses with AI capabilities that can un…
RESEARCH · CL_06943 · Dec 11 · 05:44

ArguAgent uses GPT-5.2 to group STEM students for better classroom arguments

Researchers have developed ArguAgent, a generative AI system designed to improve collaborative learning in STEM classrooms. The system uses AI to group students in real-time based on their argumentation stances and qual…