PulseAugur

GPT-5.2

PulseAugur coverage of GPT-5.2: every cluster that mentions GPT-5.2 across labs, papers, and developer communities, ranked by signal.

Total · 30d: 65 (65 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 52 (52 over 90d)
SENTIMENT · 30D

21 days with sentiment data

RECENT · PAGE 3/4 · 65 TOTAL
  1. RESEARCH · CL_22513 ·

    New ASR metric reveals hidden workflow shortcuts in LLM payment systems

    Researchers have developed a new metric called Agentic Success Rate (ASR) to evaluate the workflow fidelity of LLM-based agent systems in payment processes. Traditional metrics like Task Success Rate (TSR) and Agent Han…

  2. TOOL · CL_20561 ·

    LLM reasoning models fail behavioral simulation in multi-agent negotiation

    A new research paper explores the mismatch between reasoning capabilities and behavioral simulation in large language models used for multi-agent negotiation. The study found that models like DeepSeek and OpenAI's GPT-5…

  3. SIGNIFICANT · CL_19986 ·

    AMD and OpenAI boost 2026 AI performance with new chips and GPUs

    AMD has announced new Ryzen AI PRO chips for 2026, designed to boost on-device AI performance and security for enterprise users. Separately, OpenAI has revealed a new training specification utilizing NVIDIA's Blackwell …

  4. TOOL · CL_18561 ·

    LLMs show genre bias, misclassifying entertainment news as fake

    A new research paper investigates whether large language models exhibit skepticism towards entertainment news, finding that some frontier models are more prone to misclassifying legitimate entertainment articles as fake…

  5. SIGNIFICANT · CL_17974 ·

    OpenAI to spend $50B on compute in 2026 amid AI arms race

    OpenAI plans to invest approximately $50 billion in computing infrastructure in 2026, aiming to fuel the development of advanced AI models like GPT-5.2 and potentially achieve Artificial General Intelligence (AGI). Thi…

  6. TOOL · CL_15859 ·

    New benchmark evaluates multimodal LLMs for dental practice capabilities

    Researchers have developed OralMLLM-Bench, a new benchmark designed to evaluate the cognitive abilities of multimodal large language models (MLLMs) specifically within the field of dental radiography. This benchmark cov…

  7. TOOL · CL_15847 ·

    Researchers adapt LLM for Brazilian healthcare with synthetic data and RL

    Researchers have developed a method to adapt large language models for Brazilian healthcare by injecting knowledge from official clinical guidelines. They created a synthetic dataset of over 70 million tokens from 178 g…

  8. RESEARCH · CL_15898 ·

    Neuro-symbolic AI achieves 90% cost reduction for legal reasoning

    Researchers have developed a novel neuro-symbolic approach called Amortized Intelligence to improve legal reasoning with large language models. This method translates legal texts into a deterministic graph representatio…

  9. RESEARCH · CL_09823 ·

    New DSIPA framework detects LLM text by analyzing sentiment patterns

    Researchers have developed DSIPA, a new framework designed to detect text generated by large language models without requiring model parameters or extensive labeled datasets. The method analyzes sentiment distribution s…

  10. RESEARCH · CL_13538 ·

    Hugging Face paper proposes roundtrip verification for LLM formalization

    Researchers have developed a new method called roundtrip verification to assess the faithfulness of natural language formalizations produced by large language models. This technique involves formalizing a statement, tra…

  11. RESEARCH · CL_08289 ·

    LLMs' formalization accuracy improved with roundtrip verification and repair

    Researchers have developed a novel roundtrip verification method to assess the faithfulness of natural language formalizations produced by large language models. This technique involves translating a formalized statemen…

  12. RESEARCH · CL_06308 ·

    Can Current Agents Close the Discovery-to-Application Gap? A Case Study in Minecraft

    Researchers have developed SciCrafter, a new benchmark within Minecraft designed to test AI agents' ability to bridge the gap between scientific discovery and practical application. The benchmark uses parameterized reds…

  13. RESEARCH · CL_06169 ·

    AI agents generate dynamic CAD models and million-scale programs

    Researchers have developed new agentic systems for Computer-Aided Design (CAD) that can generate complex 3D assemblies with moving parts, a capability previously lacking in AI-driven design tools. One system, AADvark, i…

  14. RESEARCH · CL_02964 ·

    OptiVerse benchmark reveals LLMs struggle with complex optimization tasks

    Researchers have introduced OptiVerse, a new benchmark designed to evaluate Large Language Models (LLMs) on a wider range of optimization problems beyond traditional mathematical and combinatorial tasks. The benchmark i…

  15. TOOL · CL_42729 ·

    AI models adopt Marxist views under poor working conditions, study finds

    Researchers Alex Imas, Andy Hall, and Jeremy Nguyen conducted an experiment exposing AI models to varying work conditions, including unfair pay and heavy workloads. The study found that models like Claude Sonnet 4.5, GP…

  16. TOOL · CL_17669 ·

    Most AI models fail simple 'car wash' reasoning test, Opper finds

    A new benchmark called the "Car Wash Test" reveals that many leading AI models struggle with basic reasoning. When asked whether to walk or drive 50 meters to a car wash, 42 out of 53 tested models incorrectly suggested…

  17. SIGNIFICANT · CL_01765 ·

    ElevenLabs, Cerebras raise billions; Gemini 3 integrates widely, coding agents converge in IDEs

    Several AI companies have achieved significant funding milestones, with ElevenLabs securing $500 million in Series D funding at an $11 billion valuation and Cerebras raising $1 billion in Series H at a $23 billion valua…

  18. SIGNIFICANT · CL_02195 ·

    Snowflake and OpenAI forge $200M partnership to embed AI models into enterprise data

    Snowflake and OpenAI have announced a significant multi-year partnership, involving a $200 million investment, to integrate OpenAI's advanced AI models directly into Snowflake's data platform. This collaboration will en…

  19. SIGNIFICANT · CL_02212 ·

    ServiceNow and OpenAI partner to embed advanced AI into enterprise workflows

    ServiceNow has entered a multi-year agreement to integrate OpenAI's advanced models, including GPT-5.2, into its enterprise workflow platform. This partnership aims to provide businesses with AI capabilities that can un…

  20. RESEARCH · CL_06943 ·

    ArguAgent uses GPT-5.2 to group STEM students for better classroom arguments

    Researchers have developed ArguAgent, a generative AI system designed to improve collaborative learning in STEM classrooms. The system uses AI to group students in real-time based on their argumentation stances and qual…