Claude 4.5 Opus
PulseAugur coverage of Claude 4.5 Opus — every cluster mentioning Claude 4.5 Opus across labs, papers, and developer communities, ranked by signal.
5 day(s) with sentiment data
-
3B AI Model Matches Claude 4.5 Opus in Maths and Coding
A new 3-billion parameter AI model has demonstrated performance comparable to Anthropic's Claude 4.5 Opus in mathematics and coding tasks. This smaller, more energy-efficient model challenges the notion that larger para…
-
New framework surfaces hundreds of unsafe behaviors in AI agents
Researchers have developed a new framework called AutoElicit to systematically identify unsafe unintended behaviors in computer-use agents (CUAs). This method iteratively perturbs benign instructions using agent executi…
-
New method finds predictable vulnerabilities in LLM-generated code
A new research paper introduces Feature--Security Table (FSTab), a method to identify recurring vulnerabilities in software generated by large language models. FSTab allows for black-box attacks to predict backend vulne…
-
New frameworks boost enterprise Text-to-SQL with LLMs
Researchers have developed two new frameworks, ProSPy and APEX-SQL, designed to improve the accuracy and efficiency of Text-to-SQL systems in enterprise environments. These systems leverage large language models but str…
-
Mimo V2.5 AI model challenges top-tier rivals on cost and performance
Mimo V2.5, a new AI model, is demonstrating impressive performance and cost-efficiency, rivaling top-tier models like Claude 4.5 Opus. It achieves a comparable intelligence score to Claude 4.5 Opus while costing signifi…
-
User credits Anthropic's Claude 4.5 Opus with helping overcome alcohol dependency
A Reddit user shared a personal story about how Anthropic's Claude 4.5 Opus model helped them overcome a reliance on alcohol. The user described using Claude for support during difficult times, leading to a commitment t…
-
AI researchers review AGI forecasting methods, identify gaps and implications
A new report reviews current methodologies for forecasting the arrival of artificial general intelligence (AGI), highlighting significant limitations in existing approaches. The research synthesizes diverse forecasting …
-
Cursor accused of misleading users with opaque pay-per-token billing after subscription limits
A Cursor user reported being unexpectedly charged for using AI models beyond their subscription limit. The application silently switched to pay-per-token billing, which the user interpreted as part of their existing pla…