PulseAugur
LIVE 09:16:11
research · [1 source] ·
0
research

LLM API prices plummet for top models, but Anthropic's Haiku tier rises

The LLM API pricing landscape has seen significant shifts in Q1-Q2 2026, with major providers like OpenAI and xAI drastically reducing costs for their flagship models. OpenAI's o3, for instance, dropped 80% to $2/$8 per million tokens, while xAI's Grok 4.3 saw an 83% output cost reduction. Google also introduced more competitive pricing with its Gemini 2.5 Flash. However, not all changes were downward; Anthropic's Haiku 3 retirement led to a 3-4x cost increase for users who did not migrate to newer versions. AI

Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →

IMPACT Accelerates adoption of LLMs for production workloads by making inference significantly cheaper across the board, while highlighting the need for continuous cost monitoring.

RANK_REASON Article details significant price changes across multiple major LLM providers, impacting production workloads and cost-effectiveness. [lever_c_demoted from significant: ic=1 ai=0.7]

Read on dev.to — LLM tag →

COVERAGE [1]

  1. dev.to — LLM tag TIER_1 · Dave Graham ·

    LLM API Pricing Trends Q2 2026 — Who Got Cheaper, Who Got Expensive

    <p>The LLM market has repriced dramatically since early 2025. Frontier intelligence that cost $10/M input tokens 18 months ago now runs $1–3/M. Budget tiers have hit $0.10/M. But not every direction is down — Anthropic's budget tier got more expensive when Haiku 3 retired. Here's…