Despite a dramatic decrease in the cost per token for AI models, many companies are experiencing rising AI expenditures. This paradox stems from the increased usage of AI, with complex agentic workflows now requiring numerous model calls per task, significantly inflating the total number of tokens processed. Additionally, techniques like retrieval-augmented generation and the deployment of always-on AI agents further contribute to higher overall bills, mirroring historical patterns of efficiency gains in computing leading to greater adoption and usage. AI
IMPACT Increased AI adoption and complex workflows are driving up operational costs despite falling per-token prices.
RANK_REASON Article discusses economic trends and usage patterns of AI models rather than a specific release or event.
- Andreessen Horowitz
- Anthropic
- Claude Opus 4.5
- Claude Opus 4.6
- Claude Opus 4.7
- Claude Opus 4.8
- DeepSeek-R1
- DeepSeek V4-Pro
- epoch.ai
- Gemini 3.5 Flash
- GPT-3.5
- GPT-5.4 Nano
- OpenAI
- Stanford University
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →