PulseAugur
EN
LIVE 01:26:54

Meta's Llama 4 Scout claims 10M context but struggles with comprehension

A new AI model, Llama 4 Scout, has been announced with a claimed 10 million token context window, significantly larger than existing models from OpenAI, Anthropic, and Google. This model utilizes a Mixture-of-Experts architecture and interleaved Rotary Position Embeddings (iRoPE) to manage its extensive context length and is priced affordably. However, real-world testing reveals limitations, with the practical context window capped at 327,680 tokens on hosted platforms and comprehension significantly degrading beyond approximately 256,000 tokens, making it more of a search index than a reasoning partner at its full claimed capacity. AI

IMPACT Challenges existing long-context models and pricing, but practical limitations may temper its impact.

RANK_REASON New model release with a significant claimed capability increase. [lever_c_demoted from frontier_release: ic=1 ai=1.0]

Read on dev.to — LLM tag →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. dev.to — LLM tag TIER_1 English(EN) · Kimachin ·

    I found this Massive 10M Context Window AI Model

    <p>A few months ago, I got tired of manually checking which AI model had the longest context window. Every week, some provider would quietly update a model card, or a new release would drop with a bigger number, and the leaderboard would shift without anyone noticing.</p> <p>So I…