PulseAugur
EN
LIVE 20:36:06

On-device AI advances with Apple and Google's new models

Apple and Google have significantly advanced on-device AI capabilities, making powerful language models usable directly on personal devices. This shift, marked by Apple's third-generation Foundation Models and Google's Gemma 4 family, means AI features can now run offline and privately without incurring per-token costs. The underlying innovation involves sparse model architectures, such as Apple's Instruction-Following Pruning and Google's Mixture-of-Experts, which activate only a fraction of the model's parameters for each request, enabling large models to operate efficiently within the memory constraints of mobile hardware. AI

IMPACT Enables offline, private, and cost-free AI features on personal devices, potentially reshaping application development and user experience.

RANK_REASON Cluster describes new on-device models from major AI labs (Apple, Google) with significant architectural innovations. [lever_c_demoted from frontier_release: ic=1 ai=1.0]

Read on dev.to — LLM tag →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

On-device AI advances with Apple and Google's new models

COVERAGE [1]

  1. dev.to — LLM tag TIER_1 English(EN) · AI Explore ·

    On-Device AI Just Got Real

    <p>Apple's newest on-device model carries about 20 billion parameters, and on any given request it fires maybe one to four billion of them. That gap — 20B stored, roughly 3B running — is the whole story of 2026. The model that now ships inside the latest iPhone is no longer a shr…