Platforms Cross Below Neoclouds for the First Time in 2026 as Cached Pricing Diverges 59 Points Between Frontier and Platform Channels YTD
Third-party AI inference platforms have begun pricing text models, on average, below direct cloud provider offerings, a reversal from earlier in the year. The change is driven by platforms adding cheaper open-weight models and expanding cached pricing tiers, while direct cloud providers' pricing has remained relatively stable. The divergence in cached pricing between frontier and platform channels has reached its widest point this year, indicating a significant restructuring of the AI model pricing landscape.
AI IMPACT: This pricing shift could influence how developers choose to deploy AI models, potentially favoring third-party platforms for cost-sensitive text-based applications.