PulseAugur
EN
LIVE 03:19:12

OpenAI reportedly slashes ChatGPT inference costs by over half

OpenAI has reportedly achieved significant cost reductions in its AI model inference, cutting expenses by over half. These optimizations have been applied to ChatGPT, with the company at times requiring only a few hundred Nvidia GPUs for its operations. This efficiency improvement could allow OpenAI to serve more users or reallocate resources. AI

IMPACT Potential for increased accessibility or improved service quality for ChatGPT users due to reduced operational costs.

RANK_REASON Report on cost-saving measures for an existing product, not a new release or major industry shift.

Read on The Decoder →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

OpenAI reportedly slashes ChatGPT inference costs by over half

COVERAGE [1]

  1. The Decoder TIER_1 English(EN) · Matthias Bastian ·

    OpenAI reportedly cut response costs for guest ChatGPT users by more than half

    <p><img alt="" class="attachment-full size-full wp-post-image" height="1152" src="https://the-decoder.com/wp-content/uploads/2026/06/openai_logo_background_dark.png" style="height: auto; margin-bottom: 10px;" width="2048" /></p> <p> According to a report by The Information, OpenA…