PulseAugur

DeepSWE benchmark costs revealed: GPT-5.5 and Mimo V2.5 pricing detailed

A user on Reddit's r/singularity pointed out that the cost shown on the DeepSWE benchmark is per task, not the total for a full run. By the user's figures, a full run costs about $225 for Mimo V2.5 Pro (listed at $1.99 per task) and about $264 for GPT-5.5 medium. Based on early results, the user projected that Mimo V2.5 (non-pro) would cost about $7.15 for a complete run.
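The per-task versus full-run distinction comes down to one multiplication. The sketch below is illustrative only: the summary does not state how many tasks DeepSWE contains, so the task count is back-computed from the two Mimo V2.5 Pro figures quoted here ($1.99 per task, about $225 per run) and should be treated as an assumption, not an official number. The function and constant names are hypothetical.

```python
# Minimal sketch: converting a per-task price into a full-run cost.
# ASSUMPTION: the task count is not given in the source; it is inferred
# from the quoted Mimo V2.5 Pro figures and may not match the real benchmark.

def implied_task_count(total_usd: float, per_task_usd: float) -> float:
    """Back out how many tasks a full run implies from the two quoted prices."""
    return total_usd / per_task_usd


def total_run_cost(per_task_usd: float, num_tasks: int) -> float:
    """Full-run cost in USD: per-task price times number of tasks."""
    return round(per_task_usd * num_tasks, 2)


MIMO_PRO_PER_TASK_USD = 1.99   # per-task price quoted in the Reddit post
MIMO_PRO_FULL_RUN_USD = 225.0  # approximate full-run cost quoted in the summary

# Inferred, not stated in the source.
approx_tasks = round(implied_task_count(MIMO_PRO_FULL_RUN_USD, MIMO_PRO_PER_TASK_USD))

if __name__ == "__main__":
    print(f"Implied task count: ~{approx_tasks}")
    print(f"Full run at $1.99/task: ${total_run_cost(MIMO_PRO_PER_TASK_USD, approx_tasks)}")
```

Under the same inferred task count, the other full-run figures in the summary can be divided back into rough per-task prices, which makes models easier to compare with leaderboards that report totals.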

IMPACT Provides cost insights for researchers and developers using AI models for benchmarks, influencing tool selection and budget planning.

RANK_REASON User-generated analysis of benchmark costs, not a primary release or official evaluation.

Read on r/singularity →

AI-generated summary · Google Gemini · from 1 source.

COVERAGE [1]

  1. r/singularity TIER_2 English(EN) · /u/pneuny

    Heads up for DeepSWE benchmark: The cost is measured per task, not the total run.

    I was running the DeepSWE benchmark and saw Mimo V2.5 Pro at $1.99 and figured running Mimo V2.5 (non-pro) would be cheaper than $1.99. But it's not like Artificial Analysis, which measures the total amount; you need to multiply that …