Ranked AI models by what people actually use instead of benchmark scores - the benchmark champion barely makes the top 20

GPT-5 leads AI model usage ranking, ahead of the benchmark champion

By the PulseAugur editorial team · [1 source] · 2026-05-25 12:28

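A new ranking system based on actual user adoption and discussion, rather than benchmark scores alone, reveals marked differences in how popular AI models are. Although newer releases such as GPT-5.5 and Gemini 3.1 Pro score higher on benchmarks, GPT-5 still ranks first by usage. The data suggests that cost, speed, and availability strongly shape user choice, so users often pick less capable but more accessible models, such as Google's Flash Lite, over the top benchmark performers.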

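Impact: Highlights the disconnect between benchmark performance and real-world AI model adoption, and underscores cost and speed as the main drivers for users.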

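Ranking rationale: This cluster analyzes and ranks AI models by user adoption and discussion; it is not a direct release or benchmark announcement.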


AI-generated summary · Google Gemini · from 1 source.

Sources [1]

r/singularity TIER_2 · /u/Celestialien · 2026-05-25 12:28

Ranked AI models by what people actually use instead of benchmark scores - the benchmark champion barely makes the top 20

Most model leaderboards are just benchmark scores. I've been building one that ranks by real usage instead - how much each model is actually being run and talked about, plus cost and speed - and the order comes out almost unrecognisable. A few th…

