实体
tokens per second
tokens per second
PulseAugur coverage of tokens per second — every cluster mentioning tokens per second across labs, papers, and developer communities, ranked by signal.
总计 · 30天
2
90 天内 2
发布 · 30天
0
90 天内 0
论文 · 30天
0
90 天内 0
层级分布 · 90 天
情绪 · 30 天
2 天有情绪数据
最近 · 第 1/1 页 · 共 2 条
-
LLM benchmarks mislead on inference speed for long contexts
Current LLM inference benchmarks are misleading because they primarily measure short-context performance, which does not reflect real-world usage involving longer contexts. This discrepancy arises from the differing com…
-
Tool visualizes LLM token generation speeds from 5 to 800 tokens/sec
A new interactive tool allows users to visualize the speed of language model token generation, from 5 to 800 tokens per second. Developed by Mike Veerman, this web application helps users understand advertised speeds li…