PulseAugur
LIVE 23:12:24
tool · [3 sources] ·
1
tool

Tool visualizes LLM token generation speeds from 5 to 800 tokens/sec

A new interactive tool allows users to visualize the speed of language model token generation, from 5 to 800 tokens per second. Developed by Mike Veerman, this web application helps users understand advertised speeds like "30 tokens/second" by simulating the output in real-time. The tool is useful for gauging the practical performance of different LLMs. AI

Summary written by gemini-2.5-flash-lite from 3 sources. How we write summaries →

IMPACT Helps users intuitively grasp and compare LLM generation speeds, aiding in model selection and expectation setting.

RANK_REASON The cluster describes a new interactive tool for visualizing LLM performance metrics.

Read on Simon Willison →

COVERAGE [3]

  1. Simon Willison TIER_1 ·

    How fast is 10 tokens per second really?

    <p><strong><a href="https://mikeveerman.github.io/tokenspeed/">How fast is 10 tokens per second really?</a></strong></p> Neat little HTML app by Mike Veerman (<a href="https://github.com/MikeVeerman/tokenspeed/blob/master/index.html">source code here</a>) which simulates LLM toke…

  2. Mastodon — mastodon.social TIER_1 · [email protected] ·

    How fast is 10 tokens per second really? https://simonwillison.net/2026/May/20/tokens-per-second/#atom-everything # AI # LLM # Performance

    How fast is 10 tokens per second really? https://simonwillison.net/2026/May/20/tokens-per-second/#atom-everything # AI # LLM # Performance

  3. Mastodon — mastodon.social TIER_1 · [email protected] ·

    How fast is N tokens per second really? https://mikeveerman.github.io/tokenspeed/ # HackerNews # Tech # AI

    How fast is N tokens per second really? https://mikeveerman.github.io/tokenspeed/ # HackerNews # Tech # AI