New C++ tokenizer 'quicktok' reports up to 11x speedup over tiktoken

By PulseAugur Editorial · [1 source] · 2026-06-16 04:24

quicktok is a new BPE tokenizer written in C++ whose token ids are byte-identical to those produced by tiktoken. Its author reports that it runs 2-3.6x faster than bpe-openai and 4-11x faster than tiktoken itself. It supports the cl100k and o200k encodings as well as the tokenizers for GPT-OSS, Llama-3, and Qwen2.5/3. The speedup is attributed to data structure engineering.
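Claims of this kind (identical token ids, an N-fold speedup) can be checked with a small harness. The sketch below is only an illustration: `compare_encoders` and the two stand-in encoders are hypothetical names, and neither quicktok nor tiktoken is called, since quicktok's actual API is not described in the source. Any two callables that map a string to a list of token ids could be passed in instead.

```python
import time
from typing import Callable, Dict, Iterable, List

Encoder = Callable[[str], List[int]]


def compare_encoders(reference: Encoder, candidate: Encoder,
                     texts: Iterable[str], repeats: int = 5) -> Dict[str, object]:
    """Check that two encoders agree on every text and estimate the speedup."""
    texts = list(texts)

    # Exactness: every input must yield the same sequence of token ids.
    mismatches = [t for t in texts if reference(t) != candidate(t)]

    def best_time(encode: Encoder) -> float:
        # Best-of-N wall-clock time over the whole corpus, to reduce noise.
        best = float("inf")
        for _ in range(repeats):
            start = time.perf_counter()
            for t in texts:
                encode(t)
            best = min(best, time.perf_counter() - start)
        return best

    ref_s = best_time(reference)
    cand_s = best_time(candidate)
    speedup = ref_s / cand_s if cand_s > 0 else float("inf")
    return {"identical": not mismatches, "mismatches": mismatches, "speedup": speedup}


# Hypothetical stand-ins: both emit one id per UTF-8 byte, so they agree exactly.
def slow_byte_encoder(text: str) -> List[int]:
    ids: List[int] = []
    for b in text.encode("utf-8"):
        ids = ids + [b]  # deliberately wasteful: copies the list on every step
    return ids


def fast_byte_encoder(text: str) -> List[int]:
    return list(text.encode("utf-8"))


if __name__ == "__main__":
    corpus = ["hello world", "naïve café", "x" * 2000]
    report = compare_encoders(slow_byte_encoder, fast_byte_encoder, corpus)
    print(report["identical"], round(report["speedup"], 1))
```

The measured ratio depends on the corpus, the hardware, and how each library is invoked, so a harness like this gives an estimate for one setup rather than a reproduction of the reported figures.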

IMPACT Accelerates tokenization workflows, potentially speeding up LLM inference and training processes.

RANK_REASON The cluster describes a new open-source software release for a specific AI task (tokenization) with benchmark results.



AI-generated summary · Google Gemini · from 1 source.

COVERAGE [1]

r/MachineLearning TIER_1 English(EN) · /u/_casa_nova_ · 2026-06-16 04:24

quicktok: a faster tokenizer (exact and byte-identical with tiktoken) [P]

Been working on this a while! Should be useful for anyone trying to speed up their tokenization workflows. quicktok is a fast/exact BPE tokenizer written in C++. Token ids are byte-identical to tiktoken and en…
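For readers unfamiliar with what an "exact" BPE tokenizer has to reproduce, the following is a naive reference sketch of rank-based byte-pair merging in the style tiktoken uses: start from single bytes and repeatedly merge the adjacent pair with the lowest rank. It is not quicktok's implementation, and `bpe_encode` and `TOY_RANKS` are hypothetical names with a made-up vocabulary. Real tokenizers also split the text into pieces with a regular expression first, and this sketch handles a single piece only.

```python
from typing import Dict, List


def bpe_encode(piece: bytes, ranks: Dict[bytes, int]) -> List[int]:
    """Naive BPE: merge the lowest-ranked adjacent pair until none is left."""
    parts = [bytes([b]) for b in piece]  # start from individual bytes
    while len(parts) > 1:
        best_rank = None
        best_index = -1
        # Scan every adjacent pair; keep the leftmost pair with the lowest rank.
        for i in range(len(parts) - 1):
            rank = ranks.get(parts[i] + parts[i + 1])
            if rank is not None and (best_rank is None or rank < best_rank):
                best_rank, best_index = rank, i
        if best_rank is None:
            break  # no adjacent pair is in the vocabulary
        merged = parts[best_index] + parts[best_index + 1]
        parts[best_index:best_index + 2] = [merged]
    # Every remaining part must be a vocabulary entry; its rank is its token id.
    return [ranks[p] for p in parts]


# Toy vocabulary (an assumption for illustration, not a real encoding).
TOY_RANKS = {b"a": 0, b"b": 1, b"c": 2, b"ab": 3, b"bc": 4, b"abc": 5}

if __name__ == "__main__":
    print(bpe_encode(b"abc", TOY_RANKS))   # [5]
    print(bpe_encode(b"abab", TOY_RANKS))  # [3, 3]
```

This loop rescans all pairs after every merge, so it is quadratic in the piece length. A faster tokenizer has to produce the same ids while avoiding that repeated work, which is the kind of constraint an "exact and byte-identical" claim implies.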

