(Yet Another) KV cache calculator - kvanta.vcerny.cz
A new web-based tool called KVANTA has been released to calculate KV cache sizes for large language models. The developer created KVANTA because they found existing calculators to be inadequate. The tool is designed to support any model available on Hugging Face and is open-source under the Apache 2.0 license. AI
IMPACT Provides a new utility for users running local LLMs, simplifying resource management.