DeepSeek V4 Can Be Even More Economical! New Tool Achieves Cache Hit Rate of Up to 99.82%, Stable at 20% of Original Price
A new open-source tool called Reasonix has been developed to significantly reduce the cost of using DeepSeek V4 models, achieving a cache hit rate of up to 99.82%. This optimization can lower the cost of processing 400 million tokens from $61 to just $12. Reasonix is specifically designed for DeepSeek's caching mechanisms, employing an append-only loop and a cache-first strategy to keep older contexts stable and minimize recomputation. The tool also includes features for repairing tool calls and intelligently managing model versions to further control expenses. AI
IMPACT Significantly reduces operational costs for users of DeepSeek V4, potentially influencing how other models handle long contexts.