vLLM has released version 0.22.1, with a release candidate v0.22.1rc2 also available. These releases address a compatibility issue with CUTLASS fmin initialization specifically for the DeepSeek-V4 model. The fix ensures smoother integration and operation of DeepSeek-V4 when using the vLLM inference engine. AI
IMPACT Ensures smoother operation of DeepSeek-V4 with the vLLM inference engine.
RANK_REASON This is a software release for an open-source inference engine, addressing a specific model compatibility issue.
AI-generated summary · Google Gemini · from 2 sources. How we write summaries →