StepFunai has released Step-3.7-Flash, a 198 billion parameter sparse Mixture-of-Experts model. This new vision-language model offers day-zero support within the vLLM inference engine. The integration with vLLM is highlighted as a key feature for efficient deployment. AI
IMPACT Enables efficient deployment of a large sparse MoE vision-language model.
RANK_REASON Release of a new model with specific technical details and integration. [lever_c_demoted from research: ic=1 ai=1.0]
Read on Mastodon — fosstodon.org →
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →