DeepSeek has introduced its V3.2 model, incorporating DeepSeek Sparse Attention (DSA). This innovation reduces attention complexity from O(L²) to O(Lk), significantly enhancing efficiency for processing long contexts. The model's architecture also leverages Lightning Indexer for further performance gains. AI
IMPACT Improves efficiency for long-context processing, potentially enabling new applications.
RANK_REASON Release of a new model version with a novel attention mechanism.
Read on Mastodon — mastodon.social →
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →