ENTITY
deepseek-moe-16b-base
PulseAugur coverage of deepseek-moe-16b-base — every cluster mentioning deepseek-moe-16b-base across labs, papers, and developer communities, ranked by signal.
Total · 30d: 2 (2 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 2 (2 over 90d)
SENTIMENT · 30D: 1 day with sentiment data
RECENT · PAGE 1/1 · 2 TOTAL
- New method prunes MoE language models using generic text corpora
Researchers have developed a new method called Generic TB-Coverage for pruning sparsely activated Mixture-of-Experts (MoE) language models. This technique addresses the challenge of removing redundant experts without re…
- ConMoE framework compresses MoE models without retraining
Researchers have developed ConMoE, a novel framework for compressing Mixture-of-Experts (MoE) language models without requiring retraining. This method consolidates the expert pool by reassigning original expert referen…