ENTITY deepseek-moe-16b-base

deepseek-moe-16b-base

PulseAugur coverage of deepseek-moe-16b-base — every cluster mentioning deepseek-moe-16b-base across labs, papers, and developer communities, ranked by signal.

Show in brief

Total · 30d

2 over 90d

Releases · 30d

0 over 90d

Papers · 30d

2 over 90d

TIER MIX · 90D

TOPICS

SENTIMENT · 30D

1 day(s) with sentiment data

RECENT · PAGE 1/1 · 2 TOTAL

TOOL · CL_122974 · Jul 3 · 04:00

New method prunes MoE language models using generic text corpora

Researchers have developed a new method called Generic TB-Coverage for pruning sparsely activated Mixture-of-Experts (MoE) language models. This technique addresses the challenge of removing redundant experts without re…
TOOL · CL_58625 · May 29 · 04:00

ConMoE framework compresses MoE models without retraining

Researchers have developed ConMoE, a novel framework for compressing Mixture-of-Experts (MoE) language models without requiring retraining. This method consolidates the expert pool by reassigning original expert referen…

New method prunes MoE language models using generic text corpora

ConMoE framework compresses MoE models without retraining