PulseAugur
EN
LIVE 10:07:35
ENTITY mixture of experts

mixture of experts

PulseAugur coverage of mixture of experts — every cluster mentioning mixture of experts across labs, papers, and developer communities, ranked by signal.

Show in brief
Total · 30d
153
153 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
120
120 over 90d
TIER MIX · 90D
TOPICS
RELATIONSHIPS
TIMELINE
  1. 2026-05-11 research_milestone A new paper proposes an enhanced Mixture-of-Experts framework for faster time series forecasting model training. source
SENTIMENT · 30D

21 day(s) with sentiment data

RECENT · PAGE 1/8 · 153 TOTAL
  1. RESEARCH · CL_111297 ·

    New SharpMoE framework enhances diffusion models with accurate routing

    Researchers have developed SharpMoE, a new framework designed to improve the efficiency and performance of Mixture-of-Experts (MoE) diffusion models used in visual generation. The framework addresses a routing assignmen…

  2. TOOL · CL_109953 ·

    Study questions modularity of frontier Mixture-of-Experts models

    A new study published on arXiv investigates the modularity of Mixture-of-Experts (MoE) models, specifically testing the Command A+ model. The research found that apparent functional modularity in these models is often r…

  3. RESEARCH · CL_109525 ·

    SARA framework enhances multilingual capabilities in Mixture-of-Experts models

    Researchers have introduced SARA (Semantically Anchored Routing Alignment), a new framework designed to improve the performance of Mixture-of-Experts (MoE) models in low-resource languages. SARA addresses the issue wher…

  4. RESEARCH · CL_109609 ·

    New method learns domain generalization via subset-shared invariances

    Researchers have introduced a new approach to domain generalization called subset-shared invariance, which addresses limitations of current methods that enforce global invariance across all source domains. This new tech…

  5. TOOL · CL_108072 ·

    Automated pipeline uncovers bias in MoE4 architecture search

    Researchers have developed an automated pipeline to explore heterogeneous 4-Expert Mixture-of-Experts (MoE4) architectures within the LEMUR dataset ecosystem. This pipeline systematically combines base architecture fami…

  6. TOOL · CL_108057 ·

    MoE models show mixed inference performance on consumer and edge hardware

    A recent study investigated whether Mixture-of-Experts (MoE) language models offer practical inference advantages on consumer and edge hardware. The research found that while MoE models theoretically reduce per-token co…

  7. RESEARCH · CL_107851 ·

    RAVEN model enhances financial forecasting with adaptive context windows

    Researchers have introduced RAVEN, a novel Mixture-of-Experts framework designed to improve financial time series forecasting. Unlike traditional models that use fixed context windows, RAVEN adaptively determines the op…

  8. TOOL · CL_105119 ·

    New MoE framework integrates diverse architectures for improved plant disease classification

    Researchers have developed a novel adaptive soft Mixture-of-Experts (MoE) framework designed to improve plant leaf disease classification. This framework integrates three distinct architectures—EfficientNet-B0, DenseNet…

  9. RESEARCH · CL_105072 ·

    New framework uses hierarchical RL for neural network compression

    Researchers have developed HiReLC, a hierarchical reinforcement learning framework designed to jointly quantize and prune deep neural networks. This approach uses low-level agents for per-kernel configurations and high-…

  10. TOOL · CL_105172 ·

    New RAD method controls MoE language model reasoning without text analysis

    Researchers have developed a new method called RAD (Routing Agreement Decoding) for controlling reasoning in sparse Mixture-of-Experts (MoE) language models. This technique leverages the internal routing states of MoE m…

  11. SIGNIFICANT · CL_106351 ·

    NVIDIA Nemotron 3 Nano: Open Model for Efficient AI Agents

    NVIDIA has released Nemotron 3 Nano, a 30-billion parameter open model designed for efficient reasoning and long-context applications. This model utilizes a hybrid Mixture-of-Experts architecture, activating only a frac…

  12. SIGNIFICANT · CL_100955 ·

    NVIDIA unveils efficient Nemotron 3 LLM family with hybrid architecture

    NVIDIA has released two new large language models, Nemotron 3 Nano and Nemotron 3 Ultra, focusing on efficiency and advanced capabilities. Nemotron 3 Nano is a 30B-class model designed for private inference and agentic …

  13. SIGNIFICANT · CL_100080 ·

    DeepSeek unveils V4 models with 1M token context and MoE architecture

    DeepSeek has released a preview of its DeepSeek-V4 series of Mixture-of-Experts (MoE) language models, featuring DeepSeek-V4-Pro (1.6T parameters) and DeepSeek-V4-Flash (284B parameters). Both models support an unpreced…

  14. TOOL · CL_100078 ·

    LLM Cross-Lingual Transfer: Task Alignment Over Linguistic Family

    A new research paper explores cross-lingual transfer in large language models, specifically examining Arabic fine-tuning and its impact on Semitic languages. The study found no evidence of Semitic-specific transfer, ind…

  15. TOOL · CL_111008 ·

    New framework improves speaker verification for non-verbal vocalizations

    Researchers have developed a new framework for speaker verification that improves accuracy for non-verbal vocalizations (NVVs) while preserving performance on speech. The system combines frozen self-supervised features …

  16. RESEARCH · CL_98148 ·

    New research analyzes MoE model calibration and discontinuities · 4 sources tracked

    Two new research papers explore the complexities of Mixture-of-Experts (MoE) models, particularly concerning calibration and discontinuities. The first paper investigates how expert-level calibration impacts MoE perform…

  17. RESEARCH · CL_97837 ·

    FoMoE system partitions LLM experts to reduce distributed training costs

    Researchers have introduced FoMoE, a novel system designed to overcome the limitations of training large language models (LLMs) across geographically distributed data centers. Unlike previous methods that required full …

  18. TOOL · CL_96117 ·

    New research enables editable and composable KV cache for LLMs

    A new research paper introduces a novel method for optimizing KV cache usage in large language models, enabling editable and composable notes within the prefill stage. This approach allows for efficient editing of model…

  19. COMMENTARY · CL_95449 ·

    Mixture of Experts (MoE) enhances AI model inference speed

    Mixture of Experts (MoE) is presented as a solution to slow model inference times. By optimizing token routing, MoE architectures can effectively scale to handle increased request volumes. This approach aims to improve …

  20. RESEARCH · CL_95840 ·

    SoftMoE introduces differentiable routing for Mixture-of-Experts LLMs

    Researchers have introduced SoftMoE, a novel approach to Mixture-of-Experts (MoE) architectures for Large Language Models (LLMs). Unlike traditional sparse MoE models that use a non-differentiable top-k routing mechanis…