Qwen3-30B-A3B
PulseAugur coverage of Qwen3-30B-A3B — every cluster mentioning Qwen3-30B-A3B across labs, papers, and developer communities, ranked by signal.
5 day(s) with sentiment data
-
SHAPE framework prunes MoE LLMs by modeling expert coalitions
Researchers have developed a new framework called SHAPE for pruning experts in sparse Mixture-of-Experts (MoE) large language models. Unlike previous methods that evaluated experts independently, SHAPE considers the coo…
-
New method allows MoE models to skip over half of experts
Researchers have developed a new framework called Zero-Expert Self-Distillation Adaptation (ZEDA) to make Mixture-of-Experts (MoE) language models more efficient. ZEDA allows post-trained static MoE models to dynamicall…
-
New frameworks automate software repository generation and management
Researchers have developed new frameworks to automate the creation and management of software repositories, addressing a key bottleneck in automated software engineering. One system, RepoLaunch, successfully builds and …
-
AI safety research finds ways to preserve model capabilities during fine-tuning
Researchers explored methods to mitigate capability degradation in AI models when using off-model supervised fine-tuning (SFT) for safety. They found that while off-model SFT can suppress capabilities, these abilities m…
-
LEVI system offers AlphaEvolve capabilities at fraction of cost
A new open-source system named LEVI has been developed to emulate AlphaEvolve's capabilities at a significantly reduced cost, reportedly up to 35 times cheaper. LEVI's core principle is that smaller language models can …
-
New framework finds and fixes errors in AI logic datasets
Researchers have identified significant inaccuracies in popular Natural Language to First-Order Logic (NL-to-FOL) datasets, with FOLIO and MALLS showing approximately 39% and 36% incorrect formalizations, respectively. …
-
New method allows MoE models to skip over half of experts
Researchers have developed a new framework called Zero-Expert Self-Distillation Adaptation (ZEDA) to make existing Mixture-of-Experts (MoE) language models more efficient. ZEDA allows post-trained static MoE models to d…
-
MoE models misroute tokens on complex reasoning tasks, study finds
Researchers have identified a significant issue in Mixture-of-Experts (MoE) language models where the routing mechanism, which directs tokens to specific experts, often selects suboptimal paths. While the standard route…
-
Researchers propose efficient LLM classification probes to reduce latency and VRAM
Researchers have developed a method to integrate classification tasks, such as safety checks, directly into the forward pass of large language models (LLMs). This approach uses lightweight probes trained on the LLM's in…