Qwen 3
PulseAugur coverage of Qwen 3 — every cluster mentioning Qwen 3 across labs, papers, and developer communities, ranked by signal.
-
AI agent memory failures diagnosed via circuit analysis in Qwen models
Researchers have analyzed the internal workings of agent memory in LLMs, specifically examining the Qwen-3 family and two memory frameworks. Their findings indicate that control circuitry becomes active at smaller model…
-
New dataset and benchmark advance Bangla text-to-gloss translation for BdSL
Researchers have developed the first dataset and benchmark for Bangla text-to-gloss translation, addressing a significant gap for the Bangla Sign Language (BdSL) community. The dataset includes manually annotated and sy…
-
DeepSeek's V4 model omits Engram memory module, sparking debate and new research
DeepSeek's latest model, V4, notably omits Engram, a novel memory and efficiency module co-developed with Peking University. Engram, designed to augment Transformers by enabling direct knowledge lookups instead of recal…
-
Goodfire launches Silico for LLM debugging; OpenAI adds autonomous coding to Codex CLI
Startup Goodfire has launched Silico, a tool designed to bring more scientific rigor to AI model development. Silico allows researchers to inspect and adjust model parameters during the training process, aiming to reduc…
-
Chinese AI Labs Release Frontier Models Qwen 3.5, GLM 5, and MiniMax 2.5
Several Chinese AI labs have released new flagship open-weight models, including Qwen 3.5, GLM 5, and MiniMax 2.5. These releases represent a significant push in the frontier of AI development from these organizations. …