Qwen3.5-397B-A17B
PulseAugur coverage of Qwen3.5-397B-A17B: every cluster mentioning the model across labs, papers, and developer communities, ranked by signal.
-
LLM pricing shifts: Kimi K2.7 up, Claude 3.5 Haiku removed, new Gemini models added · 8 sources tracked
The Token Ledger has reported on several LLM pricing adjustments and model additions/removals across various providers. Notably, MoonshotAI's Kimi K2.7 Code saw a price increase for completions, while its Kimi Latest an…
-
Poolside releases Laguna M.1, a 225B MoE model for agentic coding
Poolside has released Laguna M.1, a 225 billion parameter Mixture-of-Experts model optimized for agentic coding tasks. The model features a large sparse MoE architecture with 256 experts and global attention, enabling i…
-
Rio de Janeiro AI Model Exposed as Merged Open-Source Models
The City of Rio de Janeiro's IT agency, IplanRIO, claimed to have developed an original 397-billion-parameter AI model named Rio-3.5-Open-397B. This model reportedly outperformed Alibaba's Qwen 3.7 Plus on several codin…
-
SWE-rebench leaderboard adds 110 new Python tasks for AI models
The SWE-rebench leaderboard has been updated with 110 new Python tasks from GitHub PRs spanning March, April, and May. This update focuses on evaluating models' ability to read real issues, edit code, and pass test suit…
-
New LLM Safety Tools Target Financial Regulatory Compliance
Researchers have developed two new systems, FinGuard and FinHarness, to enhance the safety and regulatory compliance of Large Language Models (LLMs) in financial services. FinGuard, built on Qwen3-8B, uses a novel pipel…
-
Together AI releases Violin, an open-source video translation tool
Together AI has launched Violin, an open-source video translation tool designed to make online video content accessible across language barriers. The system utilizes advanced AI, including speech recognition, large lang…
-
Medical thinking with multiple images
Researchers have developed MIRAGE, a system designed to aid medical education by retrieving and generating multimodal medical images and texts. MIRAGE utilizes a fine-tuned CLIP model (MedICaT-ROCO) and a diffusion mode…
-
New research explores LLM security, efficiency, and training optimization
Researchers are developing novel methods to enhance the efficiency and security of Large Language Models (LLMs). One approach, "Widening the Gap," exploits outlier injection to compromise LLM quantization, demonstrating…
-
Qwen3.6-27B model offers flagship coding performance in a smaller package
Qwen has released Qwen3.6-27B, an open-weight model that reportedly matches flagship-level coding performance. This new model significantly outperforms its predecessor, Qwen3.5-397B-A17B, while being substantially small…
-
Alibaba's Qwen3.5-397B-A17B model offers multimodal capabilities and efficient inference
Alibaba has released Qwen3.5-397B-A17B, an open-weight, natively multimodal model featuring a hybrid attention mechanism and sparse Mixture-of-Experts architecture. The model boasts support for 201 languages and demonst…