Qwen
PulseAugur coverage of Qwen — every cluster mentioning Qwen across labs, papers, and developer communities, ranked by signal.
- developed by Alibaba Group 95%
- instance of Qwen 3.6 95%
- developed Alibaba Cloud 90%
- developed by Qwen 2.5 90%
- developed by Qwen3.6-Plus 90%
- instance of generative pre-trained transformer 90%
- employed by Lin Chun-yang 90%
- founded Lin Chun-yang 90%
- employed by HongShan 90%
- instance of Qwen3.7 Max 90%
- instance of Qwen 3.7 90%
- founded HongShan 90%
- 2026-05-23 product_launch Alibaba Cloud releases Qwen 3.6 and Qwen 2.5 models with enhanced features. 来源
- 2026-05-21 product_launch Alibaba integrated its Qwen AI model with the Taobao e-commerce platform to enable AI-powered shopping.
- 2026-05-19 product_launch Alibaba's Qwen team released preview versions of its Qwen 3.7 Max and Qwen 3.7 Plus models. 来源
- 2026-05-19 product_launch Qwen released version 3.7 of its language model, featuring a tunable censorship circuit. 来源
- 2026-05-18 product_launch Alibaba's Qwen team released previews of their Qwen3.7-Max and Qwen3.7-Plus models. 来源
- 2026-05-16 research_milestone Qwen team developed a new Variational Autoencoder model. 来源
- 2026-05-11 research_milestone Researchers achieved high accuracy in a Ukrainian document understanding task using a retrieval-augmented system powered by Qwen models. 来源
- 2026-05-11 product_launch Alibaba integrated its Qwen AI model with Taobao to create an end-to-end AI shopping experience.
- 2026-05-10 product_launch Alibaba fully integrated its Qwen AI assistant with Taobao and Tmall, enabling conversational shopping.
- 2026-05-10 product_launch Alibaba launched an AI shopping assistant by integrating its Qwen AI with Taobao and Tmall.
- 2026-04-30 research_milestone Qwen released Qwen-Scope, an interpretability toolkit for LLMs.
- 2026-04-15 product_launch Alibaba's Qwen team released new multimodal models Qwen3.6-27B and Qwen3.6-35B-A3B.
21 天有情绪数据
-
本地AI工具通过新的预测和解码技术提升LLM速度
本地AI社区的最新更新正在提高推理速度,并为开放权重模型提供实际的基准测试。llama.cpp项目现已支持多令牌预测(MTP),该技术在消费级硬件上使Gemma 26B模型的速度提升了40%。另外,vLLM利用DFlash推测解码,使Gemma 4 26B模型在RTX 5090 GPU上达到了每秒600个令牌的速度。此外,Ollama社区发布了Qwen和DeepSeek编码模型在本地开发任务上的比较基准测试。
-
New research reveals "coupling tax" limits LLM reasoning accuracy
A new research paper introduces the concept of a "coupling tax" in large language models, highlighting how shared token budgets for reasoning and final answers can hinder accuracy. The study found that for certain tasks…
-
Autolearn framework enables language models to learn from documents without supervision
Researchers have introduced Autolearn, a novel framework designed to enable language models to learn from documents without external supervision. The system identifies passages that generate unusually high per-token los…
-
Gemma 4 and Kimi K2 models tested for local inference
The second round of a model showdown includes Gemma 4 from Google and Kimi K2 from Moonshot AI, with a focus on local inference capabilities. Gemma 4, a 27B parameter model, was easily integrated into the Coder platform…
-
Chinese LLMs offer significant cost savings but face adoption hurdles for global developers.
Chinese large language models offer significantly lower pricing compared to Western counterparts like GPT-4o, with some models being 8 to 20 times cheaper. Despite their cost-effectiveness and surprisingly strong perfor…
-
AI firms secure funding, launch new products, and integrate as xAI joins SpaceX
Qwen has launched an AI voice input feature for its PC client, allowing users to dictate text and issue commands across various desktop applications. This update includes capabilities for cleaning up spoken language, er…
-
2026年七款小型编码AI模型提供本地开发能力
文章重点介绍了七款适合本地开发的小型编码AI模型,强调了它们的效率和隐私优势。这些模型,包括OpenAI的gpt-oss-20b和Microsoft的Phi-3.5-mini-instruct,专为在消费级硬件上运行而设计,并在编码任务中可与大型闭源模型相媲美。该列表还包括了具有视觉能力的Qwen3-VL-32B-Instruct,具有推理能力的Apriel-1.5-15b-Thinker,以及性能出色的ByteDance的Seed-…
-
Qianwen推出PC版AI语音输入,增强桌面应用使用体验
Qwen为其PC应用程序推出了一款由AI驱动的语音输入功能,使用户能够通过语音输入文本并在各种桌面程序中发出指令。这项新功能包括去除填充词、纠错和上下文感知响应等功能,可用于内容创作和翻译等任务。此次更新旨在提高用户在桌面环境中的生产力和交互性。
-
Mistral、QWen 模型在生物医学文本简化中展现出不同的策略
一篇新的研究论文比较了 Mistral-Small 和 QWen2.5 模型在应用于生物医学信息时的文本简化策略。研究发现,Mistral-Small 能有效平衡可读性和准确性,其表现与人类简化相当。QWen2.5 也能提高可读性,但在简化文本和保留其原始含义之间的平衡方面表现不太一致。
-
分布式输出模板而非单点位置驱动LLM上下文学习
研究人员已证明,大型语言模型中的上下文学习是由分布式输出模板而非单点激活驱动的。通过多点干预,他们实现了高达96%的任务迁移率,并将第8层确定为上下文学习任务身份的因果位点。这一发现跨越了多种模型架构,表明在网络深度约30%处存在一个通用的干预窗口。
-
Alibaba Cloud leads China's AI for Science cloud market for research institutions
Alibaba Cloud has emerged as the leader in China's AI for Science (AI4S) cloud market for research institutions, capturing a 26% market share. The AI4S market is experiencing rapid growth, with projections indicating it…
-
爱芯元智 leverages dual-wheel drive for automotive and edge AI growth
AI chip company Aixin Yuanzhi is experiencing significant growth in its automotive and edge AI inference businesses, with revenues surging by 618.2% and 134.6% respectively in 2025. The company's strategy focuses on a "…
-
Language models' self-verification effectiveness varies by task and model
Researchers have investigated the effectiveness of language models verifying their own answers as a confidence signal. Their study, conducted on ARC-Challenge and TruthfulQA-MC datasets using various models like Phi-2 a…
-
新的EO-Gym环境训练AI代理进行交互式地球观测分析
研究人员推出了EO-Gym,这是一个专为地球观测(EO)代理设计的交互式框架。该环境支持多模态分析和工具使用,模拟现实世界中经常涉及扩展感兴趣区域和检索不同传感器历史数据的EO任务。创建了一个包含超过9000个轨迹的基准数据集EO-Gym-Data来评估代理性能,结果显示当前的大型视觉语言模型在交互式EO推理方面存在困难。在EO-Gym-Data上微调Qwen模型显著提高了其在这些任务上的性能。
-
Unsloth 推出用于本地 LLM 部署的 API 端点
Unsloth 发布了一个新的 API 推理端点,允许用户运行具有增强功能的本地大型语言模型。该端点同时支持 Anthropic 和 OpenAI 兼容的方言,从而能够与各种 AI 代理和聊天客户端无缝集成。此次更新还引入了 NVIDIA Nemotron 3 Nano Omni 和 Mistral 3.5 Medium 等新模型,并对 Unsloth Studio 进行了一些错误修复和改进。
-
Ant Group releases Ling 2.6 AI model family with trillion-parameter flagship
Ant Group has released Ling 2.6, a new family of open-source AI models that rival Western counterparts like DeepSeek and Qwen. The flagship version boasts a trillion parameters, while a leaner 'flash' model features 104…
-
Amazon SageMaker adds agentic fine-tuning for Llama, Qwen, Deepseek, and Nova
Amazon SageMaker has introduced agentic fine-tuning capabilities for open-weight models like Llama, Qwen, and Deepseek. This new feature allows developers to customize AI agents using reinforcement learning, aiming to e…
-
新方法通过高效的稀疏化、量化和压缩来加速大型语言模型
研究人员开发了几种新的方法来压缩和优化大型语言模型(LLMs),以提高效率并降低计算成本。SparseForge 通过优化稀疏掩码来专注于高效的半结构化稀疏化,以显著更少的重新训练 token 实现高精度。FASQ 引入了灵活的加速子空间量化,能够在没有校准数据的情况下实现连续的压缩级别,并在商品 GPU 上在准确性和速度方面均优于现有方法。此外,CoSpaDi 使用校准引导的稀疏字典学习进行结构化分解,改善了精度-压缩权衡。另一种方…
-
GRACE framework enables efficient, quantized Vision-Language Models
Researchers have developed GRACE, a new framework that combines knowledge distillation and quantization-aware training to make Vision-Language Models (VLMs) more efficient. This method aims to reduce the accuracy loss t…
-
使用 LFM 2 和 Transformers.js,通过 WebGPU 在本地运行 LLM
Thomas Bley 发布了新的幻灯片,详细介绍了如何使用 LFM 2 在本地运行大型语言模型 (LLM)。该演示文稿还涵盖了将 Transformers.js 与 WebGPU 结合用于隐私过滤器、函数调用和嵌入,所有这些都在用户的浏览器中进行处理。