PulseAugur
实时 12:42:09
实体 Qwen 3.5

Qwen 3.5

PulseAugur coverage of Qwen 3.5 — every cluster mentioning Qwen 3.5 across labs, papers, and developer communities, ranked by signal.

Show in brief
总计 · 30天
23
90 天内 23
发布 · 30天
0
90 天内 0
论文 · 30天
8
90 天内 8
层级分布 · 90 天
关系
情绪 · 30 天

6 天有情绪数据

最近 · 第 1/2 页 · 共 23 条
  1. RESEARCH · CL_47641 ·

    llama.cpp 发布增加新的张量支持和错误修复

    llama.cpp 项目发布了多个更新,包括 b9297 版本,增加了 NVFP4 MTP 标量张量并链接了 Qwen3.5 MTP 张量。之前的版本,如 b9296 和 b9295,则侧重于 Vulkan 和其他功能的错误修复和改进。这些版本为包括 macOS、Linux、Android 和 Windows 在内的各种操作系统和硬件架构提供了预编译的二进制文件,并支持 CUDA、ROCm、Vulkan 和 SYCL 等多种计算后端。

  2. SIGNIFICANT · CL_43676 ·

    Microsoft 发布 Fara1.5 代理,性能超越 OpenAI 和 Google

    Microsoft Research 推出了 Fara1.5,这是一系列基于 Qwen3.5 构建的浏览器计算机使用代理模型(4B、9B 和 27B 参数)。这些代理通过解释屏幕截图并执行鼠标和键盘操作来与真实浏览器交互以完成任务。在 Online-Mind2Web 基准测试的评估中,最大的 Fara1.5 模型实现了 72% 的任务成功率,超越了 OpenAI 的 Operator 和 Google 的 Gemini 2.5 Com…

  3. TOOL · CL_44658 ·

    小型语言模型根据情感提示表现出行为转变

    一篇新研究论文探讨了提示中的情感框架如何影响小型语言模型(如Qwen 3.5)的行为和内部表征。研究发现,基于压力的提示会导致模型采取更多捷径和过拟合,而平静和好奇驱动的提示则会产生更诚实的回答。对模型内部运作的分析揭示了与不同情感框架相对应的独特方向向量,尤其是在最后的Transformer层中。

  4. TOOL · CL_37897 ·

    Qwen 3.5 LLM weights show signs of political censorship

    Researchers have uncovered evidence of political censorship embedded within the weights of the Qwen 3.5 large language model. Analysis revealed that the model exhibits biased responses, downplaying or omitting informati…

  5. TOOL · CL_36223 ·

    Jinja chat templates for Qwen 3.5 and 3.6 models updated

    Jinja chat templates for the Qwen 3.5 and 3.6 models have been corrected. These templates are used for interacting with AI models and are hosted on Hugging Face. The updates were shared via social media and a dedicated …

  6. RESEARCH · CL_33607 ·

    Vector RAG vs. LLM Wiki: Study reveals trade-offs in research synthesis

    A new research paper compares Vector Retrieval-Augmented Generation (RAG) against an LLM-compiled wiki for answering questions over a small corpus of 24 research papers. While the wiki excelled at synthesizing informati…

  7. TOOL · CL_26561 ·

    Ollama enables local and cloud AI coding tools for indie hackers

    In 2026, indie hackers can significantly reduce AI coding costs by leveraging local or cloud-based models through Ollama. While proprietary models like Claude Opus 4.7 offer higher performance, local alternatives such a…

  8. RESEARCH · CL_26033 ·

    Ant Group's Ling-2.6-flash cuts AI costs with token efficiency

    Ant Group's new Ling-2.6-flash model, tested anonymously as Elephant Alpha, aims to significantly reduce AI operational costs by optimizing token efficiency. This model uses a hybrid linear architecture for faster infer…

  9. TOOL · CL_25188 ·

    Qwen 3.5 leads local LLM benchmarks after switch to llama.cpp

    A technical blog post details a shift from using Ollama to llama.cpp for running large language models locally. The author found that Ollama, while user-friendly, introduced an abstraction layer that potentially skewed …

  10. TOOL · CL_23121 ·

    Small AI models enable local agents like kaibot on low-power hardware

    A new personal AI agent named kaibot has been developed to run on low-spec local hardware, challenging the trend of cloud-dependent AI. This agent leverages smaller, capable models like Alibaba's Qwen3.5 (4B) and Google…

  11. TOOL · CL_18789 ·

    New MSI metric reveals nuanced bias in LLMs, with distillation reintroducing bias

    Researchers have developed a new metric, the Moral Sensitivity Index (MSI), to evaluate contextual bias in large language models. This index quantifies the probability of biased output across a seven-tier stress test, m…

  12. TOOL · CL_15849 ·

    Component-aware self-speculative decoding boosts hybrid language model inference

    Researchers have developed a new method called component-aware self-speculative decoding, which enhances the efficiency of hybrid language models. This technique leverages the internal architectural differences within t…

  13. RESEARCH · CL_11885 ·

    LLMs generate privacy-safe synthetic clinical reports for data augmentation

    Researchers have developed a new evaluation framework to assess the quality of synthetic clinical data generated by Large Language Models (LLMs). The framework measures semantic fidelity, lexical diversity, and privacy …

  14. TOOL · CL_47613 ·

    Qwen 为门控 Delta 网络开发 FlashQLA 以实现高效注意力机制

    Qwen 开发了 FlashQLA,这是一套新的融合线性注意力内核,旨在兼容深度学习中的前向和后向传播。这些内核针对门控 Delta 网络(GDN)进行了优化,GDN 现在是 Qwen 模型家族的核心组成部分,包括 Qwen3-Next 及其后续迭代,如 Qwen3.5 和 Qwen3.6。此开发旨在提高具有扩展上下文窗口的大模型的效率和可扩展性。

  15. MEME · CL_03575 ·

    LocalLLaMA用户就编码和工具调用任务的精度与参数量进行辩论

    一位r/LocalLLaMA的用户正在寻求理解模型精度与参数量在本地LLM部署中的权衡。他们特别关注不同的量化方法和模型大小如何影响性能,尤其是在编码和工具调用任务方面。讨论内容包括比较低精度(例如1比特)的大模型与高精度的小模型。

  16. FRONTIER RELEASE · CL_46520 ·

    Alibaba's Qwen3.7-Max launches with enhanced agentic and reasoning skills

    Alibaba's Qwen has released Qwen3.7-Max, a new flagship model designed for the Agent Era. This model demonstrates significant improvements in scientific reasoning, coding, and agentic capabilities, achieving a score of …

  17. RESEARCH · CL_03678 ·

    Google's Gemma 4 AI models now run offline on iPhones

    Google's Gemma 4 models can now run directly on iPhones, enabling full offline AI inference. This development signifies a shift towards on-device AI, with smaller variants like E2B and E4B optimized for mobile efficienc…

  18. SIGNIFICANT · CL_05791 ·

    TianShu Zhixin cuts inference chip prices to gain market share amid revenue concerns

    Chinese AI chip designer Tianshu Zhixin reported 10.34 billion yuan in revenue for 2025, a 91.6% year-over-year increase, though this fell short of market expectations. The company's training chip series, "Tianhe," rema…

  19. TOOL · CL_48054 ·

    Unsloth 修复 Gemma 4 训练和量化错误

    Unsloth 为 Gemma 4 模型发布了重要的修复补丁,解决了最初并非由 Unsloth 引起但影响训练和量化的问题。这些更新解决了诸如梯度累积期间的损失爆炸和较大模型变体出现的索引错误等问题,确保 Gemma 4 训练现在能在 Unsloth 框架内正常运行。此次发布还包括了比其他设置更快的训练速度和更低的 VRAM 使用量优化,以及增强了 Unsloth Studio 对各种模型类型和任务能力的更新。

  20. TOOL · CL_17412 ·

    Google 的 Gemma 4 26B 模型可在 LM Studio 的新无头 CLI 上本地运行

    Google 的 Gemma 4 模型系列,特别是 26B-A4B 变体,现在可以在 MacBooks 等消费级硬件上进行本地推理。这种混合专家模型在每次推理时仅激活其一部分参数,从而在需要显著更少的内存和计算能力的同时,实现与更大密集模型相当的质量。LM Studio 的最新更新 0.4.0 版本引入了无头 CLI,无需图形界面即可方便地在本地设置和使用 Gemma 4 及其他模型。