Nederlands(NL) Qwen 3.6 27B on DeepSWE

Qwen 3.6 27B 模型在 DeepSWE 基准测试中得分 1.79%

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-07 20:13

Qwen 3.6 27B 模型在 DeepSWE 基准测试中取得了 1.79% 的分数，在 20 个模型中排名第 18 位。此次基准测试运行耗时 70 小时，使用了 RTX6000 Pro Blackwell GPU 和 262k 上下文窗口。尽管社区普遍认为该模型输出冗长，但其输出 token 与类似模型相当，并且被认为是与 Kimi 等领先的闭源模型相比，一个强大的本地化选项。 AI

影响为一款开源模型提供了性能基准，表明其在本地 LLM 生态系统中相对于其他模型的能力。

排序理由该集群报告了一款开源模型的特定基准测试结果，属于研究范畴。[lever_c_从研究降级：ic=1 ai=1.0]

在 r/LocalLLaMA 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

r/LocalLLaMA TIER_1 Nederlands(NL) · /u/SteppenAxolotl · 2026-06-07 20:13

Qwen 3.6 27B 在 DeepSWE 上

<div class="md"><p>Overview:</p> <ul> <li>It scored 2% (1.79% rounded up)</li> <li>It is 18/20th place scoring above Haiku 4.5 and Minimax M2.7</li> <li>Full benchmark took 70 hours</li> <li>Average time per task 32m</li> <li>Average output tokens per task: 44k</li…

报道来源 [1]

Qwen 3.6 27B 在 DeepSWE 上

相关实体

相关话题