English(EN) I tested 4 local VLMs as "bad hands" detectors. Here's which one works best as a judge

Qwen 3.5 122B 在检测 AI 生成的手部错误方面领先本地 VLM

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-02 17:40

一位用户测试了四种本地视觉语言模型 (VLM)，以确定它们在检测 AI 图像中生成不佳的手部方面的有效性。Qwen 3.5 122B 表现最佳，具有 100% 的精确率和不错的召回率，尽管它偶尔会错过细微的解剖学错误。Gemma 4 26B 和 Qwen3-VL 被发现无效，Gemma 拒绝了太多图像，而 Qwen3-VL 则通过了大多数图像。 AI

影响通过检测常见错误，确定了 VLM 在提高 AI 图像生成质量方面的实际应用。

排序理由用户对现有模型进行的特定任务基准测试。[lever_c_demoted from research: ic=1 ai=0.7]

在 r/StableDiffusion 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

r/StableDiffusion TIER_2 English(EN) · /u/dh7net · 2026-06-02 17:40

我测试了4个本地VLM作为“坏手”检测器。结果显示哪一个最适合作为裁判

<div class="md">We all know that hands can be hard for small local models, so I tried to find the best way to detect bad hands with my local setup (GX10 Spark). I though any VLM like Gemma would work, but not at all. So I had to test several of the…

报道来源 [1]

我测试了4个本地VLM作为“坏手”检测器。结果显示哪一个最适合作为裁判

相关实体

相关话题