English(EN) ComfyUI's nvfp4 quantization of Krea 2 is 2x slower than fp8_scaled

ComfyUI Krea 2 NVFP4 量化显示性能比 fp8_scaled 慢

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-24 18:41

Reddit r/StableDiffusion 子版块的一位用户报告称，在使用 ComfyUI 时，Krea 2 模型的 NVFP4 量化版本比 fp8_scaled 版本明显慢。该用户在 5060 Ti GPU 上观察到这种性能下降，并正在寻求其他用户的验证，因为他们期望 NVFP4 能够像在 klein9b 模型上那样提供速度提升。 AI

影响使用 ComfyUI 中的 Krea 2 和 NVFP4 量化的用户可能面临性能瓶颈。

排序理由用户报告了特定软件和模型量化的性能问题。

在 r/StableDiffusion 阅读 →

基础设施

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

ComfyUI Krea 2 NVFP4 量化显示性能比 fp8_scaled 慢

报道来源 [1]

r/StableDiffusion TIER_2 English(EN) · /u/KissMyShinyArse · 2026-06-24 18:41

ComfyUI 的 Krea 2 的 nvfp4 量化比 fp8_scaled 慢 2 倍

<div class="md"><p><code>krea2_turbo_nvfp4.safetensors</code> performs much worse than <code>krea2_turbo_fp8_scaled.safetensors</code> on my 5060 Ti. I'd expect NVFP4 to be at least twice as fast (which is true for klein9b), but somehow the opposite is true.</p> <p…

报道来源 [1]

ComfyUI 的 Krea 2 的 nvfp4 量化比 fp8_scaled 慢 2 倍

相关实体

相关话题