English(EN) DiffusionGemma 26B A4B results on my 5090

DiffusionGemma 26B A4B 针对 RTX 5090 进行调优，提升速度

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-11 15:00

一位 Reddit 用户分享了他们对 DiffusionGemma 26B A4B 模型进行的调优结果，特别关注了在 RTX 5090 GPU 上的性能表现。他们详细介绍了最优参数，并提供了不同量化级别和上下文长度的速度对比。调优显著提高了吞吐量，其中 Q4_K_M 变体在更长上下文时速度提升高达 44%。 AI

影响展示了参数调优如何显著提升开源模型在消费级硬件上的性能。

排序理由用户生成的开源模型调优结果和性能基准测试。[lever_c_demoted from research: ic=1 ai=1.0]

在 r/LocalLLaMA 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

r/LocalLLaMA TIER_1 English(EN) · /u/giveen · 2026-06-11 15:00

DiffusionGemma 26B A4B results on my 5090

<div class="md"># DiffusionGemma 26B A4B — Tuning Results <a href="https://huggingface.co/unsloth/diffusiongemma-26B-A4B-it-GGUF">https://huggingface.co/unsloth/diffusiongemma-26B-A4B-it-GGUF</a> ## System - **GPU**: RTX 5090 (32 GB VRAM), …

报道来源 [1]

DiffusionGemma 26B A4B results on my 5090

相关实体

相关话题