PulseAugur
EN
LIVE 03:14:10

Stable Diffusion users report INT8 performance issues with LoRA

Users of Stable Diffusion are encountering a performance issue where enabling INT8 precision, which typically doubles generation speed, causes a significant slowdown when LoRA (Low-Rank Adaptation) is also applied. This negates the speed benefits of INT8, with generation times becoming comparable to or even slower than standard FP8 precision when LoRAs are in use. The problem has been reported by multiple users on platforms like Reddit, suggesting a widespread compatibility or optimization challenge within the Stable Diffusion ecosystem. AI

IMPACT Potential slowdown in AI image generation workflows for users employing INT8 precision with LoRA, requiring further optimization.

RANK_REASON User-reported issue with a specific feature (INT8 precision) in an existing AI tool (Stable Diffusion) when combined with another feature (LoRA).

Read on r/StableDiffusion →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

Stable Diffusion users report INT8 performance issues with LoRA

COVERAGE [1]

  1. r/StableDiffusion TIER_2 English(EN) · /u/Puzzled-Valuable-985 ·

    Is there a solution yet? INT8 is twice as fast, adding LoRa doubles generation time.

    <!-- SC_OFF --><div class="md"><p>Using Krea2 Int8, the speed on the RTX 3060 Ti practically doubles, taking half the time of FP8.</p> <p>However, the problem arises when adding a LoRA.</p> <p>The time becomes the same as FP8, or even increases slightly.</p> <p>Int8 without LoRA:…