PulseAugur
EN
LIVE 23:56:09

AI-toolkit fork enhances model training on lower VRAM

An optimized fork of the ai-toolkit has been released, focusing on memory optimizations to enable training of most models on 24GB of VRAM without performance compromises. This fork includes support for DoRA and inference LoRA, allowing users to train on base models and generate samples using turbo LoRAs. These enhancements aim to make model training more accessible on hardware with less VRAM, though some larger models like Qwen may still require 6-bit training. AI

IMPACT Enables training of more AI models on consumer-grade hardware, potentially lowering the barrier to entry for AI development.

RANK_REASON This is a fork of an existing toolkit with optimizations, not a new frontier release or significant industry event.

Read on r/StableDiffusion →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

AI-toolkit fork enhances model training on lower VRAM

COVERAGE [1]

  1. r/StableDiffusion TIER_2 English(EN) · /u/Incognit0ErgoSum ·

    Optimized ai-toolkit fork -- Memory optimizations so most models train in 24GB or less without block swapping or disabling sample inference; DoRA support; inference LoRA support (so you can train on base and generate samples on turbo); min_snr_gamma for more models; more optimizer selections

    <!-- SC_OFF --><div class="md"><p><a href="https://github.com/envy-ai/ai-toolkit-envy-optimize">https://github.com/envy-ai/ai-toolkit-envy-optimize</a></p> <p>Edit: I should have mentioned this in my title, but it specifically trains Krea 2 comfortably on a 4090 with no layer swa…