PulseAugur

Prism ML releases compact Bonsai Image 4B diffusion model

Prism ML has released Bonsai Image 4B, a text-to-image diffusion model that uses ternary weights to shrink its size. It ships in two 2-bit builds: one for Apple Silicon via MLX and one for NVIDIA GPUs via Gemlite. An early tester on r/StableDiffusion reports 4.2 seconds per 1024×1024 image at 4 steps on a spark GX10, with poor text rendering but surprisingly good results for everything else.
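As a rough illustration of why ternary weights make a model small, the sketch below rounds weights to -1, 0, or +1 and packs four of them into each byte (2 bits per weight). This is not Prism ML's code: the absmean rounding rule, every function name, and the reading of "4B" as about four billion fully ternary parameters are assumptions made for the example.

```python
def ternarize(weights):
    """Round floats to {-1, 0, +1} with one shared scale.

    The absmean scale used here is an assumed rule, not Prism ML's method.
    """
    scale = sum(abs(w) for w in weights) / len(weights) or 1.0
    codes = [max(-1, min(1, round(w / scale))) for w in weights]
    return codes, scale


def pack_2bit(codes):
    """Store each ternary code (shifted to 0..2) in 2 bits, four codes per byte."""
    padded = list(codes) + [0] * (-len(codes) % 4)
    out = bytearray()
    for i in range(0, len(padded), 4):
        byte = 0
        for j, code in enumerate(padded[i:i + 4]):
            byte |= (code + 1) << (2 * j)
        out.append(byte)
    return bytes(out)


def unpack_2bit(packed, count):
    """Inverse of pack_2bit: recover the first `count` ternary codes."""
    codes = []
    for byte in packed:
        for j in range(4):
            codes.append(((byte >> (2 * j)) & 0b11) - 1)
    return codes[:count]


def weight_gigabytes(n_params, bits_per_weight):
    """Raw weight storage in decimal gigabytes, ignoring scales and metadata."""
    return n_params * bits_per_weight / 8 / 1e9


codes, scale = ternarize([0.9, -0.05, -1.2, 0.4, 0.0])
packed = pack_2bit(codes)
restored = unpack_2bit(packed, len(codes))

# Hypothetical parameter count: "4B" read as 4 billion weights.
two_bit_gb = weight_gigabytes(4_000_000_000, 2)
fp16_gb = weight_gigabytes(4_000_000_000, 16)
```

Under those assumptions the packed weights come to about 1 GB, against about 8 GB for the same count in 16-bit floats. Real checkpoints also carry scales, and possibly some unquantized layers, so actual file sizes will differ.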

IMPACT Offers a highly compressed text-to-image model, potentially enabling wider deployment on edge devices and consumer hardware.

RANK_REASON This is a release of a new model with a novel quantization technique, but it is not from a frontier lab and does not represent a significant industry shift.

AI-generated summary · Google Gemini · from 3 sources.

COVERAGE [3]

  1. Hugging Face Trending Models TIER_1 English(EN) · prism-ml ·

    prism-ml/bonsai-image-ternary-4B-mlx-2bit

    text-to-image · 0 downloads · 44 likes

  2. Hugging Face Trending Models TIER_1 English(EN) · prism-ml ·

    prism-ml/bonsai-image-ternary-4B-gemlite-2bit

    text-to-image · 0 downloads · 48 likes

  3. r/StableDiffusion TIER_2 English(EN) · /u/dh7net ·

    Testing the new prismML Bonsai Image 4B

    I just tested the new Bonsai Image 4B (ternary variant). It is super fast: 4.2 seconds per 1024×1024 image at 4 steps on a spark GX10. The results are bad for text, but surprisingly good for everything else. You can see by yo…