PulseAugur
EN
LIVE 10:38:44

HiDream-ai releases 8B image model with unified pixel and text transformer

HiDream-ai has released HiDream-O1-Image, an 8-billion parameter image generation model built on a Pixel-level Unified Transformer architecture. This model natively handles raw pixels and text without external VAEs or separate text encoders, enabling tasks like text-to-image generation, image editing, and subject-driven personalization at resolutions up to 2,048x2,048. The model also features a Reasoning-Driven Prompt Agent for enhanced generation and has achieved a high ranking on the Artificial Analysis Text to Image Arena. AI

IMPACT Offers a new open-weights option for high-resolution image generation and editing tasks.

RANK_REASON Open-source model release from a non-frontier lab with technical report and benchmark performance. [lever_c_demoted from research: ic=1 ai=1.0]

Read on Hugging Face Trending Models →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. Hugging Face Trending Models TIER_1 English(EN) · HiDream-ai ·

    HiDream-ai/HiDream-O1-Image

    image-text-to-image · 24,939 downloads · 427 likes