HiDream-ai has released HiDream-O1-Image, an 8-billion parameter image generation model built on a Pixel-level Unified Transformer architecture. This model natively handles raw pixels and text without external VAEs or separate text encoders, enabling tasks like text-to-image generation, image editing, and subject-driven personalization at resolutions up to 2,048x2,048. The model also features a Reasoning-Driven Prompt Agent for enhanced generation and has achieved a high ranking on the Artificial Analysis Text to Image Arena. AI
IMPACT Offers a new open-weights option for high-resolution image generation and editing tasks.
RANK_REASON Open-source model release from a non-frontier lab with technical report and benchmark performance. [lever_c_demoted from research: ic=1 ai=1.0]
Read on Hugging Face Trending Models →
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →