中文(ZH) 会写字、懂排版、能分镜：HiDream-O1-Image-1.5 全能图像生成能力解析

HiDream.ai's HiDream-O1-Image-1.5 leads image generation with native multi-modal architecture

By PulseAugur Editorial · [1 sources] · 2026-06-10 15:43

HiDream.ai has released its commercial image generation model, HiDream-O1-Image-1.5, which has achieved top rankings on the Artificial Analysis Text to Image Leaderboard. The model excels in complex tasks such as rendering text, detailed scene composition, and multi-subject consistency, surpassing many international competitors. This advancement is attributed to its novel native multi-modal architecture, Unified Transformer (UiT), which integrates various data types at a foundational level, moving beyond traditional modular approaches. AI

IMPACT Sets a new benchmark for complex image generation tasks, potentially accelerating adoption of native multi-modal architectures in creative industries.

RANK_REASON New commercial model release from a company achieving top benchmark scores, highlighting a novel architecture. [lever_c_demoted from significant: ic=1 ai=1.0]

Read on 雷峰网 (Leiphone) →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

HiDream.ai's HiDream-O1-Image-1.5 leads image generation with native multi-modal architecture

COVERAGE [1]

雷峰网 (Leiphone) TIER_1 中文(ZH) · 2026-06-10 15:43

Can Write, Understand Layout, and Create Storyboards: An Analysis of HiDream-O1-Image-1.5's All-Around Image Generation Capabilities

<p>近日，智象未来（HiDream.ai）全新推出的商用版图像生成模型 HiDream-O1-Image-1.5 再次实现 SOTA，在全球知名独立 AI 模型评测与分析平台 Artificial Analysis的文生图榜单（Text to Image Leaderboard）上，一举登上中国图像生成模型第一，成为评分仅次于OpenAI的中国大模型公司，超越Google Nano Banana 2（Gemini 3.1 Flash Image Preview）、NVIDIA Cosmos3-Super-Text2Image 和字节跳动的Seedre…

COVERAGE [1]

Can Write, Understand Layout, and Create Storyboards: An Analysis of HiDream-O1-Image-1.5's All-Around Image Generation Capabilities

RELATED ENTITIES

RELATED TOPICS