中文(ZH) 全员本科生！何恺明组新作：文生图，258M参数就够了

Kaiming He's undergraduate team unveils MiniT2I text-to-image model with 258M parameters

By PulseAugur Editorial · [1 sources] · 2026-06-18 09:12

Researchers, including a team led by Kaiming He and composed primarily of undergraduate students, have introduced MiniT2I, a novel text-to-image generation model. This model achieves competitive results with significantly fewer parameters (258M) and lower training costs, comparable to standard ImageNet experiments. MiniT2I utilizes a new MM-JiT architecture that operates directly in pixel space, eliminating the need for VAEs and simplifying the diffusion process by removing mechanisms like AdaLN, which are common in other large-scale text-to-image models. AI

IMPACT Demonstrates a path to more efficient text-to-image generation, potentially lowering barriers for research and development.

RANK_REASON New research paper detailing a novel model architecture and its performance. [lever_c_demoted from research: ic=1 ai=1.0]

Read on 量子位 (QbitAI) →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

量子位 (QbitAI) TIER_1 中文(ZH) · henry · 2026-06-18 09:12

All undergraduates! He Kaiming's new work: Text-to-image, 258M parameters are enough

整篇论文一共六位作者。除了何恺明之外，其余五位都还是本科生。

COVERAGE [1]

All undergraduates! He Kaiming's new work: Text-to-image, 258M parameters are enough

RELATED ENTITIES

RELATED TOPICS