MM-DiT

PulseAugur coverage of MM-DiT: every cluster mentioning MM-DiT across labs, papers, and developer communities, ranked by signal.

Total · 30d: 5 (5 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 5 (5 over 90d)

RECENT · PAGE 1/1 · 5 TOTAL

TOOL · CL_111804 · Jun 26 · 04:00

New method enables text-and-image-to-image generation without retraining

Researchers have developed TF-TI2I, a novel method for text-and-image-to-image generation that adapts existing text-to-image models without requiring further training. This approach leverages the MM-DiT architecture, en…

TOOL · CL_56563 · May 28 · 04:00

New method enables open-vocabulary scene text editing with style consistency

Researchers have developed a novel self-prompting method for editing scene text in images, addressing limitations of existing approaches that neglect visual details of target regions and are constrained by pre-trained g…

RESEARCH · CL_41800 · May 20 · 02:55

New method improves AI portrait generation by balancing alignment, realism, and aesthetics

Researchers have developed a new method to improve human portrait generation in text-to-image diffusion models, addressing the common trade-offs between text-image alignment, realism, and aesthetics. Their approach uses…

RESEARCH · CL_08432 · Apr 29 · 02:23

Galaxy General LDA-1B model unifies diverse data for embodied AI's GPT-2 moment

Galaxy General LDA has introduced LDA-1B, a 1.6 billion parameter model designed to unify the utilization of diverse data sources for embodied AI. This model employs a novel World-Action Fusion approach, enabling it to …

RESEARCH · CL_04991 · Apr 24 · 04:26

UniSonate model unifies speech, music, and sound effect generation

Researchers have developed UniSonate, a novel unified framework for generating speech, music, and sound effects using natural language instructions. This model addresses the fragmentation in generative audio by reconcil…