PulseAugur

MM-DiT

PulseAugur coverage of MM-DiT: every cluster mentioning MM-DiT across labs, papers, and developer communities, ranked by signal.

Total · 30d: 5 (5 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 5 (5 over 90d)
[Charts: tier mix (90d), topics, sentiment (30d). 1 day with sentiment data.]

RECENT · PAGE 1/1 · 5 TOTAL
  1. TOOL · CL_111804

    New method enables text-and-image-to-image generation without retraining

    Researchers have developed TF-TI2I, a novel method for text-and-image-to-image generation that adapts existing text-to-image models without requiring further training. This approach leverages the MM-DiT architecture, en…

  2. TOOL · CL_56563

    New method enables open-vocabulary scene text editing with style consistency

    Researchers have developed a novel self-prompting method for editing scene text in images, addressing limitations of existing approaches that neglect visual details of target regions and are constrained by pre-trained g…

  3. RESEARCH · CL_41800

    New method improves AI portrait generation by balancing alignment, realism, and aesthetics

    Researchers have developed a new method to improve human portrait generation in text-to-image diffusion models, addressing the common trade-offs between text-image alignment, realism, and aesthetics. Their approach uses…

  4. RESEARCH · CL_08432

    Galaxy General LDA-1B model unifies diverse data for embodied AI's GPT-2 moment

    Galaxy General LDA has introduced LDA-1B, a 1.6-billion-parameter model designed to unify the use of diverse data sources for embodied AI. This model employs a novel World-Action Fusion approach, enabling it to …

  5. RESEARCH · CL_04991

    UniSonate model unifies speech, music, and sound effect generation

    Researchers have developed UniSonate, a novel unified framework for generating speech, music, and sound effects using natural language instructions. This model addresses the fragmentation in generative audio by reconcil…