ENTITY
Multimodal Diffusion Transformer
Multimodal Diffusion Transformer
PulseAugur coverage of Multimodal Diffusion Transformer — every cluster mentioning Multimodal Diffusion Transformer across labs, papers, and developer communities, ranked by signal.
Total · 30d
2
2 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
2
2 over 90d
TIER MIX · 90D
TOPICS
SENTIMENT · 30D
1 day(s) with sentiment data
RECENT · PAGE 1/1 · 2 TOTAL
-
AudioX-Turbo framework enables efficient multimodal audio generation
Researchers have introduced AudioX-Turbo, a novel framework designed for efficient generation of audio from various multimodal inputs like text, video, and audio signals. The system employs a teacher-student distillatio…
-
UniSonate model unifies speech, music, and sound effect generation
Researchers have developed UniSonate, a novel unified framework for generating speech, music, and sound effects using natural language instructions. This model addresses the fragmentation in generative audio by reconcil…