实体 Diffusion Transformer

Diffusion Transformer

PulseAugur coverage of Diffusion Transformer — every cluster mentioning Diffusion Transformer across labs, papers, and developer communities, ranked by signal.

Show in brief

总计 · 30天

90 天内 27

发布 · 30天

90 天内 0

论文 · 30天

90 天内 25

层级分布 · 90 天

research 10
tool 16
meme 1

情绪 · 30 天

3 天有情绪数据

最近 · 第 2/2 页 · 共 27 条

RESEARCH · CL_10165 · Apr 30 · 04:00

Omni2Sound model unifies video, text to audio generation with new dataset

Researchers have developed Omni2Sound, a unified diffusion model capable of generating audio from video, text, or a combination of both. The model addresses challenges in data scarcity and cross-task competition by intr…
RESEARCH · CL_10149 · Apr 30 · 04:00

New Keyframe-Driven Method Enhances Video Virtual Try-On Realism

Researchers have introduced KeyTailor, a new framework designed to improve video virtual try-on (VVT) by addressing challenges in capturing garment dynamics and maintaining background consistency. The method utilizes a …
RESEARCH · CL_09744 · Apr 29 · 14:01

X-WAM model unifies robotic action and 4D world synthesis with asynchronous denoising

Researchers have developed X-WAM, a novel Unified 4D World Model designed to integrate real-time robotic action execution with high-fidelity 4D world synthesis. This framework addresses limitations in previous models by…
RESEARCH · CL_08581 · Apr 29 · 04:00

UniSER foundation model unifies soft effects removal in images

Researchers have developed UniSER, a novel foundation model designed to address a variety of soft visual degradations in digital images, such as lens flare, haze, shadows, and reflections. Unlike previous specialized mo…
RESEARCH · CL_09783 · Apr 29 · 02:58

MetaSR framework uses Diffusion Transformer for adaptive metadata in generative super-resolution

Researchers have developed MetaSR, a novel framework for generative super-resolution that adaptively selects and injects relevant metadata to enhance image and video quality. This Diffusion Transformer-based approach is…
RESEARCH · CL_06609 · Apr 28 · 04:00

Audio-Omni framework unifies audio generation, editing, and understanding

Researchers have introduced Audio-Omni, a novel framework designed to unify audio understanding, generation, and editing across diverse domains like speech, music, and general sounds. This system integrates a frozen Mul…
RESEARCH · CL_06501 · Apr 28 · 04:00

New REDEdit framework enables mask-free local image editing with diffusion transformers

Researchers have developed REDEdit, a novel adapter framework designed to enhance the precision of local image editing in large diffusion transformers (DiTs). This system retrofits existing DiTs without altering their c…

Omni2Sound model unifies video, text to audio generation with new dataset

New Keyframe-Driven Method Enhances Video Virtual Try-On Realism

X-WAM model unifies robotic action and 4D world synthesis with asynchronous denoising

UniSER foundation model unifies soft effects removal in images

MetaSR framework uses Diffusion Transformer for adaptive metadata in generative super-resolution

Audio-Omni framework unifies audio generation, editing, and understanding

New REDEdit framework enables mask-free local image editing with diffusion transformers