PulseAugur
实时 22:22:10
实体 Diffusion Transformer

Diffusion Transformer

PulseAugur coverage of Diffusion Transformer — every cluster mentioning Diffusion Transformer across labs, papers, and developer communities, ranked by signal.

Show in brief
总计 · 30天
27
90 天内 27
发布 · 30天
0
90 天内 0
论文 · 30天
25
90 天内 25
层级分布 · 90 天
情绪 · 30 天

3 天有情绪数据

最近 · 第 2/2 页 · 共 27 条
  1. RESEARCH · CL_10165 ·

    Omni2Sound model unifies video, text to audio generation with new dataset

    Researchers have developed Omni2Sound, a unified diffusion model capable of generating audio from video, text, or a combination of both. The model addresses challenges in data scarcity and cross-task competition by intr…

  2. RESEARCH · CL_10149 ·

    New Keyframe-Driven Method Enhances Video Virtual Try-On Realism

    Researchers have introduced KeyTailor, a new framework designed to improve video virtual try-on (VVT) by addressing challenges in capturing garment dynamics and maintaining background consistency. The method utilizes a …

  3. RESEARCH · CL_09744 ·

    X-WAM model unifies robotic action and 4D world synthesis with asynchronous denoising

    Researchers have developed X-WAM, a novel Unified 4D World Model designed to integrate real-time robotic action execution with high-fidelity 4D world synthesis. This framework addresses limitations in previous models by…

  4. RESEARCH · CL_08581 ·

    UniSER foundation model unifies soft effects removal in images

    Researchers have developed UniSER, a novel foundation model designed to address a variety of soft visual degradations in digital images, such as lens flare, haze, shadows, and reflections. Unlike previous specialized mo…

  5. RESEARCH · CL_09783 ·

    MetaSR framework uses Diffusion Transformer for adaptive metadata in generative super-resolution

    Researchers have developed MetaSR, a novel framework for generative super-resolution that adaptively selects and injects relevant metadata to enhance image and video quality. This Diffusion Transformer-based approach is…

  6. RESEARCH · CL_06609 ·

    Audio-Omni framework unifies audio generation, editing, and understanding

    Researchers have introduced Audio-Omni, a novel framework designed to unify audio understanding, generation, and editing across diverse domains like speech, music, and general sounds. This system integrates a frozen Mul…

  7. RESEARCH · CL_06501 ·

    New REDEdit framework enables mask-free local image editing with diffusion transformers

    Researchers have developed REDEdit, a novel adapter framework designed to enhance the precision of local image editing in large diffusion transformers (DiTs). This system retrofits existing DiTs without altering their c…