PulseAugur
LIVE 04:18:09
ENTITY diffusion model

diffusion model

PulseAugur coverage of diffusion model — every cluster mentioning diffusion model across labs, papers, and developer communities, ranked by signal.

Total · 30d
51
51 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
50
50 over 90d
TIER MIX · 90D
SENTIMENT · 30D

3 day(s) with sentiment data

RECENT · PAGE 1/2 · 23 TOTAL
  1. TOOL · CL_27495 ·

    Cold diffusion tackles percussive audio dereverberation

    Researchers have developed a novel cold diffusion framework to address the challenge of dereverberating percussive audio signals, such as drums, which have been largely overlooked in favor of speech processing. This new…

  2. TOOL · CL_27730 ·

    New theory resolves instability in MeanFlow generative models

    Researchers have developed a theoretical framework to address instability issues in MeanFlow training, a one-step generative modeling technique. They identified that the conditional velocity field is misused in the loss…

  3. TOOL · CL_25766 ·

    New BRIDGE method improves local image editing by controlling mask influence

    Researchers have developed a new method called BRIDGE for local image editing, which aims to modify specific regions of an image while keeping the background intact. This approach tackles the issue of "mask-shape bias,"…

  4. TOOL · CL_20798 ·

    X-Cache accelerates world model inference for autonomous driving simulations

    Researchers have developed X-Cache, a novel method to accelerate the inference of autoregressive world models used in autonomous driving simulations. This technique caches residual computations across generation chunks …

  5. TOOL · CL_19909 ·

    AI researchers explore learning the integral of diffusion models

    A new paper explores the mathematical concept of integrating diffusion models, which are foundational to many generative AI systems. The research delves into the theoretical underpinnings of these models, potentially le…

  6. TOOL · CL_16005 ·

    StyleShield framework evades AI content detectors with controllable style transfer

    Researchers have developed StyleShield, a novel framework that manipulates text style in the continuous token embedding space to evade AI-generated content detectors. This method utilizes a DiT backbone with cross-atten…

  7. RESEARCH · CL_18705 ·

    Ortho-Hydra paper introduces new method to improve LoRA fine-tuning for diffusion transformers

    Researchers have introduced Ortho-Hydra, a novel re-parameterization technique designed to improve LoRA fine-tuning for diffusion transformers (DiT) on multi-style data. This method addresses the issue of 'style bleed' …

  8. RESEARCH · CL_15510 ·

    Mamoda2.5 model integrates multimodal AI with efficient DiT-MoE for top video editing

    Researchers have introduced Mamoda2.5, a unified AR-Diffusion framework designed for multimodal understanding and generation. This model utilizes a Diffusion Transformer backbone enhanced with a Mixture-of-Experts (MoE)…

  9. RESEARCH · CL_14333 ·

    New AI methods enhance time series forecasting accuracy and interpretability

    Researchers have introduced several new methods for time-series forecasting, aiming to improve accuracy and generalization. MeLISA, a latent-free autoregressive model, enhances rollout efficiency and long-horizon statis…

  10. RESEARCH · CL_14344 ·

    Video Generation with Predictive Latents

    Researchers have developed several new methods to improve the efficiency and quality of visual generative models. DC-DiT introduces dynamic chunking to Diffusion Transformers, adaptively compressing visual data for fast…

  11. RESEARCH · CL_11834 ·

    YOSE framework speeds up video object removal with token selection

    Researchers have developed YOSE, a new framework designed to significantly speed up video object removal using Diffusion Transformer (DiT) models. YOSE achieves this efficiency by adaptively selecting only the essential…

  12. RESEARCH · CL_11361 ·

    Researchers release TripVVT dataset and framework for in-the-wild video virtual try-on

    Researchers have introduced TripVVT, a new framework for in-the-wild video virtual try-on, addressing limitations caused by scarce data and improper mask usage. The system utilizes a Diffusion Transformer and a stable h…

  13. RESEARCH · CL_10493 ·

    Sora's architect Bill Peebles departs OpenAI citing commercialization and copyright issues

    Bill Peebles, a key figure behind OpenAI's Sora and the inventor of the DiT architecture, has departed the company. His exit is attributed to OpenAI's aggressive push towards an IPO, which has reportedly shifted the com…

  14. RESEARCH · CL_10165 ·

    Omni2Sound model unifies video, text to audio generation with new dataset

    Researchers have developed Omni2Sound, a unified diffusion model capable of generating audio from video, text, or a combination of both. The model addresses challenges in data scarcity and cross-task competition by intr…

  15. RESEARCH · CL_10149 ·

    New Keyframe-Driven Method Enhances Video Virtual Try-On Realism

    Researchers have introduced KeyTailor, a new framework designed to improve video virtual try-on (VVT) by addressing challenges in capturing garment dynamics and maintaining background consistency. The method utilizes a …

  16. RESEARCH · CL_09744 ·

    X-WAM model unifies robotic action and 4D world synthesis with asynchronous denoising

    Researchers have developed X-WAM, a novel Unified 4D World Model designed to integrate real-time robotic action execution with high-fidelity 4D world synthesis. This framework addresses limitations in previous models by…

  17. RESEARCH · CL_08581 ·

    UniSER foundation model unifies soft effects removal in images

    Researchers have developed UniSER, a novel foundation model designed to address a variety of soft visual degradations in digital images, such as lens flare, haze, shadows, and reflections. Unlike previous specialized mo…

  18. RESEARCH · CL_09783 ·

    MetaSR framework uses Diffusion Transformer for adaptive metadata in generative super-resolution

    Researchers have developed MetaSR, a novel framework for generative super-resolution that adaptively selects and injects relevant metadata to enhance image and video quality. This Diffusion Transformer-based approach is…

  19. RESEARCH · CL_06435 ·

    AI research advances 3D asset generation and anomaly detection for autonomous driving

    Researchers have developed a novel approach called GenAssets for generating high-quality 3D assets from in-the-wild LiDAR and camera data, crucial for autonomous driving simulations. This method utilizes a "reconstruct-…

  20. RESEARCH · CL_06609 ·

    Audio-Omni framework unifies audio generation, editing, and understanding

    Researchers have introduced Audio-Omni, a novel framework designed to unify audio understanding, generation, and editing across diverse domains like speech, music, and general sounds. This system integrates a frozen Mul…