ENTITY Diffusion Transformer

PulseAugur coverage of Diffusion Transformer — every cluster mentioning Diffusion Transformer across labs, papers, and developer communities, ranked by signal.

Total · 30d

77 over 90d

Releases · 30d

0 over 90d

Papers · 30d

66 over 90d

TIER MIX · 90D

frontier release 1
significant 1
research 38
tool 36
meme 1

RELATIONSHIPS

used by Bernini 70%

SENTIMENT · 30D

19 day(s) with sentiment data

RECENT · PAGE 1/4 · 77 TOTAL

SIGNIFICANT · CL_113064 · Jun 27 · 01:00

Alibaba releases open-source Wan 2.1 video generation suite

Alibaba's Wan team has released Wan 2.1, an open-source video generation model suite that aims to make high-quality video generation more accessible. The suite includes capabilities for text-to-video, image-to-video, an…

RESEARCH · CL_111635 · Jun 25 · 17:51

RayPE encoding boosts 3D awareness in video generation models

Researchers have developed RayPE, a novel positional encoding method for video diffusion transformers that enhances 3D awareness. Unlike existing methods that use camera grid coordinates, RayPE incorporates 6D Plücker c…

RESEARCH · CL_111561 · Jun 25 · 11:29

New Diffusion Transformer framework enhances pattern-preserving attribute retrieval

Researchers have introduced a novel framework called MO-DiT+HPPO for pattern-preserving attribute retrieval. This method uses a diffusion transformer to generate query embeddings that satisfy specific attributes while m…

TOOL · CL_110009 · Jun 25 · 04:00

New CCUA method boosts AI image generation for rare classes

Researchers have developed a new method called Contrastive Conditional-Unconditional Alignment (CCUA) to improve the quality and diversity of images generated by diffusion models, particularly for classes with limited t…

RESEARCH · CL_111525 · Jun 25 · 00:00

PhysiFormer uses coordinate-space diffusion for physically plausible 3D object motion simulation · 3 sources tracked

Researchers have developed PhysiFormer, a novel diffusion transformer capable of simulating physically plausible 3D object motions. Unlike previous methods that operate in pixel space, PhysiFormer works directly with 3D…

TOOL · CL_108449 · Jun 24 · 09:33

PID method enhances Krea 2 image generation quality

A new method called PID (Pixel Diffusion) has been developed to improve image generation quality in Krea 2, which previously used a suboptimal VAE. PID decodes images directly in pixel space, bypassing the VAE to enhanc…

RESEARCH · CL_109621 · Jun 24 · 00:00

TryOnCrafter framework enables camera-controllable video virtual try-on · 3 sources tracked

Researchers have introduced TryOnCrafter, a novel framework for camera-controllable video virtual try-on. This system moves beyond existing methods by decoupling human subjects from their environments using a renderable…

RESEARCH · CL_107583 · Jun 23 · 00:00

DiffusionBench benchmark and NanoGen framework challenge image generation evaluation

Researchers have introduced DiffusionBench, a new benchmark designed to holistically evaluate diffusion transformers (DiTs) used in image generation. The benchmark highlights that current evaluation methods, primarily f…

RESEARCH · CL_105105 · Jun 22 · 17:19

New methods enhance text-to-image generation with improved rewards and simplified models

Researchers have developed new methods for improving text-to-image generation models. DiT-Reward, a novel approach, leverages pretrained Diffusion Transformers to create reward models that outperform existing methods on…

TOOL · CL_106837 · Jun 22 · 17:11

Vera layered diffusion model enhances video editing with content preservation

Researchers have introduced Vera, a novel layered diffusion framework designed for content-preserving video editing. Unlike existing methods that regenerate entire videos, Vera focuses on generating an edit layer and an…

TOOL · CL_105138 · Jun 22 · 12:37

New SteerVTE framework enables precise video text editing

Researchers have introduced SteerVTE, a novel framework designed for precise text editing within videos. This system leverages a frozen video diffusion model, enhanced by a lightweight adapter that captures the original…

RESEARCH · CL_105132 · Jun 22 · 00:00

New AI frameworks enhance video editing with content preservation and real-time capabilities

Researchers have developed new frameworks for video editing, addressing limitations in current automated systems. VideoAgent offers an all-in-one solution for diverse video comprehension and editing tasks, utilizing a m…

RESEARCH · CL_105015 · Jun 22 · 00:00

MeshFlow generates triangle meshes 18x faster using equivariant flow matching · 2 sources tracked

Researchers have developed MeshFlow, a novel method for generating triangle meshes using equivariant optimal-transport flow matching models. This approach directly models triangle soups, respecting symmetries like verte…

TOOL · CL_104729 · Jun 20 · 20:34

New Delta-Diffusion model synthesizes longitudinal brain amyloid-PET data

Researchers have developed Delta-Diffusion, a new framework for synthesizing longitudinal brain amyloid-PET imaging data. This method uses a conditional Poisson Diffusion Bridge process, anchored to a subject's baseline…

TOOL · CL_100206 · Jun 19 · 04:00

TetriServe system improves DiT model serving efficiency

Researchers have developed TetriServe, a novel system designed to efficiently serve Diffusion Transformer (DiT) models, which are computationally intensive for image generation. Traditional serving methods struggle with…

TOOL · CL_100170 · Jun 19 · 04:00

EndoCoT framework enhances diffusion models' reasoning with MLLMs

Researchers have introduced EndoCoT, a new framework designed to enhance the reasoning capabilities of diffusion models when integrated with Multimodal Large Language Models (MLLMs). The framework addresses limitations …

RESEARCH · CL_99594 · Jun 18 · 11:20

Hybrid diffusion transformer enhances instruction-guided audio editing

Researchers have developed a novel hybrid diffusion transformer architecture for instruction-guided audio editing. This two-stage approach, based on rectified flow matching, aims to improve both the accuracy and efficie…

FRONTIER RELEASE · CL_104102 · Jun 18 · 04:50

Krea.ai releases Krea 2 text-to-image models

Krea.ai, Inc. has released two new text-to-image diffusion models, Krea 2 Raw and Krea 2 Turbo, both featuring a Diffusion Transformer architecture with 12 billion parameters. The Raw version is intended as a base for f…

TOOL · CL_98010 · Jun 18 · 04:00

Ghost Attractor Networks offer efficient sequential generation with stable latent structures

Researchers have introduced Ghost Attractor Networks (GANs), a novel dynamical decoder designed to improve sequential generation efficiency and control in large-scale models. GANs utilize a learned potential with a basi…

RESEARCH · CL_97838 · Jun 17 · 12:31

Spotlight system cuts DiT RL post-training costs using spot GPUs

Researchers have developed Spotlight, a novel system designed to significantly reduce the cost of post-training Diffusion Transformers (DiTs) for reinforcement learning. By leveraging insights into exploration tolerance…
