Diffusion Large Language Models
PulseAugur coverage of Diffusion Large Language Models — every cluster mentioning Diffusion Large Language Models across labs, papers, and developer communities, ranked by signal.
3 day(s) with sentiment data
-
New methods accelerate Diffusion LLMs, addressing speed-quality trade-offs · 3 sources tracked
Researchers are developing new methods to accelerate Diffusion Large Language Models (dLLMs), which are computationally intensive due to their sequence length scaling. Two new frameworks, Dynamic-dLLM and Streaming-dLLM…
-
New 7B Uniform Diffusion Language Model 'Sumi' Released, Alongside Diffusion Model Advancements
Researchers have introduced Sumi, a 7-billion parameter uniform diffusion language model (UDLM) pretrained from scratch on 1.5 trillion tokens. This open-source model demonstrates competitive performance against autoreg…
-
FOCUS system boosts DLLM inference speed by 3.5x
Researchers have developed a new inference system called FOCUS designed to improve the efficiency of Diffusion Large Language Models (DLLMs). This system addresses the high decoding costs associated with DLLMs by dynami…
-
New CreditDecoding Method Accelerates Diffusion LLM Text Generation
Researchers have developed a new method called CreditDecoding to accelerate the text generation process in diffusion large language models (dLLMs). This technique addresses an inefficiency where models predict correct t…
-
New benchmarks and methods advance multimodal LLM capabilities
Researchers are developing new methods for multimodal large language models (MLLMs) to improve their understanding of sequential audio-video data and large-scale visual recognition. One approach, DLLM-VSR, uses diffusio…
-
New D^2-Monitor system enhances safety for diffusion LLMs
Researchers have introduced $D^2$-Monitor, a novel safety monitoring system designed for diffusion large language models (D-LLMs). This system addresses the unique challenges of monitoring D-LLMs, which generate text th…
-
TAD framework boosts diffusion LLM speed and accuracy
Researchers have introduced TAD, a Temporal-Aware trajectory self-Distillation framework designed to improve the speed and accuracy of diffusion large language models (dLLMs). TAD addresses the common trade-off where fa…