New frameworks enable real-time streaming audio-video generation and narration

By PulseAugur Editorial · [2 sources] · 2026-05-25 00:00

Researchers have developed two new frameworks, StreamChar and FlowNar, designed for real-time audio-video generation and narration for long-form videos. StreamChar utilizes an LLM-based orchestrator and a joint audio-video diffusion transformer to achieve efficient, real-time character animation generation. FlowNar addresses scalability challenges in streaming video narration by employing dynamic context management and a novel Cross Linear Attentive Memory module to maintain bounded computational complexity and improve narration quality. AI

IMPACT These frameworks advance real-time AI capabilities for video content creation and analysis, potentially enabling more dynamic and interactive applications.

RANK_REASON Two distinct research papers introducing new frameworks for real-time audio-video generation and narration.

Read on Hugging Face Daily Papers →

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

New frameworks enable real-time streaming audio-video generation and narration

COVERAGE [2]

Hugging Face Daily Papers TIER_1 English(EN) · 2026-05-25 00:00

StreamChar: Long-Horizon Streaming Character Audio-Video Generation with Decoupled Orchestration

StreamChar enables real-time streaming audio-video generation for character animation by separating long-horizon orchestration from short-window denoising through an LLM-based orchestrator and joint audio-video DiT, achieving efficient deployment via two-stage distillation and ma…
arXiv cs.CV TIER_1 English(EN) · Zeyun Zhong, Manuel Martin, Chengzhi Wu, David Schneider, Frederik Diederichs, Juergen Gall, Juergen Beyerer · 2026-06-02 04:00

FlowNar: Scalable Streaming Narration for Long-Form Videos

arXiv:2606.00620v1 Announce Type: new Abstract: Recent Large Multimodal Models (LMMs), primarily designed for offline settings, are ill-suited for the dynamic requirements of streaming video. While recent online adaptations improve real-time processing, they still face critical s…

COVERAGE [2]

StreamChar: Long-Horizon Streaming Character Audio-Video Generation with Decoupled Orchestration

FlowNar: Scalable Streaming Narration for Long-Form Videos

RELATED ENTITIES

RELATED TOPICS