PulseAugur / Brief

Brief

last 24h
[2/2] 221 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

  1. Efficient-Large-Model/SANA-WM_bidirectional

    Researchers have released SANA-WM, an open-source world model that generates minute-long videos at 720p resolution. The diffusion transformer uses a hybrid linear attention mechanism and a dual-branch architecture for precise camera control. A two-stage generation pipeline adds a refiner to improve quality and temporal consistency, and the model was trained on data from an annotation pipeline that provides metric-scale 6-DoF camera poses.

    IMPACT Enables longer, high-fidelity video generation with precise camera control, with potential uses in content generation and simulation.

  2. ResembleAI/Dramabox

    Resemble AI has released Dramabox, an expressive text-to-speech model built on the audio branch of Lightricks' LTX-2. It offers prompt-driven control over speaker identity, emotion, and delivery, plus optional voice cloning from a 10-second reference. Dramabox is an IC-LoRA fine-tune of the LTX-2.3 3.3B model, conditioned on Gemma 3 12B text embeddings.

    IMPACT Enables more nuanced and expressive AI-generated speech with voice cloning capabilities.