ENTITY Vad

Vad

PulseAugur coverage of Vad — every cluster mentioning Vad across labs, papers, and developer communities, ranked by signal.

Total · 30d

5

5 over 90d

Releases · 30d

0

0 over 90d

Papers · 30d

4

4 over 90d

TIER MIX · 90D

TOPICS

SENTIMENT · 30D

2 day(s) with sentiment data

RECENT · PAGE 1/1 · 5 TOTAL

RESEARCH · CL_109476 · Jun 23 · 00:00

Wan-Streamer v0.1: Unified model for real-time audio-visual interaction

Researchers have introduced Wan-Streamer v0.1, a novel end-to-end multimodal foundation model designed for real-time, low-latency audio-visual interaction. Unlike traditional cascaded systems, Wan-Streamer integrates la…
RESEARCH · CL_93101 · Jun 15 · 07:50

GraphBEV++ framework tackles feature misalignment in autonomous driving perception

Researchers have introduced GraphBEV++, a novel framework designed to tackle feature misalignment in Bird's-Eye View (BEV) perception for autonomous driving systems. The framework employs two main modules: LocalAlign-v2…
COMMENTARY · CL_23142 · May 8 · 14:27

Voice AI paradox: Advanced chat, basic failures

Voice AI assistants like Yandex's Alisa exhibit a paradox of advanced conversational abilities alongside basic functional failures, stemming from their complex architecture. This hybrid system combines speech recognitio…
RESEARCH · CL_15496 · May 4 · 16:01

Unified Map Prior Encoder enhances autonomous driving mapping and planning

Researchers have developed a Unified Map Prior Encoder (UMPE) designed to integrate diverse map data, such as HD/SD vector maps, rasterized maps, and satellite imagery, into autonomous driving systems. This encoder addr…
RESEARCH · CL_11761 · Apr 28 · 04:00

New LLMs unify audio and language processing for full-duplex and medical applications

Researchers have developed UAF, a novel unified audio front-end LLM designed for full-duplex speech interaction. This model integrates diverse audio front-end tasks like voice activity detection and turn-taking into a s…