PulseAugur
EN
LIVE 07:41:49

New framework TurnNat automatically evaluates spoken dialogue turn-taking naturalness

Researchers have introduced TurnNat, a novel framework designed to automatically evaluate the naturalness of turn-taking in spoken dialogue systems. This system utilizes a causal prediction model to estimate future voice activity states between two speakers, with the negative log-likelihood of observed activity serving as a measure of timing atypicality. TurnNat aggregates these scores over turn-taking boundary units to produce a dialogue-level naturalness score, and has demonstrated its effectiveness in identifying unnatural turn-taking in controlled experiments. AI

IMPACT This framework could improve the naturalness and user experience of full-duplex spoken dialogue systems.

RANK_REASON The cluster contains a research paper detailing a new framework for evaluating spoken dialogue systems. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.AI →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

New framework TurnNat automatically evaluates spoken dialogue turn-taking naturalness

COVERAGE [1]

  1. arXiv cs.AI TIER_1 English(EN) · Hao Zhang, Thomas Thebaud, Georgi Tinchev, Venkatesh Ravichandran, Laureano Moro-Velazquez ·

    TurnNat: Automatic Evaluation of Turn-Taking Naturalness in Dyadic Spoken Dialogue

    arXiv:2607.01345v1 Announce Type: cross Abstract: Turn-taking naturalness is central to full-duplex spoken dialogue systems, yet its automatic evaluation remains limited. Existing evaluations often rely on human judgments or behavior-specific timing metrics, making it difficult t…