Audio Language Models
PulseAugur coverage of Audio Language Models — every cluster mentioning Audio Language Models across labs, papers, and developer communities, ranked by signal.
2 天有情绪数据
-
Researchers warn AI voice assistants vulnerable to hidden audio commands
Researchers have identified a significant security vulnerability in AI voice assistants and audio-language models. These systems, increasingly used as everyday interfaces, can be manipulated through imperceptible audio …
-
New architecture boosts audio language models' attention to salient sounds
Researchers have developed NAACA, a novel architecture designed to improve how audio language models process long audio recordings. NAACA uses a training-free approach with an Oscillatory Working Memory (OWM) to filter …
-
新AI方法可自动编码治疗会话
研究人员开发了一种新方法,利用音频语言模型(ALMs)自动编码动机性访谈(MI)会话。该方法分析口语和声学线索,整合来自多条推理路径的预测以提高准确性。多模态自洽性技术实现了46.40%的宏观F1分数,优于基线方法,并表明结合语言和非语言信号可提高MI编码的可靠性。