ENTITY Audio Language Models

Audio Language Models

PulseAugur coverage of Audio Language Models — every cluster mentioning Audio Language Models across labs, papers, and developer communities, ranked by signal.

Show in brief

Total · 30d

9 over 90d

Releases · 30d

0 over 90d

Papers · 30d

8 over 90d

TIER MIX · 90D

TOPICS

SENTIMENT · 30D

5 day(s) with sentiment data

RECENT · PAGE 1/1 · 9 TOTAL

TOOL · CL_105203 · Jun 22 · 12:28

New framework enhances in-context learning for clinical audio diagnosis

Researchers have developed a new framework called Federated Self-Contextualization (FSC) designed to improve in-context learning for audio-language models in clinical settings, particularly in low-resource environments.…
TOOL · CL_93750 · Jun 16 · 04:00

New framework enhances audio language models with trainable audio prompts

Researchers have developed a new framework for Audio Language Models (ALMs) that introduces trainable prompts directly into the audio encoder. This approach aims to capture task-specific acoustic features, enhancing few…
RESEARCH · CL_76796 · Jun 5 · 14:26

Audio language models improve speech emotion recognition with acoustic cues

Researchers have developed a method to improve speech emotion recognition in audio language models by incorporating explicit acoustic cues. By deriving six interpretable acoustic concept tokens from paralinguistic featu…
RESEARCH · CL_70436 · Jun 3 · 17:57

Audio-language models override clear audio with conflicting text

Researchers have identified a significant issue in audio-language models where conflicting text inputs override clear audio evidence, leading to incorrect outputs. A new study reveals that in 64.1% of conflict cases acr…
RESEARCH · CL_65876 · Jun 2 · 04:00

New tools enhance audio deepfake detection and analysis

Researchers have developed new tools and methods to combat audio deepfakes. AUDDT is an open-source toolkit designed to evaluate the generalization capabilities of deepfake detectors across a wide array of audio dataset…
TOOL · CL_53672 · May 27 · 04:00

New PitchBench Benchmark Reveals Unreliable Pitch Hearing in Audio-Language Models

Researchers have developed PitchBench, a new evaluation suite designed to systematically measure the pitch perception abilities of audio-language models (ALMs). The suite includes 28 experiments that test both absolute …
TOOL · CL_37352 · May 18 · 16:31

Researchers warn AI voice assistants vulnerable to hidden audio commands

Researchers have identified a significant security vulnerability in AI voice assistants and audio-language models. These systems, increasingly used as everyday interfaces, can be manipulated through imperceptible audio …
TOOL · CL_30734 · May 13 · 15:09

New architecture boosts audio language models' attention to salient sounds

Researchers have developed NAACA, a novel architecture designed to improve how audio language models process long audio recordings. NAACA uses a training-free approach with an Oscillatory Working Memory (OWM) to filter …
RESEARCH · CL_30795 · May 13 · 04:36

New AI method automates coding of therapy sessions

Researchers have developed a new method for automatically coding Motivational Interviewing (MI) sessions using audio-language models (ALMs). This approach analyzes both spoken words and acoustic cues, integrating predic…

New framework enhances in-context learning for clinical audio diagnosis

New framework enhances audio language models with trainable audio prompts

Audio language models improve speech emotion recognition with acoustic cues

Audio-language models override clear audio with conflicting text

New tools enhance audio deepfake detection and analysis

New PitchBench Benchmark Reveals Unreliable Pitch Hearing in Audio-Language Models

Researchers warn AI voice assistants vulnerable to hidden audio commands

New architecture boosts audio language models' attention to salient sounds

New AI method automates coding of therapy sessions