PulseAugur
实时 09:10:46

新型AI模型SelectTSL实现选择性声音定位

研究人员推出SelectTSL,这是一种新颖的端到端架构,专为复杂声学环境中的提示引导式选择性声音定位而设计。该系统通过提取目标声音并保留空间信息以实现精确本地化,克服了现有方法的局限性。SelectTSL利用提示引导式选择性注意力模块生成受提示信息影响的嵌入,然后这些嵌入会精炼相位线索并估计到达方向和声源基数,从而有效地关注用户指定的空间线索并处理不同数量的目标声源。 AI

影响 引入了一种新的选择性声音定位方法,有望提高AI在嘈杂环境中聚焦特定音频源的能力。

排序理由 详细介绍新AI模型和方法的学术论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

新型AI模型SelectTSL实现选择性声音定位

报道来源 [2]

  1. arXiv cs.AI TIER_1 English(EN) · Ziyang Jiang, Yu Chen, Zexu Pan, Xinyuan Qian, Bowen Xing, Ivor W. Tsang, Xu-Cheng Yin, Haizhou Li ·

    SelectTSL: Prompt-Guided Selective Target Sound Localization in Complex Scenarios

    arXiv:2607.02343v1 Announce Type: cross Abstract: Humans can selectively attend to a target sound and estimate its direction in complex scenarios, whereas such selective localization remains challenging for current deep learning-based systems. Sound source localization (SSL) has …

  2. arXiv cs.AI TIER_1 English(EN) · Haizhou Li ·

    SelectTSL: Prompt-Guided Selective Target Sound Localization in Complex Scenarios

    Humans can selectively attend to a target sound and estimate its direction in complex scenarios, whereas such selective localization remains challenging for current deep learning-based systems. Sound source localization (SSL) has achieved remarkable success with deep learning, ye…