PulseAugur
实时 15:13:56
English(EN) Quality Adaptive Angular Margin Learning for Respiratory Sound Classification

AI模型通过新技术推进呼吸音分类

两篇新研究论文提出了用于呼吸音分类的先进AI技术。一篇论文介绍了QLung,一个质量自适应框架,它根据录音质量调整学习裕度,从而提高了在ICBHI和SPRSound数据集上的性能。另一篇论文Lung-SRAD,探索了状态空间模型作为Transformer在该任务上的替代方案,并结合了频谱感知正则化和对比学习,在ICBHI基准测试上比基线方法提高了5%。 AI

影响 这些新颖的AI方法有望为呼吸系统疾病带来更准确、更鲁棒的诊断工具。

排序理由 两篇新的arXiv论文详细介绍了用于呼吸音分类的新型AI方法。

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 4 个来源。 我们如何撰写摘要 →

报道来源 [4]

  1. arXiv cs.AI TIER_1 English(EN) · Yoon Tae Kim, Heejoon Koo, Miika Toikkanen, June-Woo Kim ·

    Quality Adaptive Angular Margin Learning for Respiratory Sound Classification

    arXiv:2606.11915v1 Announce Type: cross Abstract: We present a quality-adaptive angular-margin learning framework that improves feature generalization by enforcing intra-class compactness and inter-class separability. Our framework, titled QLung, introduces a no-reference audio q…

  2. arXiv cs.AI TIER_1 English(EN) · Hemansh Shridhar, Miika Toikkanen, June-Woo Kim ·

    Lung-SRAD: Spectral-Aware Regularized Audio DASS with Dual-Axis Patch-Mix Contrastive Learning for Respiratory Sound Classification

    arXiv:2606.11922v1 Announce Type: cross Abstract: Recent respiratory sound classification (RSC) studies largely rely on CLS-token driven self-attention architectures such as the Audio Spectrogram Transformer (AST). While effective at modeling global context, recent analyses sugge…

  3. arXiv cs.AI TIER_1 English(EN) · June-Woo Kim ·

    Lung-SRAD:具有双轴块混合对比学习的频谱感知正则化音频DASS用于呼吸音分类

    Recent respiratory sound classification (RSC) studies largely rely on CLS-token driven self-attention architectures such as the Audio Spectrogram Transformer (AST). While effective at modeling global context, recent analyses suggest a low-pass filtering behavior that may reduce s…

  4. arXiv cs.AI TIER_1 English(EN) · June-Woo Kim ·

    面向呼吸音分类的质量自适应角度裕度学习

    We present a quality-adaptive angular-margin learning framework that improves feature generalization by enforcing intra-class compactness and inter-class separability. Our framework, titled QLung, introduces a no-reference audio quality margin derived from spectral entropy and ro…