PulseAugur
实时 15:35:37

CARVE软件通过重采样增强聚类分析验证

研究人员推出CARVE,一个开源软件包,旨在改进聚类分析结果的验证和探索。CARVE解决了聚类结果对算法和超参数选择的敏感性问题,而这种敏感性常常阻碍科学发现的可重复性。该软件包在多个层面提供稳定性和泛化性诊断,并提供原则性的选择规则,在合成和真实世界生物数据上表现优于传统的验证指标。 AI

影响 提高了从数据聚类中获得的科学发现的可重复性。

排序理由 该集群包含一篇详细介绍用于统计分析的新开源软件包的研究论文。

在 arXiv stat.ML 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

报道来源 [2]

  1. arXiv stat.ML TIER_1 English(EN) · Kai R. Wycik, Tiffany M. Tang, Tarek M. Zikry, Genevera I. Allen ·

    使用重采样进行验证和探索的聚类分析 (CARVE)

    arXiv:2606.00327v1 Announce Type: cross Abstract: Clustering is widely used across the sciences as the foundation for downstream data-driven scientific discoveries. However, clustering results are highly sensitive to the choice of algorithm, preprocessing, and the number of clust…

  2. arXiv stat.ML TIER_1 English(EN) · Genevera I. Allen ·

    Cluster Analysis with Resampling for Validation and Exploration (CARVE)

    Clustering is widely used across the sciences as the foundation for downstream data-driven scientific discoveries. However, clustering results are highly sensitive to the choice of algorithm, preprocessing, and the number of clusters $k$, producing scientific claims that are ofte…