PulseAugur
EN
LIVE 14:32:14

CARVE software enhances cluster analysis validation with resampling

Researchers have introduced CARVE, an open-source software package designed to improve the validation and exploration of cluster analysis results. CARVE addresses the sensitivity of clustering outcomes to algorithm and hyperparameter choices, which often hinders reproducibility in scientific discovery. The package offers stability and generalizability diagnostics at multiple levels and provides principled selection rules, outperforming traditional validation indices on synthetic and real-world biological data. AI

IMPACT Improves reproducibility of scientific discoveries derived from data clustering.

RANK_REASON The cluster contains a research paper detailing a new open-source software package for statistical analysis.

Read on arXiv stat.ML →

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

COVERAGE [2]

  1. arXiv stat.ML TIER_1 English(EN) · Kai R. Wycik, Tiffany M. Tang, Tarek M. Zikry, Genevera I. Allen ·

    Cluster Analysis with Resampling for Validation and Exploration (CARVE)

    arXiv:2606.00327v1 Announce Type: cross Abstract: Clustering is widely used across the sciences as the foundation for downstream data-driven scientific discoveries. However, clustering results are highly sensitive to the choice of algorithm, preprocessing, and the number of clust…

  2. arXiv stat.ML TIER_1 English(EN) · Genevera I. Allen ·

    Cluster Analysis with Resampling for Validation and Exploration (CARVE)

    Clustering is widely used across the sciences as the foundation for downstream data-driven scientific discoveries. However, clustering results are highly sensitive to the choice of algorithm, preprocessing, and the number of clusters $k$, producing scientific claims that are ofte…