PulseAugur
LIVE 08:32:26
research · [2 sources] ·
0
research

Study finds feature dimensionality more critical than model complexity for breast cancer classification

A new study published on arXiv evaluates machine learning models for classifying breast cancer subtypes using gene expression data from TCGA-BRCA. The research found that feature dimensionality significantly impacts classification performance, often outweighing model complexity. Logistic regression demonstrated the most balanced performance across subtypes, including rare classes, while random forest and SVM showed varying sensitivities to feature selection and dimensionality. AI

Summary written by gemini-2.5-flash-lite from 2 sources. How we write summaries →

IMPACT Highlights the importance of feature selection and model simplicity in high-dimensional biological classification tasks.

RANK_REASON Academic paper on machine learning applied to biological data.

Read on arXiv cs.LG →

COVERAGE [2]

  1. arXiv cs.LG TIER_1 · Meena Al Hasani ·

    Feature Dimensionality Outweighs Model Complexity in Breast Cancer Subtype Classification Using TCGA-BRCA Gene Expression Data

    arXiv:2605.06562v1 Announce Type: new Abstract: Accurate classification of breast cancer subtypes from gene expression data is critical for diagnosis and treatment selection. However, such datasets are characterized by high dimensionality and limited sample size, posing challenge…

  2. arXiv cs.LG TIER_1 · Meena Al Hasani ·

    Feature Dimensionality Outweighs Model Complexity in Breast Cancer Subtype Classification Using TCGA-BRCA Gene Expression Data

    Accurate classification of breast cancer subtypes from gene expression data is critical for diagnosis and treatment selection. However, such datasets are characterized by high dimensionality and limited sample size, posing challenges for machine learning models. In this study, we…