PulseAugur
EN
LIVE 12:18:35

Machine learning user seeks advice on extreme imbalance data for failure prediction

A user on r/MachineLearning is seeking advice on predicting machine failures using a dataset with extreme class imbalance. The dataset contains 100,000 entries, but only 56 instances are labeled as failures. The user has found that operating hours and humidity do not correlate with failures and is looking for suitable algorithms or deep learning approaches to handle this data scarcity for failure prediction and remaining useful life (RUL) estimation. AI

RANK_REASON This is a user query on a forum, not a research paper, model release, or industry news.

Read on r/MachineLearning →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

Machine learning user seeks advice on extreme imbalance data for failure prediction

COVERAGE [1]

  1. r/MachineLearning TIER_1 English(EN) · /u/False-Seesaw-1899 ·

    [P] Extreme Imbalance Data from 100K dataset only have 56 failure [P]

    <table> <tr><td> <a href="https://www.reddit.com/r/MachineLearning/comments/1u2ut7s/p_extreme_imbalance_data_from_100k_dataset_only/"> <img alt="[P] Extreme Imbalance Data from 100K dataset only have 56 failure [P]" src="https://preview.redd.it/plbydmenmm6h1.png?width=140&amp;hei…