[P] Extreme Imbalance Data from 100K dataset only have 56 failure [P]
A user on r/MachineLearning is seeking advice on predicting machine failures using a dataset with extreme class imbalance. The dataset contains 100,000 entries, but only 56 instances are labeled as failures. The user has found that operating hours and humidity do not correlate with failures and is looking for suitable algorithms or deep learning approaches to handle this data scarcity for failure prediction and remaining useful life (RUL) estimation. AI