PulseAugur
meme · [1 source]

ML engineers grapple with hyperparameter optimization drift in long training runs

A machine learning practitioner is facing challenges with hyperparameter optimization (HPO) for large models that take a full day to train. To make HPO feasible, they are reducing the number of training epochs, which raises concerns about parameter drift and hyperparameters that end up suboptimal for full-length training runs. The user also questions the effectiveness of pruning methods, suspecting they may favor fast convergence over final accuracy.
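
To make the trade-off concrete, here is a minimal sketch of the multi-fidelity HPO pattern the post describes: tuning on a truncated epoch budget with epoch-level pruning. It assumes an Optuna-style setup; Optuna's MedianPruner and the synthetic train_one_epoch curve are illustrative assumptions, not the poster's actual stack.

```python
# Minimal sketch: HPO on a short epoch budget with early pruning.
# `train_one_epoch` is a hypothetical stand-in for one real epoch of
# training; replace it with the actual training loop.
import math

import optuna

SHORT_BUDGET = 5  # truncated run used for HPO (vs. the full day-long run)


def train_one_epoch(lr: float, epoch: int) -> float:
    # Hypothetical learning curve whose early slope and final plateau
    # depend differently on lr, mimicking the concern that the
    # fastest-converging config is not the most accurate one.
    speed = 1.0 / (1.0 + abs(math.log10(lr) + 2))    # converges fast near lr=1e-2
    ceiling = 1.0 / (1.0 + abs(math.log10(lr) + 3))  # plateaus higher near lr=1e-3
    return ceiling * (1.0 - math.exp(-speed * (epoch + 1)))


def objective(trial: optuna.Trial) -> float:
    lr = trial.suggest_float("lr", 1e-5, 1e-1, log=True)
    acc = 0.0
    for epoch in range(SHORT_BUDGET):
        acc = train_one_epoch(lr, epoch)
        trial.report(acc, epoch)
        # MedianPruner stops trials below the running median at each epoch,
        # so a slow starter with a higher ceiling can be pruned early:
        # exactly the convergence-vs-accuracy bias the post worries about.
        if trial.should_prune():
            raise optuna.TrialPruned()
    return acc


study = optuna.create_study(
    direction="maximize",
    pruner=optuna.pruners.MedianPruner(n_warmup_steps=1),
)
study.optimize(objective, n_trials=30)
print(study.best_params)
```

One common mitigation, for context, is to use a budget-aware scheduler such as Hyperband (optuna.pruners.HyperbandPruner), which promotes surviving configurations to progressively longer budgets rather than judging everything at the short one.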

Summary written by gemini-2.5-flash-lite from 1 source.

RANK_REASON: This is a user question on a forum about a technical challenge, not a news event.


COVERAGE [1]

  1. r/MachineLearning TIER_1 Norwegian (NO) · /u/Counter-Business

    HPO - hyperparameter drift [D]

    Hey all, so I am running into a problem. I am training massive ML models which take literally a day to fully train.

    We want to run HPO to make it so that we can get the best parameters for the model and we require very high accuracy for t…