PulseAugur
EN
LIVE 00:11:42

ML engineers grapple with hyperparameter optimization drift in long training runs

A machine learning practitioner is facing challenges with hyperparameter optimization (HPO) for large models that require a full day to train. To make HPO feasible, they are reducing the number of training epochs, which raises concerns about parameter drift and suboptimal optimization for full training runs. The user is also questioning the effectiveness of pruning methods, suspecting they might favor faster convergence over achieving higher accuracy. AI

RANK_REASON This is a user question on a forum about a technical challenge, not a news event.

Read on r/MachineLearning →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. r/MachineLearning TIER_1 Norsk(NO) · /u/Counter-Business ·

    HPO - hyperparameter drift [D]

    <!-- SC_OFF --><div class="md"><p>Hey all, so I am running into a problem. I am training massive ML models which take literally a day to fully train. </p> <p>We want to run HPO to make it so that we can get the best parameters for the model and we require very high accuracy for t…