Researchers have developed a new heuristic-based method to merge High-Performance Computing (HPC) execution traces, aiming to expand the coverage of hardware counters available for machine learning-based performance prediction. This technique addresses the limitation of collecting a restricted set of hardware counters simultaneously by merging traces from multiple runs, each with different counters. The approach matches computation bursts across executions using MPI structure, timing, and communication patterns to create a unified dataset with a richer feature space for training ML models without manual counter selection. Validation on the MareNostrum5 machine demonstrated that the merged counters maintain acceptable accuracy for various applications and kernels. AI
IMPACT Enables more comprehensive hardware counter data for ML models, potentially improving the accuracy of HPC performance predictions.
RANK_REASON Publication of an academic paper on a novel methodology for HPC trace analysis. [lever_c_demoted from research: ic=1 ai=1.0]
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →