Pretrained music embeddings show promise for jazz standard recognition

By PulseAugur Editorial · [1 sources] · 2026-07-01 11:04

Researchers have evaluated pretrained music embeddings for the task of recognizing jazz standards from audio, a challenging problem due to variations in performance, tempo, and arrangement. A Harmonic CNN model trained from scratch showed overfitting to training performances, while pretrained embeddings from foundation models offered better retrieval results but were sensitive to performer identity. A lightweight contrastive projection helped mitigate this sensitivity, suggesting jazz standard recognition can serve as a valuable benchmark for music representation models. AI

IMPACT This research could lead to improved music information retrieval systems, particularly for complex genres like jazz.

RANK_REASON Academic paper published on arXiv detailing research findings. [lever_c_demoted from research: ic=1 ai=0.7]

Read on arXiv cs.LG →

paper
other

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

Pretrained music embeddings show promise for jazz standard recognition

COVERAGE [1]

arXiv cs.LG TIER_1 English(EN) · Çağrı Eser · 2026-07-01 11:04

Evaluating Pretrained Music Embeddings for Cross-Performance Jazz Standard Recognition

Recognizing jazz standards from audio is a challenging form of tune-level music retrieval: different performances of the same standard may vary in tempo, key, arrangement, instrumentation, improvisational content, and even whether the head melody is present. We study this problem…

COVERAGE [1]

Evaluating Pretrained Music Embeddings for Cross-Performance Jazz Standard Recognition

RELATED ENTITIES

RELATED TOPICS