PulseAugur
EN
LIVE 06:04:14

Pretrained music embeddings show promise for jazz standard recognition

Researchers have evaluated pretrained music embeddings for the task of recognizing jazz standards from audio, a challenging problem due to variations in performance, tempo, and arrangement. A Harmonic CNN model trained from scratch showed overfitting to training performances, while pretrained embeddings from foundation models offered better retrieval results but were sensitive to performer identity. A lightweight contrastive projection helped mitigate this sensitivity, suggesting jazz standard recognition can serve as a valuable benchmark for music representation models. AI

IMPACT This research could lead to improved music information retrieval systems, particularly for complex genres like jazz.

RANK_REASON Academic paper published on arXiv detailing research findings. [lever_c_demoted from research: ic=1 ai=0.7]

Read on arXiv cs.LG →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

Pretrained music embeddings show promise for jazz standard recognition

COVERAGE [1]

  1. arXiv cs.LG TIER_1 English(EN) · Çağrı Eser ·

    Evaluating Pretrained Music Embeddings for Cross-Performance Jazz Standard Recognition

    Recognizing jazz standards from audio is a challenging form of tune-level music retrieval: different performances of the same standard may vary in tempo, key, arrangement, instrumentation, improvisational content, and even whether the head melody is present. We study this problem…