FROM FIXED WINDOWS TO MUSICAL BARS: STRUCTURE-AWARE PREPROCESSING IN MIR

Seonghyeon Go, Yumin Kim

Primary Subject: Early Research

Some of the required materials for this paper do not exist: Video

Abstract:

In music, unlike general audio data, there are characteristics according to progression such as structural information and BPM. However, in many MIR models, audio is often preprocessed under fixed information in consideration of compatibility with existing models. This work investigates BPM-aware and structure-based segmentation for music genre classification. We compare three approaches: 30-second whole track, fixed 5-second segments, and musically-informed 2-bar segments on GTZAN dataset. Our experiments demonstrate the feasibility and trade-offs of structure-aware preprocessing. The codes are available online.