Open Screen Soundtrack Library Version 2
Haven Kim, Leduo Chen, Bill Wang, Hao-Wen Dong, Julian McAuley
Primary Subject: Dataset
Some of the required materials for this paper do not exist: Video
Despite growing interest in video-to-music generation systems, their application in film production remains limited, primarily due to the lack of large-scale datasets containing aligned pairs of movie clips and soundtracks. Although prior work has attempted to construct such a dataset, this comprises only 36.5 hours of data, which is insufficient for training robust models. In this study, we present Open Screen Soundtrack Library Version 2, a novel dataset comprising pairs of video clips from films and their corresponding soundtracks, curated with a novel methodology that automatically identifies and extracts soundtrack segments from video clips. This dataset consists of 552.70 hours and 76,408 video clips sourced from both public domain movies as well as commercial ones from a publicly available dataset.