Learning a Representation for Cover Song Identification Using Convolutional Neural Network
About
Cover song identification represents a challenging task in the field of Music Information Retrieval (MIR) due to complex musical variations between query tracks and cover versions. Previous works typically utilize hand-crafted features and alignment algorithms for the task. More recently, further breakthroughs are achieved employing neural network approaches. In this paper, we propose a novel Convolutional Neural Network (CNN) architecture based on the characteristics of the cover song task. We first train the network through classification strategies; the network is then used to extract music representation for cover song identification. A scheme is designed to train robust models against tempo changes. Experimental results show that our approach outperforms state-of-the-art methods on all public datasets, improving the performance especially on the large dataset.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Cover Song Identification | SHS100K (test) | MAP78.9 | 27 | |
| Cover Song Identification | Covers80 | MAP0.84 | 19 | |
| Cover Song Identification | Mazurkas | MAP0.933 | 9 | |
| Music Cover Retrieval | Covers80 (C80) (test) | Mean Rank @13.43 | 8 | |
| Music Cover Retrieval | Discogs-VI (D-VI) (test) | MR@1810.9 | 8 | |
| Cover Song Identification | YouTube | MAP91.7 | 7 | |
| Audio Cover Song Identification | Youtube350 (test) | MAP91.7 | 5 | |
| Audio Cover Song Identification | Covers80 (test) | MAP0.84 | 4 | |
| Cover Song Identification | Covers80 (full) | mAP84 | 4 |