Modeling Temporal Dependencies in High-Dimensional Sequences: Application to Polyphonic Music Generation and Transcription
About
We investigate the problem of modeling symbolic sequences of polyphonic music in a completely general piano-roll representation. We introduce a probabilistic model based on distribution estimators conditioned on a recurrent neural network that is able to discover temporal dependencies in high-dimensional sequences. Our approach outperforms many traditional models of polyphonic music on a variety of realistic datasets. We show how our musical language model can serve as a symbolic prior to improve the accuracy of polyphonic transcription.
Nicolas Boulanger-Lewandowski, Yoshua Bengio, Pascal Vincent• 2012
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Polyphonic music modeling | Nottingham (Nott) | NLL (nats)2.31 | 14 | |
| Polyphonic music modeling | JSB Chorales | Negative Log-Likelihood (nats)5.19 | 14 | |
| Polyphonic music modeling | Piano-midi.de | NLL (nats)7.05 | 12 | |
| Polyphonic music modeling | MuseData (Muse) | Negative Log-Likelihood (nats)5.6 | 12 | |
| Polyphonic Music Generation | Nottingham (test) | NLL4.46 | 11 | |
| Polyphonic Music Generation | JSB (test) | Negative Log-Likelihood8.71 | 5 | |
| Polyphonic Music Generation | Piano (test) | Negative Log-Likelihood8.37 | 5 | |
| Polyphonic Music Generation | Musedata (test) | NLL8.13 | 5 | |
| Music Modeling | Bach corpus quarter-note resolution (test) | Framewise NLL5.03 | 4 | |
| Music Modeling | Bach corpus eighth-note resolution (test) | Framewise NLL3.78 | 1 |
Showing 10 of 11 rows