Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multitrack Music Transformer

About

Existing approaches for generating multitrack music with transformer models have been limited in terms of the number of instruments, the length of the music segments and slow inference. This is partly due to the memory requirements of the lengthy input sequences necessitated by existing representations. In this work, we propose a new multitrack music representation that allows a diverse set of instruments while keeping a short sequence length. Our proposed Multitrack Music Transformer (MMT) achieves comparable performance with state-of-the-art systems, landing in between two recently proposed models in a subjective listening test, while achieving substantial speedups and memory reductions over both, making the method attractive for real time improvisation or near real time creative applications. Further, we propose a new measure for analyzing musical self-attention and show that the trained model attends more to notes that form a consonant interval with the current note and to notes that are 4N beats away from the current step.

Hao-Wen Dong, Ke Chen, Shlomo Dubnov, Julian McAuley, Taylor Berg-Kirkpatrick• 2022

Related benchmarks

TaskDatasetResultRank
Symbolic music generationSOD (test)
Mean NLL0.632
11
Symbolic music generationLakh (test)
Mean NLL0.376
11
Symbolic music generationPop1k7 (test)
Mean NLL1.396
11
Symbolic music generationPOP909 (test)
Mean NLL0.986
11
Unconditional Symbolic Music GenerationLakh MIDI and Meta MIDI (test)
PCE0.103
4
Showing 5 of 5 rows

Other info

Follow for update