Adaptive Accompaniment with ReaLchords

About

Jamming requires coordination, anticipation, and collaborative creativity between musicians. Current generative models of music produce expressive output but are not able to generate in an \emph{online} manner, meaning simultaneously with other musicians (human or otherwise). We propose ReaLchords, an online generative model for improvising chord accompaniment to user melody. We start with an online model pretrained by maximum likelihood, and use reinforcement learning to finetune the model for online use. The finetuning objective leverages both a novel reward model that provides feedback on both harmonic and temporal coherency between melody and chord, and a divergence term that implements a novel type of distillation from a teacher model that can see the future melody. Through quantitative experiments and listening tests, we demonstrate that the resulting model adapts well to unfamiliar input and produce fitting accompaniment. ReaLchords opens the door to live jamming, as well as simultaneous co-creation in other modalities.

Yusong Wu, Tim Cooijmans, Kyle Kastner, Adam Roberts, Ian Simon, Alexander Scarlatos, Chris Donahue, Cassie Tarakajian, Shayegan Omidshafiei, Aaron Courville, Pablo Samuel Castro, Natasha Jaques, Cheng-Zhi Anna Huang• 2025

Related benchmarks

Task	Dataset	Result
model jamming with fixed melodies	Out of Distribution (test)	Harmony Ratio47.5	5
model jamming with fixed melodies	Hooktheory, POP909, Nottingham, and Wikifonia (test)	Harmony Ratio48.4	5
Model jamming	Learned melody agent model-to-model interaction	Harmony0.626	4
Model jamming	User interaction real-time	Harmony0.462	3

Showing 4 of 4 rows

Other info

Follow for update

@wizwand_team Discord