The Radio-Frequency Transformer for Signal Separation
About
We study a problem of signal separation: estimating a signal of interest (SOI) contaminated by an unknown non-Gaussian background/interference. Given the training data consisting of examples of SOI and interference, we show how to build a fully data-driven signal separator. To that end we learn a good discrete tokenizer for SOI and then train an end-to-end transformer on a cross-entropy loss. Training with a cross-entropy shows substantial improvements over the conventional mean-squared error (MSE). Our tokenizer is a modification of Google's SoundStream, which incorporates additional transformer layers and switches from VQVAE to finite-scalar quantization (FSQ). Across real and synthetic mixtures from the MIT RF Challenge dataset, our method achieves competitive performance, including a 122x reduction in bit-error rate (BER) over prior state-of-the-art techniques for separating a QPSK signal from 5G interference. The learned representation adapts to the interference type without side information and shows zero-shot generalization to unseen mixtures at inference time, underscoring its potential beyond RF. Although we instantiate our approach on radio-frequency mixtures, we expect the same architecture to apply to gravitational-wave data (e.g., LIGO strain) and other scientific sensing problems that require data-driven modeling of background and noise.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| RF Signal Reconstruction | EMI | Log10(BER)-3.52 | 11 | |
| RF Signal Reconstruction | CS2 | Log10(BER)-3.07 | 11 | |
| RF Signal Reconstruction | CS5G | log10(BER)-4.91 | 11 | |
| RF Signal Reconstruction | CS3 | log10(BER)-1.6 | 11 | |
| Source Separation | CommSignal5G (CS5G) (test) | MSE (dB)-46.32 | 6 | |
| Source Separation | EMI (test) | MSE (dB)-33.01 | 6 | |
| Source Separation | CommSignal2 (CS2) (test) | MSE (dB)-28.71 | 5 | |
| Source Separation | CommSignal3 (CS3) (test) | MSE (dB)-6.22 | 5 |