Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

TQCodec: Towards neural audio codec for high-fidelity music streaming

About

We propose TQCodec, a neural audio codec designed for high-bitrate, high-fidelity music streaming. Unlike existing neural codecs that primarily target ultra-low bitrates (<= 16kbps), TQCodec operates at 44.1 kHz and supports bitrates from 32 kbps to 128 kbps, aligning with the standard quality of modern music streaming platforms. The model adopts an encoder-decoder architecture based on SEANet for efficient on-device computation and introduces several enhancements: an imbalanced network design for improved quality with low overhead, SimVQ for mid-frequency detail preservation, and a phase-aware waveform loss. Additionally, we introduce a perception-driven band-wise bit allocation strategy to prioritize perceptually critical lower frequencies. Evaluations on diverse music datasets demonstrate that TQCodec achieves superior audio quality at target bitrates, making it well-suited for high-quality audio applications.

Lixing He, Zhouxuan Chen, Mingshuai Liu, Xinran Sun, Wucheng Wang, Minfu Li, Lingcheng Kong, Weifeng Zhao, Wenjiang Zhou• 2026

Related benchmarks

TaskDatasetResultRank
Audio compressionMusic Audio 44.1kHz
LSD (Low Freq)0.702
3
Subjective Audio Quality AssessmentSubjective Listening Test Dataset
Average MOS4.18
1
Showing 2 of 2 rows

Other info

Follow for update