Streaming Speech Generation on Streaming generation scenarios 0.6s speech chunk

First-chunk Latency (ms): 368.67

VocalNet-MDM

Updated 5mo ago

Evaluation Results

Method	Links	First-chunk Latency (ms)
VocalNet-MDM 2026.02		368.67
VocalNet-MDM 2026.02		373.29
VocalNet-MDM 2026.02		382.36
VocalNet-MDM 2026.02		402.19
VocalNet-MDM 2026.02		427.45
VocalNet-8B 2026.02		462.32
Baseline-AR 2026.02		481.22
VITA-Audio 2026.02		512.64
Baseline-AR 2026.02		555.86
SLAM-Omni 2026.02		742.32
GLM-4-Voice 2026.02		1,066.02
MiniCPM-o 2026.02		1,329.52
Kimi-Audio 2026.02		1,371.48