
UniSRM-BENCH

Benchmarks

Task Name                                                Dataset Name          SOTA Result       Trend
utterance-level pairwise preference judgement            UniSRM-BENCH T1       Accuracy 65.06    12
multi-turn dialogue speech evaluation                    UniSRM-BENCH T4       Accuracy 88.89    10
scenario-aware style consistency preference (Chinese)    UniSRM-BENCH T3-Zh    Accuracy 91.3     10
scenario-aware style consistency preference (English)    UniSRM-BENCH T3-En    Accuracy 85.61    10
fine-grained speech quality scoring                      UniSRM-BENCH T2       PCC 0.551         9