
VoiceBench

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
General Audio Understanding | VoiceBench | AlpacaEval Score 4.78 | 19
Speech-to-Text | VoiceBench | AlpacaEval Score 4.78 | 15
Speech-to-text reasoning and semantic understanding | VoiceBench (test) | Alpaca Eval 4.8 | 13
Reasoning | VoiceBench | MMSU Accuracy (Audio) 72.9 | 13
Audio Instruction Following | VoiceBench | AlpacaEval Score 4.78 | 10
General capability evaluation | Voicebench | HS Score 76.91 | 8
Speech-to-Text Spoken Question Answering | VoiceBench S2T (test) | AlpacaEval 4.8 | 7
Voice Chatting | VoiceBench | AlpacaEval 4.94 | 7
Conversational Intelligence | VoiceBench | AlpacaEval 4.57 | 6
Spoken Question Answering | VoiceBench S2T | AlpacaEval 4.94 | 4