VoiceBench

Benchmarks

Task Name	Dataset Name	SOTA Result
Voice Evaluation	VoiceBench	SD Score (VoiceBench)90.1	20
General Audio Understanding	VoiceBench	Overall Score86.43	20
Speech-to-Text	VoiceBench	AlpacaEval Score4.78	15
Question Answering	VoiceBench	OpenbookQA Score89.23	13
Speech-to-text reasoning and semantic understanding	VoiceBench (test)	Alpaca Eval4.8	13
Reasoning	VoiceBench	MMSU Accuracy (Audio)72.9	13
Audio Instruction Following	VoiceBench	AlpacaEval Score4.78	10
Spoken Question Answering	VoiceBench	Accuracy76.79	9
Spoken Dialogue Evaluation	Voicebench	Alpa Score4.5	8
General capability evaluation	Voicebench	HS Score76.91	8
Empathetic Speech Generation	VoiceBench CommonEval	Empathy Score4.22	7
Speech-to-Text Spoken Question Answering	VoiceBench S2T (test)	AlpacaEval4.8	7
Voice Chatting	VoiceBench	AlpacaEval4.94	7
Speech-to-Text Reply Quality	VoiceBench AlpacaEval	Quality Score4.78	6
Conversational Intelligence	VoiceBench	AlpacaEval4.57	6
General Conversation	VoiceBench	AlpacaEval Score4.19	5
Spoken Question Answering	VoiceBench S2T	AlpacaEval4.94	4
Speech Question Answering	VoiceBench AlpacaEval	AlpacaEval Score4.81	3
Voice Interaction	VoiceBench	VoiceBench Average Score89.4	3
Dialogue	VoiceBench	Score93.1	3

Showing 20 of 20 rows