EN

Benchmarks

Task Name	Dataset Name	Metric	SOTA Result
Question Answering	en multifield	F1 Score	44.29	21
Dialect Robustness	EN	Success Rate	57	11
Graph parsing	en	LF Score	94.15	7
Text-to-Speech	EN	N-MOS	4.02	5
Spoken Dialogue Generation	en short (test)	WER	2.79	3
Spoken Dialogue Generation	en (test)	cpSIM	43.7	3
Text-to-Speech	EN	WER	3.1	3
Function Invocation	EN Ver. (Dual)	Token Usage	1,300.7	3
Function Invocation	EN Ver. (Single)	Invocation Accuracy	0.9	3
