Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

EN

Benchmarks

Task NameDataset NameSOTA ResultTrend
Question Answeringen multifield
F1 Score44.29
21
Dialect RobustnessEN
Success Rate57
11
Text-to-SpeechEN
WER3.1
3
Function InvocationEN Ver. (Dual)
Token Usage1,300.7
3
Function InvocationEN Ver. (Single)
Invocation Accuracy0.9
3
Showing 5 of 5 rows