SOTA Overall Evaluation benchmarks and papers with code

Benchmarks

Dataset Name	SOTA Method	Metric
Demo-ICL-Bench		Average Score80.1	14	5mo ago
DEMO	GPT-4o	Overall Score6.779	10	5mo ago
Aggregate	openPangu-Embedded RL	Average Score68.73	9	2mo ago
Bilingual Full-Duplex-Bench English	SoulX-Duplug	Accuracy81.2	8	4mo ago
BlenderBench	VIGA	Improvement159.19	8	5mo ago
Bilingual Full-Duplex-Bench Chinese	SoulX-Duplug	Accuracy91.6	2	4mo ago

Showing 6 of 6 rows