Share your thoughts, 1 month free Claude Pro on us
See more
Feedback
Search any
task
Search any
task
SOTA Self-evaluation benchmarks and papers with code | Wizwand
Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Tasks
Self-evaluation
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
CVBench
VAUQ
AUROC
0.747
36
3mo ago
VisualCoT
VAUQ
AUROC
80.2
36
3mo ago
MMVet
VAUQ
AUROC
0.886
36
3mo ago
ViLP
VAUQ
AUROC
77
36
3mo ago
Showing 4 of 4 rows
25 / page
50 / page
100 / page
1
Search any
task
Search any
task