Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Binary Classification on Raisin OOD (test)
Loading...
50.2
Accuracy
Naive
36.784
40.267
43.75
47.233
Mar 19, 2026
Accuracy
95% CI
Standard Error (SE)
Updated 27d ago
Evaluation Results
Method
Method
Links
Accuracy
95% CI
Standard Error (SE)
Naive
selection_criterion=so...
2026.03
50.2
46.3
1.9
Pseudo-labeling
selection_criterion=ps...
2026.03
42.8
41.2
0.8
Oracle
selection_criterion=ta...
2026.03
37.3
36.1
0.6
Feedback
Search any
task
Search any
task