Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Learning to Defer on Synthetic benchmark (test)

28.1Test True Risk

Augmented comp-sum surrogate

26.5836.8447.157.36Mar 15, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
28.10.149.699.3
2026.03
33.45.462.558.4
2026.03
34.26.2048.8
2026.03
43.215.2024.9
2026.03
5526.950.124.9
2026.03
66.138.149.624.5