Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Confidence Estimation on Global-MMLU Russian (test)

75AUROC

Seq. Likelihood

43.851.96068.1May 29, 2026
Updated 2d ago

Evaluation Results

MethodLinks
2026.05
75475462
2026.05
73495150
2026.05
72433942
2026.05
6257514
2026.05
45212121