Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

IOI

Benchmarks

Task NameDataset NameSOTA ResultTrend
Component-level attributionIOI
Dissimilarity (dis.)0
32
Circuit DiscoveryIOI
AUC83.6
12
MCQ ClassificationIOI 2 v1 (Eva)
Accuracy100
6
MCQ ClassificationIOI v1 (Infer)
Accuracy1
6
Circuit DiscoveryIOI
KL Div0.668
6
Competitive ProgrammingIOI 2025
Score S100
4
AutointerpretationIOI
Accuracy76
4
Intrinsic Cluster Quality EvaluationIOI Pythia-160M
Mean Silhouette Score0.07
3
Intrinsic Cluster Quality EvaluationIOI
Silhouette Score (Mean)0.03
3
Circuit DiscoveryIOI
Sparsity96.74
3
Circuit DiscoveryIOI 400 examples v1
KL Divergence0.22
3
Circuit DiscoveryIOI 200 examples v1
KL Divergence0.25
3
Code ReasoningIOI 2025
Score439.28
2
Circuit DiscoveryIOI 100K examples v1
KL Divergence0.2
2
Showing 14 of 14 rows