Error Detection on Bamboogle

0.94 F1 Score

Ensemble-A

Feb 4, 2026
Updated 3d ago

Evaluation Results

Date     F1    Metric 2  Metric 3  Metric 4
2026.02  0.94  0.95      0.94      0.8
2026.02  0.94  -         -         0.8
2026.02  0.93  0.96      0.9       0.82
2026.02  0.93  -         -         0.82
2026.02  0.92  0.94      0.9       0.91
2026.02  0.92  -         -         0.91
2026.02  0.9   0.95      0.85      0.76
2026.02  0.9   -         -         0.76
2026.02  0.89  0.87      0.91      0.87
2026.02  0.89  -         -         0.87
2026.02  0.85  0.85      0.85      0.84
2026.02  0.85  -         -         0.84
2026.02  0.84  0.89      0.81      0.84
2026.02  0.84  -         -         0.84
2026.02  0.8   0.99      0.67      0.8
2026.02  0.8   -         -         0.8
2026.02  0.73  0.86      0.63      0.81
2026.02  0.73  0.94      0.59      0.66
2026.02  0.73  -         -         0.81
2026.02  0.73  -         -         0.66
2026.02  0.67  0.71      0.63      0.79
2026.02  0.67  -         -         0.79
2026.02  0.59  0.67      0.53      0.74
2026.02  0.59  0.94      0.43      0.62
2026.02  0.59  -         -         0.74
2026.02  0.59  -         -         0.62
2026.02  0.56  0.69      0.47      0.72
2026.02  0.56  -         -         0.72
2026.02  0.51  0.81      0.37      0.63
2026.02  0.51  -         -         0.63
2026.02  0.48  0.73      0.36      0.6
2026.02  0.48  -         -         0.6
2026.02  0.47  0.64      0.37      0.67
2026.02  0.47  -         -         0.66
2026.02  0.43  0.67      0.32      0.64
2026.02  0.43  -         -         0.64