Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Counterfactual Generation on SIB200

86.7SLFR

Gemma3-12B

45.93256.51667.177.684May 12, 2026
Updated 21d ago

Evaluation Results

MethodLinks
2026.05
86.778.925.4852.10.4670.533
2026.05
85-26.357--
2026.05
84.278.931.6155.10.5540.446
2026.05
8376.353.9530.4710.529
2026.05
83-23.658--
2026.05
82.857.616.967.322.704-21.704
2026.05
82.869.712.4539.97.464-6.464
2026.05
81.855.625.196.38.081-7.081
2026.05
81.473.314.8665.30.4460.554
2026.05
81-27.657--
2026.05
80-30.166--
2026.05
79.470.918.5455.70.5080.492
2026.05
78.866.713.3223.97.754-6.754
2026.05
77.969.121.0358.50.4940.506
2026.05
76.837.444.397.81.211-0.211
2026.05
76.837.440.498.40.9410.059
2026.05
76.865.712.5125.58.144-7.144
2026.05
76.772.443.7559.80.4380.562
2026.05
76.770.328.1357.40.4720.528
2026.05
76-28.565--
2026.05
75.840.434.9480.9510.049
2026.05
75.862.615.23247.769-6.769
2026.05
74-24.660--
2026.05
74-17.667--
2026.05
73.741.411.9526.77.64-6.64
2026.05
72-18.269--
2026.05
72-19.370--
2026.05
71.740.434.988.31.235-0.235
2026.05
71-23.265--
2026.05
71-24.466--
2026.05
70.961.245.1565.80.3060.694
2026.05
70-31.455--
2026.05
70-27.970--
2026.05
68.731.315.687.422.77-21.77
2026.05
68.742.413.5942.77.62-6.62
2026.05
68.161.422.6669.20.3080.692
2026.05
68-18.768--
2026.05
68-2469--
2026.05
67.857.826.1564.90.320.68
2026.05
67.743.410.831.77.959-6.959
2026.05
67.740.412.0428.88.039-7.039
2026.05
67.654.918.7168.10.4460.554
2026.05
67-26.168--
2026.05
66.262.935.0969.20.4670.533
2026.05
66-35.169--
2026.05
66-26.365--
2026.05
65.729.319.297.38.045-7.045
2026.05
65.256.832.0968.60.2630.737
2026.05
65-2471--
2026.05
65-16.567--
2026.05
65-18.771--
2026.05
64.659.140.7770.40.2510.749
2026.05
64-31.765--
2026.05
64-24.478--
2026.05
64-19.174--
2026.05
63.557.222.0869.60.2950.705
2026.05
63-48.861--
2026.05
63-27.675--
2026.05
62.755.948.7661.30.5290.471
2026.05
62-28.369--
2026.05
61.553.438.5968.10.320.68
2026.05
61-27.766--
2026.05
61-20.374--
2026.05
61-38.872--
2026.05
61-32.472--
2026.05
60.954.827.7466.30.4010.599
2026.05
60.551.420.2673.70.4590.541
2026.05
59.645.541.473.30.5620.438
2026.05
59.649.530.5766.10.5580.442
2026.05
59.645.522.4366.20.6090.391
2026.05
59.448.953.2473.10.2310.769
2026.05
59.249.928.5475.40.3890.611
2026.05
5950.818.8871.10.4560.544
2026.05
59-28.575--
2026.05
59-18.971--
2026.05
58.651.519.465.60.5830.417
2026.05
57.650.534.8873.50.1690.831
2026.05
56.614.118.6949.11.889-0.889
2026.05
55.624.222.452.11.512-0.512
2026.05
53.324.422.9384.40.3240.676
2026.05
53.148.258.5280.10.2240.776
2026.05
51.213.742.8571.40.3970.603
2026.05
50.544.432.2373.10.6120.388
2026.05
49.642.323.7379.30.1990.801
2026.05
47.539.420.3873.30.5790.421