Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Synthetic in-context reasoning on MAD synthetic (test)

55.5Compression Score

FEM-AFT

30.0236.63543.2549.865Feb 6, 2026Feb 24, 2026Mar 14, 2026Apr 2, 2026Apr 20, 2026May 8, 2026May 27, 2026
Updated 5d ago

Evaluation Results

MethodLinks
2026.02
55.59.7890.380.190.293.469.9
2026.02
53.143.199.985.999.999.380.2
2026.02
5319.199.986.399.99974.9
2026.02
52.76.790.489.590.186.369.3
2026.02
52.339.199.985.899.999.479.4
2026.02
51.913.297.186.193.591.472.2
2026.02
51.235.499.985.998.59978.3
2026.02
51.212.492.285.192.489.270.4
2026.02
51.116.890.789.792.79773
2026.02
50.732.899.985.79897.677.5
2026.02
50.512.893.488.986.392.270.7
2026.02
50.59.156331.169.290.152.2
2026.02
50.33999.985.499.99878.8
2026.02
49.526.399.985.797.597.576.1
2026.02
47.19.491.783.492.588.568.8
4529.899.980.299.994.374.9
2026.02
4531.499.985.599.996.376.3
2026.02
44.814.49989.498.69373.2
2026.02
44.324.599.985.798.595.174.7
2026.02
43.621.196.486.996.793.373
2026.02
42.93999.983.797.195.876.4
2026.05
42.46.575.687.671.39563.1
2026.02
42.235.799.952.899.999.971.7
2026.02
40.28.591.381.386.876.864.2
2026.05
34.226.880.380.786.19567.2
2026.05
33.235.695.173.393.798.871.6
2026.02
33.18.29174.975.693.162.6
2026.05
32.5119279.290.793.966.5
2026.05
3121.999.886.199.956.865.9