Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Downstream Task Evaluation on DS-Avg 9 Downstream Tasks Suite

63.6ARC-c Accuracy

Uncompressed

2232.843.654.4May 13, 2026
Updated 19d ago

Evaluation Results

MethodLinks
2026.05
63.680.586.385.988.183.451.874.688.378
2026.05
63.381.671.884.884.783.353.277.181.775.7
2026.05
62.584.688.878.883.179.852.674.674.875.5
2026.05
62.279.984.285.685.583.451.277.590.377.8
2026.05
62.179.587.982.584.98353.574.786.977.2
2026.05
61.981.774.683.583.583.152.975.682.675.5
2026.05
61.780.174.484.983.983.151.677.384.575.7
2026.05
61.282.379.380.183.281.951.176.487.975.9
2026.05
59.684.882.17178.376.949.67362.370.8
2026.05
57.1769081.579.88253.275.187.675.8
2026.05
56.578.590.876.87880.850.374.185.574.6
2026.05
56.373.991.382.580.782.954.776.385.175.9
2026.05
55.9768981.882.682.353.774.289.676.1
2026.05
54.473.190.580.678.881.253.87486.874.8
2026.05
52.271.175.3776980.151.576.478.170.1
2026.05
50.974.489.272.176.776.15170.633.266
2026.05
50.271.377.975.17079.850.57580.270
2026.05
49.676.689.563.869.272.947.167.522.162
2026.05
49.272.480.581.871.682.15475.682.472.2
2026.05
48.976.2757752.880.735.668.79.858.3
2026.05
46.867.683.770.565.475.548.873.630.462.5
2026.05
46.467.588.172.554.578.151.773.767.566.7
2026.05
4669.888.271.458.777.348.272.565.366.4
2026.05
45.164.689.278.362.58151.376.183.970.2
2026.05
42.764.487.267.951.774.146.970.93960.5
2026.05
41.567.266.469.234.677.333.364.82.650.8
2026.05
40.967.37373.638.778.135.167.82.453
2026.05
33.756.672.664.43572.635.264.52.548.6
2026.05
33.650.58662.934.672.44472.53454.5
2026.05
32.95862.555.326.971.73661.11.845.1
2026.05
32.850.362.86725.67341.764.92.246.7
2026.05
2947.585.154.932.769.447.267.634.652
2026.05
28.742.565.363.827.466.835.564.81.744.1
2026.05
27.139.363.35327.563.441.560.91.341.9
2026.05
26.341.862.343.327.265.139.250.31.239.6
2026.05
24.530.94832.325.154.250.753.2035.4
2026.05
24.330.442.828.825.353.250.250.9034
2026.05
24.229.538.63025.452.747.751.20.133.3
2026.05
23.630.843.327.923.554.15050.8033.8