Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Mathematical Reasoning on GSM8K FLORES-200 (10 low-resourced languages) (test)

67.93Kazakh (Cyrl) Accuracy

DIP

12.820427.127741.43555.7423Nov 2, 2024
Updated 13d ago

Evaluation Results

MethodLinks
2024.11
67.9346.1780.3667.153.343.2968.6160.563.6868.3161.92
2024.11
61.1136.7760.3557.1649.1322.6760.9620.8544.2835.4844.88
2024.11
23.6514.0319.9419.7919.2610.3121.919.5517.8217.4417.37
2024.11
23.4316.324.9421.2318.514.8623.5819.1822.1422.3720.65
2024.11
20.3912.0522.0615.4713.5712.4319.0313.9516.3818.216.35
2024.11
17.899.115.6211.5211.35.9114.255.849.559.411.04
2024.11
14.946.2913.8710.0810.246.3712.897.0510.248.1910.02