Multi-task Language Understanding on MMLU (Average Metrics)

64.29 MMLU Score

[Chart: MMLU score, y-axis 63.1044 to 64.0278; Apr 9, 2025]
Updated 4d ago

Evaluation Results

Date      MMLU Score   (unlabeled)   (unlabeled)
2025.04   64.29        62.04          12.99
2025.04   64.19        57.98         -80.25
2025.04   64.17        59.69         -40.99
2025.04   64.09        62.76          29.55
2025.04   64.02        59.23         -51.49
2025.04   64.02        59.87         -36.97
2025.04   64.02        60.69         -18.13
2025.04   63.79        58.34         -71.92
2025.04   63.6         59.82         -38.06
2025.04   63.47        61.48           0
2025.04   63.47        58.3          -72.84
2025.04   63.46        57.49         -91.42
2025.04   63.35        57.12        -100
2025.04   63.15        59.51         -45.1