
Multi-task Language Understanding on MMLU STEM (test)

Accuracy: 72.4

MAmmoTH2-8x7B-Plus

[Chart: Accuracy; axis ticks 65.432, 67.241, 69.05, 70.859; data point dated May 6, 2024]
Updated 1mo ago

Evaluation Results

Date       Accuracy   Method   Links
2024.05    72.4
2024.05    65.7