General Language Understanding and Reasoning on HuggingFace Open LLM Leaderboard (Composite)

62 HellaSwag Accuracy

LCG-MultinomialNB-6k

Feb 26, 2025
Updated 9d ago

Evaluation Results

Date     HellaSwag Accuracy  Other scores
2025.02  62                  59.51  40.51  52.9   53.73
2025.02  61.99               59.51  40.33  52.22  53.51
2025.02  61.94               59.24  38.29  51.62  52.77
2025.02  61.64               58.48  37     51.88  52.25
2025.02  61.61               62.23  53.75  54.95  58.14
2025.02  61.43               62.67  54.28  54.78  58.29
2025.02  61.18               57.73  31.61  53.07  50.9
2025.02  61.14               61.09  50.87  53.5   56.65
2025.02  60.98               59.34  35.71  49.83  51.47
2025.02  60.95               62.26  52.92  52.82  57.23
2025.02  60.94               58.96  36.62  48.81  51.33
2025.02  60.86               58.45  35.1   52.05  51.62
2025.02  60.86               58.45  35.1   53.07  51.87
2025.02  60.83               58.75  35.03  53.07  51.92
2025.02  60.83               58.75  35.03  53.07  51.92
2025.02  60.58               59.34  37.31  51.11  52.09
2025.02  60.58               62.13  50.34  51.11  55.82
2025.02  60.57               61.36  46.1   52.41  55.74
2025.02  60.38               61.95  50.34  51.54  55.36
2025.02  60.17               62.13  50.42  50.26  49.98