
Language Model Accuracy Evaluation on LLM Evaluation Scenarios

Accuracy: 85.44

[Chart: ICL Accuracy; y-axis ticks -0.8072, 21.5839, 43.975, 66.3661; x-axis label Sep 29, 2025]
Updated 6d ago

Evaluation Results

Date     Accuracy  Method  Links
2025.09  85.44
2025.09  81.93
2025.09  81.37
2025.09  81.33
2025.09  78.18
2025.09  75.59
2025.09  14.82
2025.09  12.52
2025.09  2.51