Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Language Modeling on NLP Benchmark Suite Aggregate

-9.2Average Delta

LoFIT

-9.532-7.291-5.05-2.809Feb 28, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.02
-9.2
2026.02
-8.9
2026.02
-7.3
2026.02
-5.9
2026.02
-5.5
2026.02
-4.7
2026.02
-4.5
2026.02
-3.5
2026.02
-3.4
2026.02
-2.8
2026.02
-2.6
2026.02
-2.4
2026.02
-2.3
2026.02
-1.7
2026.02
-1.5
2026.02
-0.9