Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Natural Language Inference on CrossFit NLI (test)

83.6Accuracy

ABMLL

56.5663.5870.677.62Aug 19, 2025
Updated 16d ago

Evaluation Results

MethodLinks
2025.08
83.620.8
2025.08
83.324.2
2025.08
82.623.1
2025.08
82.223.7
2025.08
79.726.9
2025.08
79.127
2025.08
78.531
2025.08
75.530.2
2025.08
69.329.8
2025.08
57.641.9