Natural Language Inference on AX-b
Best accuracy: 58.42 (OPUS)
[Chart: Accuracy by date; data point at Feb 5, 2026]
Evaluation Results
| Method | Model Backbone | Date | Accuracy |
|---|---|---|---|
| OPUS | GPT-2 XL | 2026.02 | 58.42 |
| GREATS | GPT-2 XL | 2026.02 | 57.34 |
| FineWeb-Edu | GPT-2 XL | 2026.02 | 55.25 |
| PPL | GPT-2 XL | 2026.02 | 54.98 |
| QuRating | GPT-2 XL | 2026.02 | 54.35 |
| DSIR | GPT-2 XL | 2026.02 | 53.53 |
| Random | GPT-2 XL | 2026.02 | 52.54 |
| DCLM-FastText | GPT-2 XL | 2026.02 | 52.08 |
| UltraFineweb | GPT-2 XL | 2026.02 | 48.73 |