Natural Language Inference on AX-g
Best Accuracy: 51.97 (QuRating), Feb 5, 2026
Updated 4d ago
Evaluation Results
| Method | Model Backbone | Date | Accuracy |
|---|---|---|---|
| QuRating | GPT-2 XL | 2026.02 | 51.97 |
| DCLM-FastText | GPT-2 XL | 2026.02 | 51.97 |
| PPL | GPT-2 XL | 2026.02 | 51.12 |
| GREATS | GPT-2 XL | 2026.02 | 50.84 |
| OPUS | GPT-2 XL | 2026.02 | 50.56 |
| Random | GPT-2 XL | 2026.02 | 50 |
| FineWeb-Edu | GPT-2 XL | 2026.02 | 50 |
| DSIR | GPT-2 XL | 2026.02 | 49.44 |
| UltraFineweb | GPT-2 XL | 2026.02 | 48.31 |