Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Text to Text on ESNLI (test)
Loading...
80.6
Accuracy (ESNLI Test)
POME
59.8
65.2
70.6
76
Apr 5, 2026
Accuracy (ESNLI Test)
Updated 11d ago
Evaluation Results
Method
Method
Links
Accuracy (ESNLI Test)
POME
Runtime (min)=24.7
2026.04
80.6
SIFT
Runtime (min)=51.3
2026.04
80.4
BLUR
Runtime (min)=46.1
2026.04
80.1
Muon
Runtime (min)=47.9
2026.04
79.9
AdamW
Runtime (min)=44.6
2026.04
78.9
Base
2026.04
60.6
Feedback
Search any
task
Search any
task