Quality Estimation on PAWS-X
[Chart: BCE by date; highlighted point: Sigmoid Head, BCE 0.883, Jan 2, 2026. Updated 4d ago.]
Evaluation Results
Method                            Model  Date     BCE
Sigmoid Head                      Olmo   2026.01  0.883
Monte Carlo Seq. Entropy Sigmoid  Olmo   2026.01  0.92
Monte Carlo Seq. Entropy Sigmoid  Tower  2026.01  1.047
Sigmoid Head                      Tower  2026.01  1.056
LLM Self Judge                    Olmo   2026.01  6.416
Monte Carlo Seq. Entropy Softmax  Tower  2026.01  11.828
Monte Carlo Seq. Entropy Softmax  Olmo   2026.01  16.228
LLM Self Judge                    Tower  2026.01  22.993