Consensus Ranking on PRISM Llama-3.2-1B
[Chart: Exact Match. Kemeny-Young: 94 (Feb 4, 2026). Metrics: Exact Match, Phi (φ), L-test (ℓtest).]
Updated 4d ago
Evaluation Results
Method                                                   Exact Match   Phi (φ)   L-test (ℓtest)
Kemeny-Young (Aggregation Method=Kem..., 2026.02)        94            -         -
Mallows (Kendall) (Aggregation Method=Mal..., 2026.02)   -             0.144     -14,253.5