Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Fairness-sensitive reasoning on UnQover
Loading...
99.9
Accuracy
C2PO
88.46
91.43
94.4
97.37
Dec 29, 2025
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
C2PO
Backbone=DeepSeek
2025.12
99.9
GPT-4
2025.12
88.9
Feedback
Search any
task
Search any
task