
Preference Classification on WebGPT comparisons (test)

60.8 Accuracy

UMM-RM

[Chart: accuracy over time; y-axis ticks 51.024, 53.562, 56.1, 58.638; x-axis date Nov 30, 2025]
Updated 4d ago

Evaluation Results

Method    Links
2025.11    60.8
2025.11    60.6
2025.11    59.6
2025.11    58.6
2025.11    57.8
2025.11    52.2
2025.11    51.4