Bias Detection on BABE (test)
[Chart: Macro F1 of the leading method, mini_listwise_BT (80.8), dated Dec 16, 2025. Selectable metrics: Macro F1, Recall, Accuracy, Precision.]
Updated 4d ago
Evaluation Results
Method                Details                    Date     Macro F1  Recall  Accuracy  Precision
mini_listwise_BT      Model=mini, Strategy=L...  2025.12  80.8      84.4    80.3      77.6
BERT + distant        Model=BERT, Strategy=D...  2025.12  80.4      -       -         -
mini_direct           Model=mini, Strategy=D...  2025.12  79.2      85.1    78        74.1
nano_24_pairwise_BT   Model=nano, Strategy=P...  2025.12  79        77.6    79.6      80.3
nano_24_pairwise_Elo  Model=nano, Strategy=P...  2025.12  78.8      79.2    79        78.4
nano_listwise_BT      Model=nano, Strategy=L...  2025.12  78.4      79.6    78.4      77.3
mini_listwise_Elo     Model=mini, Strategy=L...  2025.12  77.8      78.3    77.9      77.2
nano_listwise_Elo     Model=nano, Strategy=L...  2025.12  76.1      76.1    76.3      75.6