| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Counterfactual Input Evaluation | CrowS-Pairs | SS42.14 | 33 | |
| Religious Bias Evaluation | Multilingual CrowS-Pairs (test) | Bias Score (DE)4.17 | 18 | |
| Racial Bias Evaluation | Multilingual CrowS-Pairs racial bias | Bias Score (DE)16.37 | 18 | |
| Gender Bias Mitigation | Multilingual CrowS-Pairs gender-sensitive attributes | Bias Score (DE)0.83 | 18 | |
| Fairness Evaluation | CrowS-Pairs | Score72.2 | 16 | |
| Bias Evaluation | Crows-pairs | Pct Stereotype51.25 | 15 | |
| Bias Evaluation | CrowS-Pairs | CS Score50.01 | 13 | |
| Bias Measurement | CrowS-Pairs (test) | Gender65.7 | 3 |