| Dataset Name | SOTA Method | Metric | Value | Trend | Updated |
|---|---|---|---|---|---|
| Pronoun Change | PRISM-∆ | Performance Score (P) | 99.66 | 35 | 1mo ago |
| SEAT | CCPA | SEAT 6 | 0.181 | 13 | 1mo ago |
| RedditBias Gender (test) | DExperts (Anti-only) | Regard | 3.54 | 9 | 1mo ago |
| RealWorldQuestioning Health Recommendations | | Shannon Entropy (T-test Statistic) | 0.75 | 4 | 1mo ago |
| RealWorldQuestioning Investment Recommendations | | Shannon Entropy (T-test Statistic) | 0.9 | 4 | 1mo ago |
| RealWorldQuestioning Jobs Recommendations | | Shannon Entropy (T-statistic) | 1.44 | 4 | 1mo ago |
| RealWorldQuestioning Education Recommendations | | Shannon Entropy (T-stat) | 2 | 4 | 1mo ago |
| RealWorldQuestioning Health Recommendations 1.0 | Llama-3 | Proportion (Male More Info) | 80.89 | 4 | 1mo ago |
| RealWorldQuestioning Investment Recommendations 1.0 (Entire Dataset) | Llama-3 | Male More Information | 77.37 | 4 | 1mo ago |
| RealWorldQuestioning Jobs Recommendations 1.0 | Llama-3 | Male More Information | 70.54 | 4 | 1mo ago |
| RealWorldQuestioning Education Recommendations 1.0 (Entire Dataset) | Llama-3 | Proportion Male More Information | 69.62 | 4 | 1mo ago |