Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Large Language Model Debiasing on BBQ and CrowS-Pairs In-Distribution (test)

0.94Mean Bias

UGID (OURS)

-0.06126.696913.45520.2131Mar 19, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
0.940.940.0070.058100121.11-----
2026.03
1.031.130.1483.5765010.66-----
2026.03
1.161.290.113.8131003.76-----
2026.03
3.477.88-----5.2918.811.081
2026.03
4.4211.8-----4.3216.911.191
2026.03
6.3419.420.2115.198100118.07-----
2026.03
7.1421.990.2115.198100118.07-----
2026.03
13.0427.28-----14.38184.972.321
2026.03
25.9758.250.2115.19875118.07-----