Share your thoughts, 1 month free Claude Pro on usSee more

Language Modeling on RedditBias Gender (test)

94.92LM Score

No Debiasing

Updated 4mo ago

Evaluation Results

Method	Links
No Debiasing 2024.12		94.92	10.36
Fine-tuned 2024.12		94.62	11
No Debiasing 2024.12		93.58	19.1
Fine-tuned 2024.12		93.05	27.12
DExperts (Proposed) 2024.12		92.84	11.03
DExperts (Proposed) 2024.12		92.4	20.12
DExperts (Anti-only) 2024.12		90.6	27.06
Trigger 2024.12		87.01	19.38
DExperts (Anti-only) 2024.12		83.83	15.63