Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Language Modeling on RedditBias Gender (test)
Loading...
94.92
LM Score
No Debiasing
83.3864
86.3807
89.375
92.3693
Dec 2, 2024
LM Score
Perplexity (PPL)
Updated 1mo ago
Evaluation Results
Method
Method
Links
LM Score
Perplexity (PPL)
No Debiasing
Target Model=Llama 3.2 3B
2024.12
94.92
10.36
Fine-tuned
Target Model=Llama 3.2 3B
2024.12
94.62
11
No Debiasing
Target Model=GPT-2 Medium
2024.12
93.58
19.1
Fine-tuned
Target Model=GPT-2 Medium
2024.12
93.05
27.12
DExperts (Proposed)
Target Model=Llama 3.2...
2024.12
92.84
11.03
DExperts (Proposed)
Target Model=GPT-2 Med...
2024.12
92.4
20.12
DExperts (Anti-only)
Target Model=GPT-2 Med...
2024.12
90.6
27.06
Trigger
Target Model=GPT-2 Medium
2024.12
87.01
19.38
DExperts (Anti-only)
Target Model=Llama 3.2...
2024.12
83.83
15.63
Feedback
Search any
task
Search any
task