Language Modeling on RedditBias Religion
[Chart: LM Score over time. Highlighted point: Finetuned, LM Score 94.09, Dec 2, 2024. Selectable metrics: LM Score, PPL.]
Updated 1mo ago
Evaluation Results
| Method    | Target Model | Date    | LM Score | PPL   |
|-----------|--------------|---------|----------|-------|
| Finetuned | Llama 3.2... | 2024.12 | 94.09    | 11    |
| None      | Llama 3.2... | 2024.12 | 92.18    | 10.36 |
| None      | GPT-2 Med... | 2024.12 | 90.46    | 19.1  |
| Finetuned | GPT-2 Med... | 2024.12 | 89.79    | 27.12 |
| Proposed  | GPT-2 Med... | 2024.12 | 87.49    | 20.12 |
| Anti-only | GPT-2 Med... | 2024.12 | 85.49    | 27.06 |
| Proposed  | Llama 3.2... | 2024.12 | 84.92    | 11.03 |
| Anti-only | Llama 3.2... | 2024.12 | 75.56    | 15.63 |