Politeness-controlled Text Generation on WIKIPOL
[Chart: Accuracy over time. Best result: 79.2 Accuracy (SWAI), Jan 16, 2026.]
Updated 1mo ago
Evaluation Results
| Method | Model | Date | Accuracy | F1 Score | Precision | Recall | Confidence Score | Cohen's Kappa | Matthews Corr. Coeff. |
|---|---|---|---|---|---|---|---|---|---|
| SWAI | Llama3.2 1B | 2026.01 | 79.2 | 70.9 | 84.96 | 66.37 | 57.7 | 58.2 | 61.6 |
| SWAI | Llama3.1 8B | 2026.01 | 77.2 | 73 | 82.89 | 69.28 | 60.3 | 59.4 | 62.6 |
| Baseline | Llama3.1 8B | 2026.01 | 56.7 | 56.9 | 56.79 | 57.1 | 74.8 | 35.1 | 35.1 |
| Baseline | Llama3.2 1B | 2026.01 | 49.48 | 49.4 | 49.34 | 49.59 | 73.4 | 24.1 | 24.1 |