Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Argument Quality Evaluation on UKPConvArg 2
Loading...
92.86
Accuracy
Qwen3-8B-Arguinas-Target-SFT
76.8336
80.9943
85.155
89.3157
Mar 18, 2026
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Qwen3-8B-Arguinas-Target-SFT
Model Backbone=Qwen3-8...
2026.03
92.86
Qwen3-4B-Arguinas-Target-SFT
Model Backbone=Qwen3-4...
2026.03
92.79
Qwen3-4B-AAAC-Target-SFT
Model Backbone=Qwen3-4...
2026.03
92.74
Qwen3-8B-ArgumentOnly-Target-SFT
Model Backbone=Qwen3-8...
2026.03
92.56
Qwen3-4B-EntailmentBank-Target-SFT
Model Backbone=Qwen3-4...
2026.03
92.49
Qwen3-8B-Target-SFT
Model Backbone=Qwen3-8...
2026.03
92.49
Qwen3-4B-ArgumentOnly-Target-SFT
Model Backbone=Qwen3-4...
2026.03
92.45
Qwen3-4B-Target-SFT
Model Backbone=Qwen3-4...
2026.03
92.35
Qwen3-8B-AAAC-Target-SFT
Model Backbone=Qwen3-8...
2026.03
92.12
Qwen3-8B-EntailmentBank-Target-SFT
Model Backbone=Qwen3-8...
2026.03
91.4
Qwen3-4B-Instruct-Arguinas-SFT
Model Backbone=Qwen3-4...
2026.03
80.22
Qwen3-4B-Instruct
Model Backbone=Qwen3-4...
2026.03
77.66
Qwen2.5-7B-Instruct-Arguinas-SFT
Model Backbone=Qwen2.5...
2026.03
77.59
Qwen2.5-7B-Instruct
Model Backbone=Qwen2.5...
2026.03
77.45
Feedback
Search any
task
Search any
task