Argument Quality Evaluation on WebisCMV 20
[Chart: Accuracy. Best result: 56.7, Qwen3-8B-Arguinas-Target-SFT, Mar 18, 2026]
Updated 1mo ago
Evaluation Results
Method                              Details                    Date     Accuracy
Qwen3-8B-Arguinas-Target-SFT        Model Backbone=Qwen3-8...  2026.03  56.7
Qwen3-4B-Arguinas-Target-SFT        Model Backbone=Qwen3-4...  2026.03  56.03
Qwen3-4B-ArgumentOnly-Target-SFT    Model Backbone=Qwen3-4...  2026.03  55.45
Qwen3-4B-EntailmentBank-Target-SFT  Model Backbone=Qwen3-4...  2026.03  55.33
Qwen3-4B-Target-SFT                 Model Backbone=Qwen3-4...  2026.03  53.8
Qwen3-4B-Instruct-Arguinas-SFT      Model Backbone=Qwen3-4...  2026.03  53.42
Qwen3-4B-AAAC-Target-SFT            Model Backbone=Qwen3-4...  2026.03  53.34
Qwen3-8B-Target-SFT                 Model Backbone=Qwen3-8...  2026.03  53.3
Qwen2.5-7B-Instruct-Arguinas-SFT    Model Backbone=Qwen2.5...  2026.03  52.58
Qwen3-4B-Instruct                   Model Backbone=Qwen3-4...  2026.03  52.5
Qwen3-8B-AAAC-Target-SFT            Model Backbone=Qwen3-8...  2026.03  51.46
Qwen2.5-7B-Instruct                 Model Backbone=Qwen2.5...  2026.03  50.63
Qwen3-8B-ArgumentOnly-Target-SFT    Model Backbone=Qwen3-8...  2026.03  50.57
Qwen3-8B-EntailmentBank-Target-SFT  Model Backbone=Qwen3-8...  2026.03  50.22