Argument Reasoning on ArgsNovel
[Chart: Accuracy over time. Best result: 53.59 by Qwen3-4B-Instruct-Arguinas-SFT, Mar 18, 2026.]
Evaluation Results
| Method | Details | Date | Accuracy |
|---|---|---|---|
| Qwen3-4B-Instruct-Arguinas-SFT | Model Backbone=Qwen3-4... | 2026.03 | 53.59 |
| Qwen3-8B-Arguinas-Target-SFT | Model Backbone=Qwen3-8... | 2026.03 | 53.39 |
| Qwen3-8B-Target-SFT | Model Backbone=Qwen3-8... | 2026.03 | 53.05 |
| Qwen3-4B-Arguinas-Target-SFT | Model Backbone=Qwen3-4... | 2026.03 | 52.52 |
| Qwen3-4B-Target-SFT | Model Backbone=Qwen3-4... | 2026.03 | 51.99 |
| Qwen3-4B-ArgumentOnly-Target-SFT | Model Backbone=Qwen3-4... | 2026.03 | 49.88 |
| Qwen3-8B-AAAC-Target-SFT | Model Backbone=Qwen3-8... | 2026.03 | 49.85 |
| Qwen2.5-7B-Instruct | Model Backbone=Qwen2.5... | 2026.03 | 49.72 |
| Qwen3-4B-AAAC-Target-SFT | Model Backbone=Qwen3-4... | 2026.03 | 49.44 |
| Qwen3-4B-Instruct | Model Backbone=Qwen3-4... | 2026.03 | 49.3 |
| Qwen3-4B-EntailmentBank-Target-SFT | Model Backbone=Qwen3-4... | 2026.03 | 49.07 |
| Qwen2.5-7B-Instruct-Arguinas-SFT | Model Backbone=Qwen2.5... | 2026.03 | 47.52 |
| Qwen3-8B-ArgumentOnly-Target-SFT | Model Backbone=Qwen3-8... | 2026.03 | 46.35 |
| Qwen3-8B-EntailmentBank-Target-SFT | Model Backbone=Qwen3-8... | 2026.03 | 45.98 |