Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Argument Relation Prediction and Classification on ArgUNSC
Loading...
65
Macro F1
Mistral-7B-Instruct-v0.3
0.52
17.26
34
50.74
Aug 5, 2025
Macro F1
Updated 4d ago
Evaluation Results
Method
Method
Links
Macro F1
Mistral-7B-Instruct-v0.3
Evaluation Protocol=fi...
2025.08
65
Llama-3.1-8B-Instruct
Evaluation Protocol=fi...
2025.08
52
gemma-3-4b-it
Evaluation Protocol=fi...
2025.08
43
gpt-4.1-nano
Evaluation Protocol=on...
2025.08
38
gemma-3-4b-it
Evaluation Protocol=3-...
2025.08
33
Mistral-7B-Instruct-v0.3
Evaluation Protocol=3-...
2025.08
32
gemma-3-4b-it
Evaluation Protocol=on...
2025.08
30
Mistral-7B-Instruct-v0.3
Evaluation Protocol=on...
2025.08
30
Llama-3.1-8B-Instruct
Evaluation Protocol=3-...
2025.08
29
gpt-4.1-nano
Evaluation Protocol=3-...
2025.08
27
Llama-3.1-8B-Instruct
Evaluation Protocol=on...
2025.08
3
Feedback
Search any
task
Search any
task