Argument Relation Prediction and Classification on ElecDeb
Top result: Llama-3.1-8B-Instruct, Macro F1 68 (Aug 5, 2025)
Evaluation Results
Method                     Evaluation Protocol   Date      Macro F1
Llama-3.1-8B-Instruct      fi...                 2025.08   68
Mistral-7B-Instruct-v0.3   fi...                 2025.08   67
gemma-3-4b-it              fi...                 2025.08   63
gpt-4.1-nano               on...                 2025.08   46
Mistral-7B-Instruct-v0.3   on...                 2025.08   40
gpt-4.1-nano               3-...                 2025.08   38
gemma-3-4b-it              on...                 2025.08   37
Mistral-7B-Instruct-v0.3   3-...                 2025.08   32
Llama-3.1-8B-Instruct      3-...                 2025.08   31
gemma-3-4b-it              3-...                 2025.08   27
Llama-3.1-8B-Instruct      on...                 2025.08   2
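
For readers unfamiliar with the metric, Macro F1 is the unweighted mean of the per-class F1 scores, so rare relation classes count as much as frequent ones. The following is a minimal sketch of that computation, not the benchmark's own scoring code. The function name `macro_f1`, the label names (`support`, `attack`, `none`) and the 0-100 scaling are assumptions made for illustration.

```python
from collections import Counter


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1, scaled to 0-100 (scaling is an assumption)."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted class gains a false positive
            fn[truth] += 1  # true class gains a false negative
    per_class = []
    for label in labels:
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the class never occurs
        denom = 2 * tp[label] + fp[label] + fn[label]
        per_class.append(2 * tp[label] / denom if denom else 0.0)
    return 100.0 * sum(per_class) / len(per_class)


# Hypothetical labels, for illustration only
gold = ["support", "attack", "none", "none"]
pred = ["support", "none", "none", "none"]
print(round(macro_f1(gold, pred), 2))
```

In this toy case the per-class F1 scores are 1.0 (`support`), 0.0 (`attack`) and 0.8 (`none`), so a single missed class pulls the macro average down sharply.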