Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Logical reasoning multi-choice QA on LogiQA v2 (test)
Loading...
55.5
Macro F1 Score
T5-large
29.1048
35.9574
42.81
49.6626
Feb 17, 2025
Macro F1 Score
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Macro F1 Score
Accuracy
T5-large
Warmup=true
2025.02
55.5
55.62
T5-large
Warmup=false
2025.02
54.46
54.54
T5-base
Warmup=true
2025.02
50
50.06
T5-base
Warmup=false
2025.02
49.16
49.19
Llama-3.2-1B
Warmup=true
2025.02
32.63
33.55
Llama-3.2-1B
Warmup=false
2025.02
30.12
31.7
Feedback
Search any
task
Search any
task