Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Event Detection on MAVEN
Loading...
56.82
Macro Precision
Llama 3
49.8416
51.6533
53.465
55.2767
Jan 17, 2026
Macro Precision
Macro Recall
Micro F1 Score
Macro F1 Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Macro Precision
Macro Recall
Micro F1 Score
Macro F1 Score
Llama 3
Size=8B, Temp=0.4, Sho...
2026.01
56.82
51.72
63.95
54.13
Qwen 2.5
Size=7B, Temp=0.4, Sho...
2026.01
55.51
50.68
61.87
52.99
Gemma
Size=8B, Temp=0.4, Sho...
2026.01
54.02
49.21
61.31
51.49
DeepSeek
Size=7B, Temp=0.4, Sho...
2026.01
53.13
48.14
61.01
50.53
Llama 3
Size=8B, Temp=0.4, Sho...
2026.01
51.36
46.19
36.1
24.11
Gemma
Size=8B, Temp=0.4, Sho...
2026.01
51.24
46.29
34.76
22.95
Qwen 2.5
Size=7B, Temp=0.4, Sho...
2026.01
50.11
45.15
33.23
22.75
Feedback
Search any
task
Search any
task