Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Serious Intent Span Extraction on SHINES
Loading...
84
F1 Score
Llama-3.1-8B-Instruct
78.8
80.15
81.5
82.85
Jun 5, 2025
F1 Score
Updated 4d ago
Evaluation Results
Method
Method
Links
F1 Score
Llama-3.1-8B-Instruct
Protocol=Multitask Fin...
2025.06
84
Mental-Alpaca-7B
Protocol=Multitask Fin...
2025.06
82
MentaLLaMA-chat-7B
Protocol=Multitask Fin...
2025.06
81
Llama-3.1-8B-Instruct
Protocol=Multitask Fin...
2025.06
81
Mental-Alpaca-7B
Protocol=Multitask Fin...
2025.06
80
MentaLLaMA-chat-7B
Protocol=Multitask Fin...
2025.06
79
Feedback
Search any
task
Search any
task