Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Casual Mention Span Extraction on SHINES
Loading...
85
F1 Score
Llama-3.1-8B-Instruct
77.72
79.61
81.5
83.39
Jun 5, 2025
F1 Score
Updated 4d ago
Evaluation Results
Method
Method
Links
F1 Score
Llama-3.1-8B-Instruct
Protocol=Multitask Fin...
2025.06
85
Mental-Alpaca-7B
Protocol=Multitask Fin...
2025.06
83
Llama-3.1-8B-Instruct
Protocol=Multitask Fin...
2025.06
83
Mental-Alpaca-7B
Protocol=Multitask Fin...
2025.06
81
MentaLLaMA-chat-7B
Protocol=Multitask Fin...
2025.06
80
MentaLLaMA-chat-7B
Protocol=Multitask Fin...
2025.06
78
Feedback
Search any
task
Search any
task