Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Token-level identification of extractive spans on Verifiability-Granular (test)
Loading...
76
Precision
GPT-4
44.8
52.9
61
69.1
May 28, 2024
Precision
Recall
F1 Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Precision
Recall
F1 Score
GPT-4
2024.05
76
83
79
Llama-7b
2024.05
73
99
84
Mistral-7b
2024.05
73
99
84
Yi-6b
2024.05
73
99
84
GPT-3.5
2024.05
46
29
36
Feedback
Search any
task
Search any
task