Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Span Detection on FT-Summ
Loading...
29.04
F1 Score
ChatGPT-MoE
9.4048
14.5024
19.6
24.6976
Jun 5, 2024
F1 Score
Precision
Recall
Updated 4d ago
Evaluation Results
Method
Method
Links
F1 Score
Precision
Recall
ChatGPT-MoE
Prompt setting=Few-Sho...
2024.06
29.04
27.9
30.27
ChatGPT-MoE
Prompt setting=Zero-Sh...
2024.06
28.04
27.62
28.46
ChatGPT-Span
Prompt setting=Few-Sho...
2024.06
24.31
27.54
23.59
ChatGPT-Span
Prompt setting=Zero-Sh...
2024.06
23.55
24.79
22.44
QAFactEval
2024.06
10.16
7.55
15.52
Feedback
Search any
task
Search any
task