Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Span Detection on LLM
Loading...
0.3322
F1 Score
ChatGPT-MoE
0.075528
0.142164
0.2088
0.275436
Jun 5, 2024
F1 Score
Precision
Recall
Updated 4d ago
Evaluation Results
Method
Method
Links
F1 Score
Precision
Recall
ChatGPT-MoE
Prompt setting=Few-Sho...
2024.06
0.3322
0.356
0.3114
ChatGPT-MoE
Prompt setting=Zero-Sh...
2024.06
0.3129
0.3393
0.2904
ChatGPT-Span
Prompt setting=Zero-Sh...
2024.06
0.3059
0.3347
0.2817
ChatGPT-Span
Prompt setting=Few-Sho...
2024.06
0.3051
0.3337
0.281
QAFactEval
2024.06
0.0854
0.0749
0.0991
Feedback
Search any
task
Search any
task