Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
End-to-end Question Answering on Bird
Loading...
20.6
Accuracy
ARM
4.064
8.357
12.65
16.943
Jan 30, 2025
Accuracy
Exact Match
F1 Score
Updated 3d ago
Evaluation Results
Method
Method
Links
Accuracy
Exact Match
F1 Score
ARM
Generation model=Llama...
2025.01
20.6
44
51.8
DR@5
Generation model=Llama...
2025.01
17.5
34.2
40.6
DRR@5
Generation model=Llama...
2025.01
16.8
39.8
46.9
DRR-D@5
Generation model=Llama...
2025.01
15.9
38.8
46.3
DR-D@5
Generation model=Llama...
2025.01
13.9
27.2
33.3
ReAct
Generation model=Llama...
2025.01
4.7
27
32.5
Feedback
Search any
task
Search any
task