End-to-end Question Answering on Bird
[Chart: ARM, Accuracy 20.6, Jan 30, 2025. Metrics shown: Accuracy, Exact Match, F1 Score]
Updated 1mo ago
Evaluation Results
Method    Details                      Date      Accuracy  Exact Match  F1 Score
ARM       Generation model=Llama...    2025.01   20.6      44           51.8
DR@5      Generation model=Llama...    2025.01   17.5      34.2         40.6
DRR@5     Generation model=Llama...    2025.01   16.8      39.8         46.9
DRR-D@5   Generation model=Llama...    2025.01   15.9      38.8         46.3
DR-D@5    Generation model=Llama...    2025.01   13.9      27.2         33.3
ReAct     Generation model=Llama...    2025.01   4.7       27           32.5