Retrieval on Bird
[Chart: Precision by date. Highlighted point: ARM, 42.7 (Jan 30, 2025). Available metrics: Precision, Recall, F1 Score, PR AUC.]
Updated 4d ago
Evaluation Results
Method  Configuration              Date     Precision  Recall  F1 Score  PR AUC
ARM     Backbone=Llama3.1-8B-I...  2025.01  42.7       96.5    56        92.7
ReAct   Backbone=GPT4o-mini, #...  2025.01  25.1       97      37.8      93.3
ReAct   Backbone=Llama3.1-8B-I...  2025.01  15         96.7    24.5      93.5