Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Retrieval on OTT-QA
Loading...
47.3
Precision
ARM
14.02
22.66
31.3
39.94
Jan 30, 2025
Precision
Recall
F1 Score
PR Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Precision
Recall
F1 Score
PR Score
ARM
Backbone=Llama3.1-8B-I...
2025.01
47.3
79.8
55
62.5
ReAct
Backbone=GPT4o-mini, #...
2025.01
21.7
80.6
30.9
62.7
ReAct
Backbone=Llama3.1-8B-I...
2025.01
15.3
76
23.1
55.1
Feedback
Search any
task
Search any
task