Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

InfoQA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multi-hop Question AnsweringInfoQA Synthetic Multi-hop QA 1-hop
Average F1100
24
Document Visual Question AnsweringInfoQA 105 (test)
Score86.9
23
Multi-hop Question AnsweringInfoQA Synthetic Multi-hop QA (2–4 hop)
Avg F198
12
Multi-hop Question AnsweringInfoQA Synthetic Multi-hop QA (4-hop)
Avg F180
4
Multi-hop Question AnsweringInfoQA Synthetic Multi-hop QA 3-hop
Avg F1 Score98
4
Showing 5 of 5 rows