Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multi-hop

Benchmarks

Task NameDataset NameSOTA ResultTrend
Information RetrievalMulti-hop
NDCG@1058.16
12
Multi-hop RetrievalMulti-hop 4 datasets aggregate (test)
NDCG@1058.5
8
Multi-hop reasoningMulti-hop 2-hop N=500
Accuracy79
2
Showing 3 of 3 rows