Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MMhops

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multi-hop Question AnsweringMMhops Comparison (test)
Accuracy29.39
13
Multi-hop Question AnsweringMMhops Bridging (test)
String Success Rate58.8
13
Showing 2 of 2 rows