Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

LooGLE

Benchmarks

Task NameDataset NameSOTA ResultTrend
Single-hop Question AnsweringLoogle SD
Score45.1
17
Question AnsweringLooGLE Long Dependency QA
BLEU-10.0942
12
SummarizationLooGLE ArXiv Paper Summarization
BLEU-129.15
11
Long Dependency Question AnsweringLooGLE
Retrieval40
9
Long-Context Question AnsweringLooGLE
EM66.3
6
Multi-hop Question AnsweringLooGLE CR 16k
Score19.78
5
Multi-hop Question AnsweringLooGLE-MR 16k
Score15.1
5
Single-hop Question AnsweringLooGLE-SD 16k
Score45.1
5
Showing 8 of 8 rows