Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LongSeAL

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multi-hop Question AnsweringLongSeal
EM15.35
50
Agentic SearchLongSeAL
String-F113.5
14
Long-context reasoningLongSeal
Accuracy64.96
10
Showing 3 of 3 rows