Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LongBookQA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Question AnsweringLongBookQA en
F1 Score25.04
5
Question AnsweringLongBookQA-zh (test)
F139.44
5
Showing 2 of 2 rows