Long-context Question Answering on LongBench-Cite Average

77.6 C Score

Claude-3-sonnet

Sep 4, 2024
Updated 4d ago

Evaluation Results

Method    Links
2024.09    77.6    78.3     99
2024.09    73.7    77.2     95
2024.09    73.6    76.4     96
2024.09    71.7    67.6    107
2024.09    70.4    65.6    109
2024.09    69.4    78.2     88
2024.09    62.3    70.8     88
2024.09    62      65.5     95
2024.09    52.1    60.2     86