Long-context Retrieval on NIH

[Chart: Multi-needle Avg Recall; labeled entry: GPT-4; date shown: Jul 31, 2024]
Updated 1mo ago

Evaluation Results

Method    Links
2024.07
100
2024.07
100
2024.07
98.8
98.1
97.5
90.8