Share your thoughts, 1 month free Claude Pro on usSee more

Long-context Retrieval on NIH

100Multi-needle Avg Recall

GPT-4

Updated 1mo ago

Evaluation Results

Method	Links
GPT-4 2024.07		100
GPT-4o 2024.07		100
Llama 3 8B 2024.07		98.8
Llama 3 405B 2024.07		98.1
Llama 3 70B 2024.07		97.5
Claude 3.5 Sonnet 2024.07		90.8