Share your thoughts, 1 month free Claude Pro on usSee more

Long-context Question Answering on NarrativeQA (EM)

61.7Exact Match

Qwen2.5-OpAmp-72B

Updated 4mo ago

Evaluation Results

Method	Links
Qwen2.5-OpAmp-72B 2025.02		61.7
Llama3.3-70B-inst 2025.02		61.5
GPT-4o-0806 2025.02		61.5
DeepSeek-V3 2025.02		60.5
Qwen2.5-72B-inst 2025.02		60.2
Llama3-ChatQA2-70B 2025.02		59.8
Llama3.1-OpAmp-8B 2025.02		57.4
Llama3.1-8B-inst 2025.02		55.9
Llama3-ChatQA2-8B 2025.02		53.1
Qwen2.5-7B-inst 2025.02		47.7
Mistral-7B-inst-v0.3 2025.02		44.7