Long-Context Question Answering on LooGLE

66.3 EM

Qwen2.5-OpAmp-72B

Updated 4mo ago

Evaluation Results

Method	Date	EM
Qwen2.5-OpAmp-72B	2025.02	66.3
Qwen2.5-72B-inst	2025.02	64.9
DeepSeek-V3	2025.02	63.4
Llama3.3-70B-inst	2025.02	63
GPT-4o-0806	2025.02	62.7
Llama3-ChatQA2-70B	2025.02	59.1