Share your thoughts, 1 month free Claude Pro on usSee more

Multi-hop Reasoning on MultiHopRAG

89.6EM

Qwen2.5-OpAmp-72B

Updated 4mo ago

Evaluation Results

Method	Links
Qwen2.5-OpAmp-72B 2025.02		89.6
Qwen2.5-72B-inst 2025.02		89.2
DeepSeek-V3 2025.02		88.6
GPT-4o-0806 2025.02		87.7
Llama3.3-70B-inst 2025.02		83.7
Llama3-ChatQA2-70B 2025.02		78.2
Llama3.1-OpAmp-8B 2025.02		70.5
Mistral-7B-inst-v0.3 2025.02		69.5
Qwen2.5-7B-inst 2025.02		66.9
Llama3.1-8B-inst 2025.02		63.9
Llama3-ChatQA2-8B 2025.02		50.9