Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Long-Context Question Answering on LooGLE
Loading...
66.3
EM
Qwen2.5-OpAmp-72B
58.812
60.756
62.7
64.644
Feb 18, 2025
EM
Updated 4d ago
Evaluation Results
Method
Method
Links
EM
Qwen2.5-OpAmp-72B
Parameters=72B, Adapta...
2025.02
66.3
Qwen2.5-72B-inst
Parameters=72B, Type=I...
2025.02
64.9
DeepSeek-V3
Version=V3
2025.02
63.4
Llama3.3-70B-inst
Parameters=70B, Type=I...
2025.02
63
GPT-4o-0806
Version=0806
2025.02
62.7
Llama3-ChatQA2-70B
Parameters=70B, Versio...
2025.02
59.1
Feedback
Search any
task
Search any
task