Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Question Answering on MetaQA
Loading...
95.5
HS
AURA
94.668
94.884
95.1
95.316
Jan 1, 2026
HS
ARR
Updated 3mo ago
Evaluation Results
Method
Method
Links
HS
ARR
AURA
Model=Qwen-2.5-7B
2026.01
95.5
100
AURA
Model=Llama2-7B
2026.01
95.2
100
AURA
Model=Gemini-2.5-flash
2026.01
94.9
100
AURA
Model=GPT-4o
2026.01
94.7
100
Feedback
Search any
task
Search any
task