Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Question Answering on PQAref 10 samples (test)
Loading...
67
Recall
M2
61.8
63.15
64.5
65.85
Jan 16, 2026
Recall
Updated 5d ago
Evaluation Results
Method
Method
Links
Recall
M2
Model=Mistral-7B-Instr...
2026.01
67
GPT-4 T
Model=GPT-4 Turbo
2026.01
62
0-M2
Model=Mistral-7B-Instr...
2026.01
62
Feedback
Search any
task
Search any
task