Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Question Answering on PQAref 823 samples (test)
Loading...
1
Missed Abstract Count
GPT-4 T
-6.36
43.32
93
142.68
Jan 16, 2026
Missed Abstract Count
Updated 5d ago
Evaluation Results
Method
Method
Links
Missed Abstract Count
GPT-4 T
Model=GPT-4 Turbo
2026.01
1
M2
Model=Mistral-7B-Instr...
2026.01
10
0-M2
Model=Mistral-7B-Instr...
2026.01
185
Feedback
Search any
task
Search any
task