Knowledge-intensive Question Answering on OpenScience

53.8 Pass@1

P-POTS+Mirror

Updated 2mo ago

Evaluation Results

Method (date)	Links	Pass@1
P-POTS+Mirror 2025.11		53.8
P-POTS+Mirror 2025.11		53
P-POTS+Mirror 2025.11		52.53
Qwen3-8B-Base 2025.11		51.62
P-POTS+Mirror 2025.11		50.8
Clipped 2025.11		49.33
P-POTS 2025.11		47.6
P-POTS 2025.11		47.47
EMA 2025.11		47.08
P-POTS 2025.11		46.8
Llama3.1-8B-Instruct 2025.11		46.59
Qwen2.5-7B-Instruct 2025.11		46.42
MIRROR 2025.11		46.38
Standard 2025.11		45.52
ISAD 2025.11		45.4
P-POTS 2025.11		45.34