
Selective Question Answering on AbstainQA (val)

Best result: Single Best, accuracy 21

Updated 1mo ago

Evaluation Results

Method	Date	Accuracy
Single Best	2026.05	21
Base	2026.05	18.5
EvoGM	2026.05	16.5
CMA	2026.05	14
PSO-Merging	2026.05	13
Model Swarm	2026.05	13
DARE	2026.05	9.5
Task Arithmetic	2026.05	7.5
TIES	2026.05	7.5
Model Soup	2026.05	7
MTL	2026.05	0.5