Share your thoughts, 1 month free Claude Pro on usSee more

Long Context Question Answering on MultiFieldQA (Accuracy)

57.33Accuracy

POP

Updated 4mo ago

Evaluation Results

Method	Links
POP 2026.02		57.33
Full Model 2026.02		55.9
Wanda 2026.02		55.28
Full Model 2026.02		54.57
Full Model 2026.02		53.53
POP 2026.02		52.88
Wanda 2026.02		52.87
Wanda 2026.02		52.8
POP 2026.02		52.34
SliceGPT 2026.02		40.76
ShortGPT 2026.02		21.44
SliceGPT 2026.02		12.35
SliceGPT 2026.02		10.83
ShortGPT 2026.02		6.8
ShortGPT 2026.02		1.58