Share your thoughts, 1 month free Claude Pro on usSee more

Long-context Question Answering on DocMath

29.02F1 Score

GLM-4.1V-9B-Thinking VERA

Updated 4mo ago

Evaluation Results

Method	Links
GLM-4.1V-9B-Thinking VERA 2026.02		29.02
GLM-4.1V-9B-Thinking Random RAG 2026.02		21.99
GLM-4.1V-9B-Thinking ColPali RAG 2026.02		21.18
GLM-4.1V-9B-Thinking Embedding RAG 2026.02		17.49
GLM-4.1V-9B-Thinking 2026.02		15.71
Glyph 2026.02		13.61
Qwen3-VL-8B-Instruct VERA 2026.02		9.45
Qwen3-VL-8B-Instruct Random RAG 2026.02		5.61
Qwen3-VL-8B-Instruct OCR RAG 2026.02		4.72
Qwen3-VL-8B-Instruct 2026.02		3.47