Share your thoughts, 1 month free Claude Pro on usSee more

Long-context Question Answering on LongBench Pro

34.2F1 Score

GLM-4.1V-9B-Thinking VERA

Updated 4mo ago

Evaluation Results

Method	Links
GLM-4.1V-9B-Thinking VERA 2026.02		34.2
GLM-4.1V-9B-Thinking 2026.02		31.06
Glyph 2026.02		28.94
GLM-4.1V-9B-Thinking Random RAG 2026.02		28.84
Qwen3-VL-8B-Instruct VERA 2026.02		28.74
GLM-4.1V-9B-Thinking ColPali RAG 2026.02		28.58
Qwen3-VL-8B-Instruct Random RAG 2026.02		28
Qwen3-VL-8B-Instruct 2026.02		27.56
Qwen3-VL-8B-Instruct OCR RAG 2026.02		26.4
GLM-4.1V-9B-Thinking Embedding RAG 2026.02		26.29