Long-context answering with citations on GovReport

82.8 Citation Recall

GLM-4

Updated 4mo ago

Evaluation Results

Method	Links	Citation Recall	Citation Precision	Citation F1
GLM-4 2024.09		82.8	93.4	87.1
Claude-3-sonnet 2024.09		77.4	93.9	84.1
LongCite-8B 2024.09		74	86.6	78.5
GPT-4o 2024.09		73.4	90.4	79.8
Mistral-Large-Instruct 2024.09		67.9	79.6	72.5
LongCite-9B 2024.09		63.4	76.5	68.2
Llama-3.1-70B-Instruct 2024.09		53.4	77.5	60.7
Llama-3.1-8B-Instruct 2024.09		16.2	25.3	16.8
GLM-4-9B-chat 2024.09		5.7	8.2	6.3