Our new X account is live! Follow @wizwand_team for updates

Long-context Question Answering on LongBench-Cite Average

77.6C Score

Claude-3-sonnet

Updated 4d ago

Evaluation Results

Method	Links
Claude-3-sonnet 2024.09		77.6	78.3	99
GLM-4 2024.09		73.7	77.2	95
Mistral-Large-Instruct 2024.09		73.6	76.4	96
LongCite-8B 2024.09		71.7	67.6	107
LongCite-9B 2024.09		70.4	65.6	109
GPT-4o 2024.09		69.4	78.2	88
GLM-4-9B-chat 2024.09		62.3	70.8	88
Llama-3.1-70B-Instruct 2024.09		62	65.5	95
Llama-3.1-8B-Instruct 2024.09		52.1	60.2	86