Share your thoughts, 1 month free Claude Pro on usSee more

Long-context Answering with Citations on MultifieldQA

79Citation Recall

GPT-4o

Updated 4mo ago

Evaluation Results

Method	Links
GPT-4o 2024.09		79	87.9	80.6
LongCite-8B 2024.09		74.7	93	80.8
GLM-4 2024.09		72.3	80.1	73.6
Mistral-Large-Instruct 2024.09		71.8	80.7	73.8
LongCite-9B 2024.09		67.3	91	74.8
Claude-3-sonnet 2024.09		64.7	85.8	71.3
Llama-3.1-70B-Instruct 2024.09		53.2	65.2	53.9
GLM-4-9B-chat 2024.09		51.1	60.6	52
Llama-3.1-8B-Instruct 2024.09		29.8	44.3	31.6