Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Long-context Question Answering on LongBench-Cite Average
Loading...
77.6
C Score
Claude-3-sonnet
51.08
57.965
64.85
71.735
Sep 4, 2024
C Score
CLQA Score
CR Score
Updated 4d ago
Evaluation Results
Method
Method
Links
C Score
CLQA Score
CR Score
Claude-3-sonnet
Citation Strategy=LAC-S
2024.09
77.6
78.3
99
GLM-4
Citation Strategy=LAC-S
2024.09
73.7
77.2
95
Mistral-Large-Instruct
Citation Strategy=LAC-S
2024.09
73.6
76.4
96
LongCite-8B
Citation Strategy=LAC-S
2024.09
71.7
67.6
107
LongCite-9B
Citation Strategy=LAC-S
2024.09
70.4
65.6
109
GPT-4o
Citation Strategy=LAC-S
2024.09
69.4
78.2
88
GLM-4-9B-chat
Citation Strategy=LAC-S
2024.09
62.3
70.8
88
Llama-3.1-70B-Instruct
Citation Strategy=LAC-S
2024.09
62
65.5
95
Llama-3.1-8B-Instruct
Citation Strategy=LAC-S
2024.09
52.1
60.2
86
Feedback
Search any
task
Search any
task