Long-context Question Answering on GovReport
[Chart: C Score over time. Leading entry: Claude-3-sonnet, C Score 68.4, Sep 4, 2024. Selectable metrics: C Score, CLQA Score, CR Score.]
Updated 4d ago
Evaluation Results
| Method | Citation Strategy | Date | C Score | CLQA Score | CR Score |
|---|---|---|---|---|---|
| Claude-3-sonnet | LAC-S | 2024.09 | 68.4 | 70.1 | 98 |
| LongCite-8B | LAC-S | 2024.09 | 63 | 54.4 | 116 |
| Mistral-Large-Instruct | LAC-S | 2024.09 | 60.4 | 68.3 | 88 |
| LongCite-9B | LAC-S | 2024.09 | 59.6 | 46.4 | 128 |
| GLM-4 | LAC-S | 2024.09 | 59.4 | 65.9 | 90 |
| GLM-4-9B-chat | LAC-S | 2024.09 | 59.3 | 61.6 | 96 |
| Llama-3.1-70B-Instruct | LAC-S | 2024.09 | 56.3 | 66.9 | 84 |
| Llama-3.1-8B-Instruct | LAC-S | 2024.09 | 49.6 | 62.1 | 80 |
| GPT-4o | LAC-S | 2024.09 | 46 | 61.3 | 75 |