Long-context Question Answering on Dureader
[Chart: C Score by date. Labeled point: GPT-4o, C Score 81, Sep 4, 2024. Metrics: C Score, CLQA Score, CR Score.]
Updated 4d ago
Evaluation Results
Method                  Setting                  Date     C Score  CLQA Score  CR Score
GPT-4o                  Citation Strategy=LAC-S  2024.09  81       83.3        97
Mistral-Large-Instruct  Citation Strategy=LAC-S  2024.09  79       83.3        95
GLM-4                   Citation Strategy=LAC-S  2024.09  76       75.8        100
Claude-3-sonnet         Citation Strategy=LAC-S  2024.09  75.8     80.3        94
LongCite-9B             Citation Strategy=LAC-S  2024.09  69       66.3        104
LongCite-8B             Citation Strategy=LAC-S  2024.09  68.5     62.3        110
GLM-4-9B-chat           Citation Strategy=LAC-S  2024.09  49.3     68.1        72
Llama-3.1-70B-Instruct  Citation Strategy=LAC-S  2024.09  43.3     42.5        102
Llama-3.1-8B-Instruct   Citation Strategy=LAC-S  2024.09  29.8     39.4        76
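
In the results above, each CR Score appears to equal the C Score divided by the CLQA Score, expressed as a percentage and rounded to the nearest integer. This page does not state that relationship; it is an observation from the listed numbers. The sketch below checks it row by row. The names ROWS and cr_from_scores are hypothetical and used only for illustration.

```python
# Assumption: CR Score ~= round(C Score / CLQA Score * 100).
# This is inferred from the listed numbers, not stated on the page.

# (method, C Score, CLQA Score, listed CR Score), copied from the results table.
ROWS = [
    ("GPT-4o", 81.0, 83.3, 97),
    ("Mistral-Large-Instruct", 79.0, 83.3, 95),
    ("GLM-4", 76.0, 75.8, 100),
    ("Claude-3-sonnet", 75.8, 80.3, 94),
    ("LongCite-9B", 69.0, 66.3, 104),
    ("LongCite-8B", 68.5, 62.3, 110),
    ("GLM-4-9B-chat", 49.3, 68.1, 72),
    ("Llama-3.1-70B-Instruct", 43.3, 42.5, 102),
    ("Llama-3.1-8B-Instruct", 29.8, 39.4, 76),
]


def cr_from_scores(c_score: float, clqa_score: float) -> int:
    """Return the ratio of C Score to CLQA Score as a rounded percentage."""
    return round(c_score / clqa_score * 100)


# Collect any rows where the recomputed value differs from the listed one.
mismatches = [
    (method, listed, cr_from_scores(c, clqa))
    for method, c, clqa, listed in ROWS
    if cr_from_scores(c, clqa) != listed
]

if __name__ == "__main__":
    for method, c, clqa, listed in ROWS:
        print(f"{method:24s} listed={listed:3d} recomputed={cr_from_scores(c, clqa):3d}")
    print("mismatches:", mismatches)
```

Under this reading, a CR Score above 100 (as for LongCite-9B, LongCite-8B and Llama-3.1-70B-Instruct) means the C Score is higher than the CLQA Score for that method.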