Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Long-context language modeling on LongBench (test)
Loading...
9.79
Qasper Score
MoQAE
9.114
9.2895
9.465
9.6405
Jun 9, 2025
Qasper Score
QMSum Score
MultiNews Score
TREC Accuracy
TriviaQA Score
SAMSum Score
LCC Score
RepoBench-P Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Qasper Score
QMSum Score
MultiNews Score
TREC Accuracy
TriviaQA Score
SAMSum Score
LCC Score
RepoBench-P Score
MoQAE
2025.06
9.79
21.23
3.47
66
87.89
41.37
66.53
59.94
CQ-4c8b
channels per group=4,...
2025.06
9.58
20.87
1.93
66
87.72
41.13
66.57
59.75
FP16
precision=full precisi...
2025.06
9.52
21.28
3.51
66
87.72
41.69
66.66
59.82
KIVI-2b
quantization bits=2-bit
2025.06
9.26
20.53
0.97
66
87.42
42.61
66.22
59.67
MiKV
quantization_type=mixe...
2025.06
9.14
20.63
0.85
65.88
87.21
41.44
66.18
59.55
Feedback
Search any
task
Search any
task