Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Reasoning on LiveCodeBench (pass@1 accuracy)
Loading...
55.45
pass@1 Accuracy
Baseline
33.7244
39.3647
45.005
50.6453
Oct 1, 2025
pass@1 Accuracy
Updated 23d ago
Evaluation Results
Method
Method
Links
pass@1 Accuracy
Baseline
Model=QwQ-32B, Bit-Wid...
2025.10
55.45
ThinKV (k=1024)
Model=QwQ-32B, Bit-Wid...
2025.10
50.47
Baseline
Model=R1-Qwen-14B, Bit...
2025.10
47.9
PM-KVQ
Model=QwQ-32B, Bit-Wid...
2025.10
46.68
ThinKV (k=1024)
Model=R1-Qwen-14B, Bit...
2025.10
45.84
PM-KVQ
Model=R1-Qwen-14B, Bit...
2025.10
41.97
KIVI
Model=QwQ-32B, Bit-Wid...
2025.10
40.75
KIVI
Model=R1-Qwen-14B, Bit...
2025.10
34.56
Feedback
Search any
task
Search any
task