Open Book Question Answering on OpenBookQA
[Chart: Normalized Log Accuracy over time, Mar 16, 2026 to Mar 25, 2026. Top result: T-Free, 89.4. Selectable metrics: Normalized Log Accuracy, Memory Usage (Bytes/Seq).]
Updated 23d ago
Evaluation Results
Method                                      Links     Normalized Log Accuracy   Memory Usage (Bytes/Seq)
T-Free (shots=10-shot)                      2026.03   89.4                      4.85
HATified (shots=10-shot)                    2026.03   86.8                      4.85
Llama (shots=10-shot)                       2026.03   84.6                      4.35
No pruning (Sparsity=0, Backbone=G...)      2026.03   47.2                      -
Magnitude-Dim (Sparsity=10%, Backbone...)   2026.03   43.2                      -
DIET (Sparsity=10%, Backbone...)            2026.03   40                        -
Magnitude-Dim (Sparsity=20%, Backbone...)   2026.03   31.6                      -
DIET (Sparsity=20%, Backbone...)            2026.03   31.2                      -
PuDDing (Sparsity=10%, Backbone...)         2026.03   30.2                      -
PuDDing (Sparsity=20%, Backbone...)         2026.03   29.2                      -
SliceGPT (Sparsity=20%, Backbone...)        2026.03   27.6                      -
SliceGPT (Sparsity=10%, Backbone...)        2026.03   27.4                      -