Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Commonsense Reasoning on LM Eval Harness Suite

48.89LM Eval Score

SparseGPT+SEFT

34.267638.063841.8645.6562May 29, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.05
48.89-----------
2025.05
48.33-----------
2025.05
47.95-----------
2025.05
47.7-----------
2025.05
47.27-----------
2025.05
47.09-----------
2025.05
46.11-----------
2025.05
45.61-----------
2025.05
45.42-----------
2025.05
44.73-----------
2025.05
44.55-----------
2025.05
44.46-----------
2025.05
44.29-----------
2025.05
43.68-----------
2025.05
43.56-----------
2025.05
43.32-----------
2025.05
41.7-----------
2025.05
41.51-----------
2025.05
34.92-----------
2025.05
34.83-----------
2026.03
-50.0967.4376.8274.3343.5261.6464.01418093.465.22
2026.03
-48.9266.9477.2673.8240.5360.1465.4139.67892.264.28
2026.03
-30.2564.4976.6172.6441.0455.6458.3540.27987.660.58
2026.03
-43.9467.3676.8275.6743.5258.9663.79418291.364.44
2026.03
-49.1867.5776.7175.5144.9758.1764.1641.4829265.17
2026.03
-43.962.9975.6371.9738.9157.5463.1237.27591.461.76
2026.03
-50.1868.6276.5572.6443.0960.6263.6143799365.03
2026.03
-50.267.2377.0973.0641.6459.1262.3940.88292.964.64
2026.03
-51.7468.4476.5576.8143.0961.0963.2140.48193.365.56
2026.03
-50.9268.3577.8674.6642.2460.5462.5740.48392.865.33
2026.03
-49.8468.1977.0475.1742.5859.0465.2642.48393.965.64
2026.03
-51.5468.6978.1874.4541.8160.365.08428293.965.8
2026.03
-51.868.9976.8276.7343.660.6963.340.28193.565.66