Language Modeling Utility on LM Eval Harness
[Chart: HellaSwag Accuracy over time; pre-edit value 0.48 as of Jun 9, 2025. Selectable metric views: HellaSwag Accuracy, Perplexity (OpenAI Lambada), Perplexity (Standard Lambada), Perplexity (Wikitext), Winogrande Accuracy, PIQA Accuracy.]
Evaluation Results

| Method   | Model State | Date    | Links | HellaSwag Accuracy | Perplexity (OpenAI Lambada) | Perplexity (Standard Lambada) | Perplexity (Wikitext) | Winogrande Accuracy | PIQA Accuracy |
|----------|-------------|---------|-------|--------------------|-----------------------------|-------------------------------|-----------------------|---------------------|---------------|
| Pre Edit | Pre-edit    | 2025.06 |       | 0.48               | 3.98                        | 5.96                          | 10.88                 | 0.65                | 0.76          |
| PME      | Post-edit   | 2025.06 |       | 0.48               | 4.07                        | 6.48                          | 10.89                 | 0.65                | 0.76          |
| MEMIT    | Post-edit   | 2025.06 |       | 0.48               | 4.24                        | 6.59                          | 10.93                 | 0.64                | 0.76          |
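The perplexity columns above can be read as follows: evaluation harnesses such as LM Eval Harness typically report perplexity as the exponential of the mean per-token negative log-likelihood, so a higher value means the model found the text less predictable. A minimal self-contained sketch of that computation (the function name and example values are illustrative, not taken from the benchmark):

```python
import math

def corpus_perplexity(token_logprobs):
    """Perplexity of a token sequence: exp of the mean
    negative log-likelihood per token (natural log)."""
    if not token_logprobs:
        raise ValueError("need at least one token log-probability")
    mean_nll = -sum(token_logprobs) / len(token_logprobs)
    return math.exp(mean_nll)

# A model that assigns probability 0.5 to every token
# has a per-token perplexity of 2 (up to float rounding).
print(corpus_perplexity([math.log(0.5)] * 4))
```

Under this reading, the jump from 5.96 to 6.59 on Standard Lambada after a MEMIT edit reflects a small increase in average per-token negative log-likelihood on held-out text.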