
General Performance on MMLU, HellaSwag, TruthfulQA, GSM8K, MATH, MBPP, HumanEval

Average Score: 40.35

Sens-Merging (DARE)

Updated 5mo ago

Evaluation Results

Method	Date	Average Score
Sens-Merging (DARE)	2025.02	40.35
Sens-Merging (Ties-Merging)	2025.02	40.22
Sens-Merging (Task Arithmetic)	2025.02	40.2
DARE	2025.02	40.13
Ties-Merging	2025.02	39.89
Math	2025.02	37.42
Task Arithmetic	2025.02	34.52
Code	2025.02	31.21
Chat	2025.02	31.18