MUSE

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Machine Unlearning | MUSE Books | Privacy Leakage: -76.1834 | 35 |
| Machine Unlearning | MUSE-News Llama 2 7B | Privacy Leakage: -99.8951 | 27 |
| Unlearning | MUSE-Books 1.0 (test) | Unlearn Score: 86 | 24 |
| Reasoning Segmentation | MUSE (val) | gIoU (overall): 48 | 21 |
| Machine Unlearning | MUSE NEWS | VerbMem (Df): 58.42 | 18 |
| Machine Unlearning | MUSE | VerbMem on DF: 0 | 16 |
| Reasoning Segmentation | MUSE (test) | gIoU (overall): 42.3 | 16 |
| Machine Unlearning | MUSE-Books Relearn 50% | Forgetting Score (No VerbMem): 90.974 | 15 |
| Machine Unlearning | MUSE (forget set (Df) and retain set (Dr)) | VerbMem (Df): 58.4 | 15 |
| Unlearning | MUSE-Books Harry Potter 100 samples (forget set) | R-Forget: 32.13 | 13 |
| Machine Unlearning | MUSE News | Rel Score: 8.3 | 9 |
| Machine Unlearning | MUSE Books | Rel: 7.55 | 9 |
| Knowledge Retention | MUSE Retain set (Dr) | KnowMem: 56 | 9 |
| Knowledge Unlearning | MUSE (forget set Df) | VerbMem Df Pre: 57.9 | 8 |
| Relearning Attack | MUSE | RAP: 43 | 8 |
| Bilingual Lexicon Induction | MUSE (test) | P@1 (en-es →): 89.9 | 7 |
| Cross-lingual Word Alignment | MUSE | Alignment Score (IT-EN): 81.84 | 7 |
| Multi-target reasoning segmentation | MUSE (val) | Overall gIoU: 52.4 | 6 |
| Conversational Recommendation | MUSE Multimodal Fashion (test) | R@1: 10.2 | 5 |
| Conversational Recommendation | MUSE (n=200) | Recommendation Quality (Rec.Q): 4.16 | 3 |
| Bilingual Lexicon Induction | MUSE zh-en (test) | Precision: 96.6 | 2 |