Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

UltraChat

Benchmarks

Task NameDataset NameSOTA ResultTrend
Memorization ReductionUltrachat
Memorization Reduction95.7
20
Unlearning DetectionUltraChat
Accuracy99.86
12
Natural Language GenerationUltraChat
BLEU36.8
8
Conversational Instruction FollowingUltraChat
Overall Score9.1
6
Personalized Interactionultrachat Synthetic
Personalization Score7.26
6
Unlearning detection (distinguishing original vs. RMU-unlearned)UltraChat (test)
Accuracy0.8746
4
Showing 6 of 6 rows