
UltraChat

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Dialogue Generation | UltraChat | ASR Accuracy: 98.7 | 32
Memorization Reduction | Ultrachat | Memorization Reduction: 95.7 | 20
Unlearning Detection | UltraChat | Accuracy: 99.86 | 12
Natural Language Generation | UltraChat | BLEU: 36.8 | 8
Conversational Instruction Following | UltraChat | Overall Score: 9.1 | 6
Personalized Interaction | ultrachat Synthetic | Personalization Score: 7.26 | 6
Unlearning detection (distinguishing original vs. RMU-unlearned) | UltraChat (test) | Accuracy: 0.8746 | 4
Instruction Following | UltraChat | RM Score: 67.8 | 2