Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Kimi

Benchmarks

Task NameDataset NameSOTA ResultTrend
Out-of-distribution AI-generated text detectionKimi Out-of-distribution (OOD) K2.5 (unseen domains test)
Legal Accuracy98.7
16
Transferable Adversarial AttackKimi K2.5
ASR (%)69
16
Showing 2 of 2 rows