Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Ours

Benchmarks

Task NameDataset NameSOTA ResultTrend
Causal DiscoveryOurs Noisy
AUROC82.3
9
Causal DiscoveryOurs Original
AUROC0.821
9
Instruction Following EvaluationOurs hard seed data
Score56.73
5
Language DetoxificationOurs (test)
Overall Offensiveness Score1.145
5
Makeup TransferOurs (test)
FID11.67
4
Showing 5 of 5 rows