Translation misgendering evaluation set

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Translation | Translation misgendering evaluation set into English zero-shot SynthBio v3 (test) | Accuracy (Overall): 97.2 | 2