Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Knowledge Editing

Benchmarks

Task NameDataset NameSOTA ResultTrend
Knowledge EditingKnowledge Editing (Edit on English, Test on Chinese) 1.0 (ZH)
Reliability80.23
7
Knowledge EditingKnowledge Editing Edit on English, Test on Russian 1.0 (RU)
Reliability84.28
7
Knowledge EditingKnowledge Editing Edit on English, Test on German 1.0 (DE)
Reliability90.45
7
Knowledge EditingKnowledge Editing Edit on English, Test on Czech 1.0 (CS)
Reliability87.61
7
Knowledge EditingKnowledge Editing Edit on English, Test on English 1.0
Reliability100
7
Knowledge EditingKnowledge Editing Edit on Chinese, Test on English
Reliability45.76
3
Showing 6 of 6 rows