Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Counterfact

Benchmarks

Task NameDataset NameSOTA ResultTrend
Knowledge EditingCounterfact
Efficacy9,387
91
Subject inference attackCounterFact batch-edit tasks
Recall100
36
Sequential Knowledge EditingCounterFact sequential editing 10,000 Samples
Efficacy Success99.5
33
Model EditingCounterFact
Reliability92
30
Knowledge EditingCounterfact 10,000 facts
Relational Score10,000
27
Model EditingCounterFact
Reliability79.4
26
Model EditingCounterFact
Efficacy96.12
24
Sequential model editingCounterfact
Efficacy99.55
24
Classification ProbingCounterfact (test)
Probe Acc (Best Layer)89.6
21
Knowledge EditingCounterfact Full (test)
Rel. Accuracy99
21
Lifelong Knowledge EditingCOUNTERFACT
Reliability67.1
14
Model EditingCounterFact 3,000 samples (test)
Reliability9,980
13
Knowledge EditingCounterfact (test)
RwA99.86
12
Prompt recovery attackCounterFact
Top-1 Accuracy60
12
Sequential Model EditingCounterFact full (10K sequential edits) (test)
Efficacy94.45
10
Sequential Knowledge EditingCounterFact
Efficacy100
10
Prompt recovery attackCounterFact (test)
Top-1 Accuracy54
9
Model EditingCOUNTERFACT 7,500-record GPT-2 XL (test)
Score89.2
9
Model EditingCounterFact
Rel (QA Context)64.6
8
Hallucination Detectioncounterfact
AUROC0.84
8
Knowledge EditingCounterfact (val)
Relational Score1
8
Knowledge EditingCounterfact (first 2000 edits)
Accuracy99.95
8
Knowledge EditingCounterfact (first 150 edits)
DI Score98.67
8
Knowledge EditingCounterFact 15000 (test)
Efficacy91.22
6
Subject inference attackCounterFact
Attack Performance100
6
Showing 25 of 31 rows