Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Pheme

Benchmarks

Task NameDataset NameSOTA ResultTrend
Misinformation DetectionPHEME
Accuracy69.2
26
Rumor DetectionPHEME
DeepWordBug ASR56.86
16
Harmful Content DetectionPHEME New Attacks: ExplainDrive (test)
Accuracy82.91
15
Rumour DetectionPheme
Precision87.7
14
Rumor VerificationPHEME (test)
Macro-F166.6
12
Rumor DetectionPheme
Accuracy87.2
11
Harmful Content DetectionPHEME Known Attacks: DeepWordBug, TFAdjusted, TREPAT (test)
Accuracy85.59
10
Early Rumor DetectionPHEME
Macro F10.646
9
CalibrationPHEME
ECE0.018
6
ClassificationPHEME
AUC0.852
6
Causal influence recoveryPHEME
Precision@1081
5
Showing 11 of 11 rows