Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Wikipedia

Benchmarks

Task NameDataset NameSOTA ResultTrend
Membership Inference AttackWikipedia
AUC0.9
52
Inductive dynamic link predictionWikipedia (inductive)
AUC-ROC0.9848
44
Dynamic Link PredictionWikipedia Inductive
AP98.59
44
Dynamic Graph Anomaly DetectionWikipedia S2
AUROC83.39
42
Response correctness and completeness evaluationWikipedia
F1 Score68
38
transductive dynamic link predictionWikipedia
AUC ROC99.31
37
Membership Inference AttackWikipedia Pythia
ROC AUC74
36
Membership InferenceWikipedia Pythia (train)
TPR@1%FPR22.7
36
Reliability of post-edit LLMsWikipedia
BLEU100
36
Language ModelingWikipedia
Perplexity9.17
35
Dynamic link predictionWikipedia
AP99.03
27
Membership Inference AttackWikipedia en
AUC0.79
26
Document ClassificationWikipedia (test)
Classification Error30.24
24
Dynamic Link PredictionWikipedia
AUC-ROC0.8768
22
Link PredictionWikipedia (inductive)
AP99.04
21
Link PredictionWikipedia transductive
AP99.31
21
Fact MemorizationWikipedia corpus annotated (train)
Fact Accuracy929.25
20
Language ModelingWikipedia 20k sentences
Perplexity (Wikipedia 20k)9.06
20
Unconditional Text GenerationWikipedia
Mauve Score90.1
18
Graph ClusteringWikipedia
NMI0.516
15
Machine-paraphrased plagiarism detectionWikipedia SpinBot paraphrased (test)
F1-Micro89.55
15
Node classificationWikipedia
AUC88.32
15
AI-generated text detectionWikipedia OPT-13B generations (+ 60L,600)
Accuracy (1% FPR)97.2
14
Page ClassificationWikipedia (90% train ratio)
Macro-F1 Score83.66
13
Link predictionWikipedia
AUC99.2
12
Showing 25 of 134 rows