
Wikipedia

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
--- | --- | --- | ---
Inductive dynamic link prediction | Wikipedia (inductive) | AUC-ROC 0.9952 | 80
Dynamic Link Prediction | Wikipedia Inductive | AP 99.55 | 80
transductive dynamic link prediction | Wikipedia | AUC ROC 99.74 | 76
Membership Inference Attack | Wikipedia | AUC 0.9 | 75
Link Prediction | Wikipedia (inductive) | AP 99.04 | 51
Language Modeling | Wikipedia | Perplexity 9.17 | 43
Dynamic Graph Anomaly Detection | Wikipedia S2 | AUROC 83.39 | 42
Node classification | Wikipedia | AUC 88.32 | 40
Response correctness and completeness evaluation | Wikipedia | F1 Score 68 | 38
Membership Inference Attack | Wikipedia Pythia | ROC AUC 74 | 36
Membership Inference | Wikipedia Pythia (train) | TPR@1%FPR 22.7 | 36
Reliability of post-edit LLMs | Wikipedia | BLEU 100 | 36
Temporal Link Prediction | Wikipedia (Transductive) | AP (%) 99.79 | 33
Link Prediction | Wikipedia transductive | AP 99.57 | 32
Temporal Link Prediction | Wikipedia (inductive) | AUC-ROC 99.15 | 30
Dynamic link prediction | Wikipedia | AP 99.03 | 27
Membership Inference Attack | Wikipedia en | AUC 0.79 | 26
Document Classification | Wikipedia (test) | Classification Error 30.24 | 24
Dynamic Node Classification | Wikipedia (test) | AUC-ROC 88.37 | 22
Dynamic Link Prediction | Wikipedia | AUC-ROC 0.8768 | 22
Link Prediction | Wikipedia | AP 99.37 | 20
Fact Memorization | Wikipedia corpus annotated (train) | Fact Accuracy 929.25 | 20
Language Modeling | Wikipedia 20k sentences | Perplexity (Wikipedia 20k) 9.06 | 20
Unconditional Text Generation | Wikipedia | Mauve Score 90.1 | 18
Multi-view Clustering | Wikipedia | Accuracy (ACC) 62.18 | 16
Showing 25 of 163 rows