Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Wiki

Benchmarks

Task NameDataset NameSOTA ResultTrend
Language ModelingWiki
Perplexity (PPL)2
251
eXtreme Multi-label ClassificationWiki 500K
P@181.26
30
Temporal Knowledge Graph ReasoningWIKI
MRR0.4603
28
Entity LinkingWIKI (test)
Micro F184.5
27
Entity DisambiguationWIKI (test)
Micro F189.2
24
Node ClassificationWiki
Micro F10.5907
23
Relation ClassificationWiki ZSL (test)
Precision (%)71.54
22
Node Classificationwiki (test)
Accuracy65.13
22
Probabilistic Forecastingwiki
CRPS0.214
21
Macroscopic time series forecastingWiki
SMAPE0.0362
20
Temporal Knowledge Graph ReasoningWIKI (meta-test)
MRR33.5
19
Time series forecastingwiki (test)
CRPS0.214
19
Definition ModelingWiki
BLEU62.07
18
Temporal Point Process modelingWiki real-world (test)
Negative Log-Likelihood-1.3727
18
Temporal Reasoning PredictionWIKI (test)
Positive Performance99.28
17
Semantic SimilarityWIKI (test)
BLEU-455.52
17
TKG reasoningWIKI (test)
MRR30.9
17
Relation ExtractionWiki ZSL (test)
Micro-F153.71
16
Extractive Question AnsweringWiki (test)
EM78.6
16
ClusteringWiki
F1 Score51
16
Graph ClusteringWiki
ARI35.8
15
Text SegmentationWiki-50
Pk16.5
15
Zero-shot Triplet ExtractionWiki ZSL (test)
Accuracy21.49
15
Multi-view ClusteringWIKI
Accuracy46.16
14
Multi-view ClusteringWIKI 100% aligned
ACC60.33
14
Showing 25 of 89 rows