Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

BBC

Benchmarks

Task NameDataset NameSOTA ResultTrend
Next Token PredictionBBC
Next Token Accuracy40.05
32
Classificationbbc
Accuracy99.72
20
Topic ModelingBBC
NPMI0.38
17
Cluster count selectionBbc
Selected Cluster Count9
16
Multi-view ClassificationBBC
Accuracy97.23
16
Topic ModelingBBC
IRBO1
13
Document ClusteringBBC (test)
NMI0.729
13
Dataset Additionbbc-embedding
DA Score81
12
Dataset Removalbbc embedding
DR0.9
12
Noisy label detectionbbc-embedding
NLD Score0.11
12
Word Intrusion DetectionBBC
Accuracy63.15
10
Document ClassificationBBC
Accuracy100
10
Synthetic Text GenerationBBC
Mean Embedding Similarity0.43
10
Node ClassificationBBC rich-text graph (test)
Accuracy98.4
10
Node ClassificationBBC (test)
NMI0.949
10
ClusteringBbc
SIL0.188
5
ClusteringBbc
AMI83.9
5
Next Token PredictionBBC
Accuracy (ϵ=∞)25.75
5
Dataset AdditionBBC
Accuracy (5% Threshold)68
5
Dataset RemovalBBC
Accuracy (5%)90
5
Noisy Label DetectionBBC
F1 (5%)19
5
Scene DetectionBBC
AP0.4364
5
Shot Boundary DetectionBBC
F1 Score97.1
5
Density EstimationBBC Twenty Datasets (test)
Log-Likelihood-252.14
4
Shot Transition DetectionBBC Planet Earth documentary series
F1 Score0.962
4
Showing 25 of 27 rows