Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

BBC

Benchmarks

Task NameDataset NameSOTA ResultTrend
Classificationbbc
Accuracy99.72
61
Next Token PredictionBBC
Next Token Accuracy40.05
32
Reference-grounded News VerificationBBC
Classification Accuracy (ACCcls)97.09
20
Topic ModelingBBC
NPMI0.38
17
Cluster count selectionBbc
Selected Cluster Count9
16
Multi-view ClassificationBBC
Accuracy97.23
16
Topic ModelingBBC
IRBO1
13
Document ClusteringBBC (test)
NMI0.729
13
Multi-view ClusteringBBC
NMI70.55
12
Dataset Additionbbc-embedding
DA Score81
12
Dataset Removalbbc embedding
DR0.9
12
Noisy label detectionbbc-embedding
NLD Score0.11
12
Word Intrusion DetectionBBC
Accuracy63.15
10
Document ClassificationBBC
Accuracy100
10
Synthetic Text GenerationBBC
Mean Embedding Similarity0.43
10
Node ClassificationBBC rich-text graph (test)
Accuracy98.4
10
Node ClassificationBBC (test)
NMI0.949
10
ClusteringBbc
SIL0.188
5
ClusteringBbc
AMI83.9
5
Next Token PredictionBBC
Accuracy (ϵ=∞)25.75
5
Dataset AdditionBBC
Accuracy (5% Threshold)68
5
Dataset RemovalBBC
Accuracy (5%)90
5
Noisy Label DetectionBBC
F1 (5%)19
5
Scene DetectionBBC
AP0.4364
5
Shot Boundary DetectionBBC
F1 Score97.1
5
Showing 25 of 30 rows