Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

StackOverflow

Benchmarks

Task NameDataset NameSOTA ResultTrend
Event PredictionStackOverFlow
ACC49.6
58
Open intent recognitionStackOverflow
Accuracy88.98
54
Short Text ClusteringStackOverflow
Accuracy83.3
38
Generalized Category DiscoveryStackOverflow (test)
Accuracy89.4
28
New Intent DiscoveryStackOverflow
NMI80.73
27
Event PredictionSTACKOVERFLOW (test)
OTD19.938
22
Next Word PredictionStackoverflow (test)
Test Accuracy99
22
Marked Temporal Point ProcessStackOverflow (test)
RMSE0.948
20
RerankingStackOverflow (test)
MAP40.92
16
Multi-horizon forecastingStackOverflow
Inter-event Time RMSE0.825
15
ClusteringStackOverflow (test)
ARI52.59
14
ClusteringStackOverflow
NMI78.8
13
Question RetrievalStackOverflow-Tag
Recall@10.498
12
Document RetrievalStackOverflow (test)
Precision@559.2
11
Topic ModelingStackOverflow
Cv0.397
11
Unknown Intent DetectionStackOverflow 50% seen classes (test)
Accuracy86.4
11
ClassificationStackoverflow (test)
Accp > Accg Percentage92.74
8
Event Time PredictionStackOverflow
RMSE1.12
7
Unknown Intent DetectionStackOverflow 75% seen classes (test)
Accuracy81.71
6
Unknown Intent DetectionStackOverflow 25% seen classes (test)
Accuracy68.74
6
Open Intent ClassificationStackOverflow 75% known classes (test)
Accuracy82.78
5
Open Intent ClassificationStackOverflow 25% known classes (test)
Accuracy86.72
5
Data TransformationStackOverflow
Accuracy65.3
3
SQL-to-text generationStackoverflow
BLEU-423.3
3
Showing 24 of 24 rows