StackExchange

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Upvote Prediction | Stackexchange | ROC-AUC 85.02 | 45 |
| Churn Prediction | Stackexchange | ROC-AUC 84.22 | 45 |
| Clustering | MTEB StackExchange P2P | V1 Score 37.27 | 17 |
| Clustering | MTEB StackExchange S2S | V1 Score 61.49 | 17 |
| Question Answering | StackExchange (test) | Accuracy 65.6 | 12 |
| user-churn | StackExchange 4DBInfer (test) | AUC 0.8796 | 9 |
| post-upvote | StackExchange 4DBInfer (test) | AUC 0.8896 | 9 |
| Question Answering | StackExchange Q&A (test) | Accuracy (Bio.) 82.2 | 8 |
| Present keyphrase generation | StackExchange | F1@3 27.2 | 8 |
| Topic Classification | StackExchange (test) | Acc 67.56 | 6 |
| Language Modeling | StackExchange (val) | Perplexity 4.43 | 3 |
| Absent keyphrase generation | StackExchange | Recall@5 4.6 | 3 |
| Reward Modeling | StackExchange (val) | REWARD 0 | 1 |
Showing 13 of 13 rows