Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

CatBoost: gradient boosting with categorical features support

About

In this paper we present CatBoost, a new open-sourced gradient boosting library that successfully handles categorical features and outperforms existing publicly available implementations of gradient boosting in terms of quality on a set of popular publicly available datasets. The library has a GPU implementation of learning algorithm and a CPU implementation of scoring algorithm, which are significantly faster than other gradient boosting libraries on ensembles of similar sizes.

Anna Veronika Dorogush, Vasily Ershov, Andrey Gulin• 2018

Related benchmarks

TaskDatasetResultRank
ClassificationPetfinder (test)
Accuracy38.69
16
RegressionCrossed Barrel Dataset (80% uniform sampling)
R^20.76
10
RegressionCogni-e-Spin Dataset (80% uniform sampling)
R^20.55
10
Surrogate ModelingCrossed Barrel Dataset biased sampling
R^20.55
10
Surrogate ModelingCogni-e-Spin Dataset biased sampling
R^20.36
10
RegressionLattice Dataset (80% uniform sampling)
R^20.95
10
Surrogate ModelingLattice Dataset biased sampling
R^20.67
10
ClassificationAirbnb (test)
Accuracy43.56
8
ClassificationPAD-UFES-20 (PU20) (test)
Accuracy80.43
8
ClassificationCBIS-DDSM Calc (test)
Accuracy72.09
8
Showing 10 of 15 rows

Other info

Follow for update