Learned Cardinalities: Estimating Correlated Joins with Deep Learning
About
We describe a new deep learning approach to cardinality estimation. MSCN is a multi-set convolutional network, tailored to representing relational query plans, that employs set semantics to capture query features and true cardinalities. MSCN builds on sampling-based estimation, addressing its weaknesses when no sampled tuples qualify a predicate, and in capturing join-crossing correlations. Our evaluation of MSCN using a real-world dataset shows that deep learning significantly enhances the quality of cardinality estimation, which is the core problem in query optimization.
Andreas Kipf, Thomas Kipf, Bernhard Radke, Viktor Leis, Peter Boncz, Alfons Kemper• 2018
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Cardinality Estimation | CEB-1a-varied Out-of-Distribution Query Center Move | RMSE0.045 | 6 | |
| Cardinality Estimation | CEB-1a-varied In-Distribution Query Granularity Shift | RMSE0.014 | 6 | |
| Cardinality Estimation | CEB 1a-varied In-Distribution Query Center Move | RMSE0.012 | 6 | |
| Cardinality Estimation | CEB 1a-varied Out-of-Distribution Query Granularity Shift | RMSE0.117 | 6 | |
| Cardinality Estimation | IMDb small In-Distribution Query Center Move | RMSE0.02 | 4 | |
| Cardinality Estimation | IMDb small In-Distribution Query Granularity Shift | RMSE0.021 | 4 | |
| Cardinality Estimation | DSB In-Distribution, Query Granularity Shift | RMSE0.027 | 4 | |
| Cardinality Estimation | DSB In-Distribution, Query Center Move | RMSE0.057 | 4 | |
| Cardinality Estimation | DSB Out-of-Distribution, Query Granularity Shift | RMSE0.345 | 4 | |
| Cardinality Estimation | IMDb small Out-of-Distribution Query Center Move | RMSE0.7 | 4 |
Showing 10 of 12 rows