Dimensionality Reduction Considered Harmful (Some of the Time)
About
Visual analytics now plays a central role in decision-making across diverse disciplines, but it can be unreliable: the knowledge or insights derived from the analysis may not accurately reflect the underlying data. In this dissertation, we improve the reliability of visual analytics with a focus on dimensionality reduction (DR). DR techniques enable visual analysis of high-dimensional data by reducing it to two or three dimensions, but they inherently introduce errors that can compromise the reliability of visual analytics. To this end, I investigate reliability challenges that practitioners face when using DR for visual analytics. Then, I propose technical solutions to address these challenges, including new evaluation metrics, optimization strategies, and interaction techniques. We conclude the thesis by discussing how our contributions lay the foundation for achieving more reliable visual analytics practices.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Structural Complexity Ground Truth Prediction | 96 datasets | Local MRREs0.0142 | 25 | |
| Rank Correlation Analysis | Across-dataset aggregate 96 datasets (test) | AMI0.8955 | 20 | |
| Predicting maximum achievable accuracy of Dimensionality Reduction (DR) techniques | 96 datasets | T&C Score0.9471 | 18 |