SOLVAR: Fast covariance-based heterogeneity analysis with pose refinement for cryo-EM
About
Cryo-electron microscopy (cryo-EM) has emerged as a powerful technique for resolving the three-dimensional structures of macromolecules. A key challenge in cryo-EM is characterizing continuous heterogeneity, where molecules adopt a continuum of conformational states. Covariance-based methods offer a principled approach to modeling structural variability. However, estimating the covariance matrix efficiently remains a challenging computational task. In this paper, we present SOLVAR (Stochastic Optimization for Low-rank Variability Analysis), which leverages a low-rank assumption on the covariance matrix to provide a tractable estimator for its principal components, despite the apparently prohibitive size of the covariance matrix. Under this low-rank assumption, our estimator can be formulated as an optimization problem that can be solved quickly and accurately. Moreover, our framework enables refinement of the poses of the input particle images, a capability absent from most heterogeneity-analysis methods and from all covariance-based methods. Numerical experiments on both synthetic and experimental datasets demonstrate that the algorithm accurately captures the dominant components of variability while maintaining computational efficiency. SOLVAR achieves state-of-the-art performance across multiple datasets in a recent heterogeneity benchmark. The code of the algorithm is freely available at https://github.com/RoeyYadgar/SOLVAR.
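To make the low-rank idea concrete, the following is a minimal toy sketch and not the SOLVAR implementation. It assumes fully observed, zero-mean vectors rather than noisy 2D projections, and it ignores poses, CTF, and regularization. It fits a factor `U` so that `U @ U.T` approximates the sample covariance, using mini-batch gradient steps that never form the full covariance matrix. The function name `estimate_principal_components`, the step size, and the synthetic data are all illustrative assumptions.

```python
import numpy as np


def estimate_principal_components(X, rank, lr=0.02, n_iters=1000,
                                  batch_size=200, seed=0):
    """Toy low-rank covariance fit (hypothetical helper, not SOLVAR's API).

    Minimizes ||C - U U^T||_F^2 with mini-batch gradient steps, where C is the
    sample covariance of the zero-mean rows of X. C is never materialized.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    U = 0.1 * rng.standard_normal((d, rank))  # small random initialization
    for _ in range(n_iters):
        batch = X[rng.choice(n, size=batch_size, replace=False)]
        # C_b @ U evaluated as batch^T (batch U) / b: cost O(b * d * rank),
        # so no d x d matrix is ever stored.
        CU = batch.T @ (batch @ U) / batch_size
        # Negative gradient of ||C_b - U U^T||_F^2 (constant folded into lr).
        U += lr * (CU - U @ (U.T @ U))
    return U


# Synthetic data (assumed for illustration): a rank-2 signal plus small noise.
rng = np.random.default_rng(1)
d, r, n = 50, 2, 2000
V, _ = np.linalg.qr(rng.standard_normal((d, r)))  # true orthonormal directions
scales = np.array([3.0, 2.0])                     # std. dev. along each direction
X = (rng.standard_normal((n, r)) * scales) @ V.T + 0.1 * rng.standard_normal((n, d))

U = estimate_principal_components(X, rank=r)

# Compare the estimated and true subspaces via the cosines of principal angles.
Q, _ = np.linalg.qr(U)
alignment = np.linalg.svd(V.T @ Q, compute_uv=False).min()
print(f"smallest cosine of principal angles: {alignment:.3f}")
```

The columns of `U` span the estimated variability subspace. An orthonormal basis and per-component variances can be read off from a singular value decomposition of `U`. In the cryo-EM setting described above, each data term would additionally involve the imaging operator for that particle, which this sketch omits.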
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Continuous heterogeneity analysis | IgG-1D Cryobench (test) | Median | 0.389 | 8 |
| Continuous heterogeneity analysis | IgG-RL Cryobench (test) | Median | 0.368 | 8 |
| Continuous heterogeneity analysis | Spike-MD Cryobench (test) | Median | 0.355 | 8 |
| Continuous heterogeneity analysis | Ribosembly Cryobench (test) | Median Score | 0.432 | 7 |
| Continuous heterogeneity analysis | Tomotwin-100 Cryobench (test) | Median Score | 0.238 | 7 |
| Ab initio reconstruction | Cryobench IgG-1D | Median AUC FSC | 0.387 | 5 |
| Ab initio reconstruction | Cryobench Ribosembly | Median FSC AUC | 42.4 | 5 |
| Ab initio reconstruction | Cryobench Tomotwin-100 | Median FSC AUC | 0.084 | 5 |
| Ab initio reconstruction | Cryobench Spike-MD | Median AUC (FSC) | 0.327 | 5 |
| Ab initio reconstruction | Cryobench IgG-RL | Median AUC (FSC) | 0.332 | 5 |