DRACO: A Denoising-Reconstruction Autoencoder for Cryo-EM
About
Foundation models in computer vision have demonstrated exceptional performance in zero-shot and few-shot tasks by extracting multi-purpose features from large-scale datasets through self-supervised pre-training methods. However, these models often overlook the severe corruption in cryogenic electron microscopy (cryo-EM) images by high-level noises. We introduce DRACO, a Denoising-Reconstruction Autoencoder for CryO-EM, inspired by the Noise2Noise (N2N) approach. By processing cryo-EM movies into odd and even images and treating them as independent noisy observations, we apply a denoising-reconstruction hybrid training scheme. We mask both images to create denoising and reconstruction tasks. For DRACO's pre-training, the quality of the dataset is essential, we hence build a high-quality, diverse dataset from an uncurated public database, including over 270,000 movies or micrographs. After pre-training, DRACO naturally serves as a generalizable cryo-EM image denoiser and a foundation model for various cryo-EM downstream tasks. DRACO demonstrates the best performance in denoising, micrograph curation, and particle picking tasks compared to state-of-the-art baselines.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Particle picking | EMPIAR-10289 | F1 Score36.7 | 12 | |
| Particle picking | EMPIAR-10291 | F1 Score62 | 12 | |
| Particle picking | EMPIAR-10081 | F1 Score41.7 | 12 | |
| Particle picking | Human HCN1 (test) | Precision83 | 7 | |
| Particle picking | 70S ribosome (test) | Precision80.3 | 7 | |
| Particle picking | LetB (test) | Precision67.8 | 7 | |
| 3D Reconstruction | EMPIAR-10289 | Resolution (Å)5.907 | 7 | |
| 3D Reconstruction | EMPIAR-10081 | Resolution (Å)6.389 | 7 | |
| 3D Reconstruction | EMPIAR-10291 | Resolution (Å)6.079 | 7 | |
| Micrograph Denoising | Human Apoferritin (EMPIAR-10421) (test) | SNR2.01 | 6 |