A Multimodal 3D Foundation Model for Light Sheet Fluorescence Microscopy Enables Few-Shot Segmentation, Classification, and Deblurring
About
Light sheet fluorescence microscopy (LSM) enables high-resolution, three-dimensional (3D) imaging of biological specimens, providing rich volumetric data for studying cellular organization, pathology, and vascular networks. However, the size, dimensionality, and annotation burden of LSM data make supervised deep learning approaches costly and difficult to scale. Additionally, despite the abundance of unannotated LSM volumes, foundation models for this modality remain underexplored due to computational challenges and the complexity of volumetric representation learning. In this work, we introduce a 3D foundation model for LSM data, pretrained on a large curated collection of 3D images spanning multiple organisms, stains, and imaging protocols. We learn transferable volumetric representations by jointly optimizing for masked reconstruction and image-text alignment. The pretrained backbone drastically reduces the annotation burden, enabling efficient, few-shot adaptation for varied downstream tasks. We evaluate this approach on downstream segmentation, classification, and deblurring. Our results demonstrate consistent improvements over baselines, (1) when measured using standard evaluation metrics and (2) when rigorously assessed by domain experts. This highlights the potential of foundation model pretraining to reduce annotation requirements while improving performance across diverse LSM analysis tasks. Pretrained model weights and code for pretraining and finetuning are publicly available: https://github.com/AdinaScheinfeld/lsm_fm_public_repo.git.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Classification | LSM held-out patches (test) | Accuracy (ACC)74 | 16 | |
| Deblurring | LSM Amyloid Plaque held-out patches (test) | SSIM69 | 12 | |
| Deblurring | LSM Cell Nucleus patches (test) | SSIM89 | 12 | |
| Deblurring | LSM Vessels held-out patches (test) | SSIM92 | 12 | |
| Segmentation | Cell Nucleus Few-shot | Total Dice80 | 12 | |
| Segmentation | Cell Nucleus Many-shot | Total Dice82 | 12 | |
| Segmentation | Amyloid Plaque Few-shot | Total Dice68 | 9 | |
| Segmentation | Amyloid Plaque Many-shot | Total Dice80 | 9 | |
| Segmentation | Vessels Few-shot | Total Dice89 | 9 | |
| Segmentation | Vessels Many-shot | Total Dice92 | 9 |