SleepMaMi: A Universal Sleep Foundation Model for Integrating Macro- and Micro-structures
About
While the shift toward unified foundation models has revolutionized many deep learning domains, sleep medicine remains largely restricted to task-specific models that focus on localized micro-structure features. These approaches often neglect the rich, multi-modal context of Polysomnography (PSG) and fail to capture the global macro-structure of a full night's sleep. To address this, we introduce SleepMaMi , a Sleep Foundation Model engineered to master both hour-long sleep architectures and fine-grained signal morphologies. Our framework utilizes a hierarchical dual-encoder design: a Macro-Encoder to model full-night temporal dependencies and a Micro-Encoder to capture short-term characteristics from biosignals. Macro-Encoder is trained via Demographic-Guided Contrastive Learning, which aligns overnight sleep patterns with objective subject metadata, such as age, sex and BMI to refine global representations. Micro-Encoder is optimized via a hybrid Masked Autoencoder (MAE) and multi-modal contrastive objective. Pre-trained on a massive corpus of $>$20,000 PSG recordings (158K hours),SleepMaMi outperforms existing foundation models across a diverse suite of downstream tasks, demonstrating superior generalizability and label-efficient adaptation for clinical sleep analysis.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| SDB segmentation | KISS (test) | Accuracy81.8 | 4 | |
| Sleep Stage Classification | SHHS1 (test) | Accuracy81.9 | 4 | |
| Sleep Stage Classification | KISS (test) | Accuracy62.8 | 4 | |
| SDB segmentation | SHHS1 (test) | Accuracy77.3 | 4 | |
| Disease prediction | SHHS 1 (test) | Angina77.8 | 2 |