Hibou: A Family of Foundational Vision Transformers for Pathology
About
Pathology, the microscopic examination of diseased tissue, is critical for diagnosing various medical conditions, particularly cancers. Traditional methods are labor-intensive and prone to human error. Digital pathology, which converts glass slides into high-resolution digital images for analysis by computer algorithms, revolutionizes the field by enhancing diagnostic accuracy, consistency, and efficiency through automated image analysis and large-scale data processing. Foundational transformer pretraining is crucial for developing robust, generalizable models as it enables learning from vast amounts of unannotated data. This paper introduces the Hibou family of foundational vision transformers for pathology, leveraging the DINOv2 framework to pretrain two model variants, Hibou-B and Hibou-L, on a proprietary dataset of over 1 million whole slide images (WSIs) representing diverse tissue types and staining techniques. Our pretrained models demonstrate superior performance on both patch-level and slide-level benchmarks, surpassing existing state-of-the-art methods. Notably, Hibou-L achieves the highest average accuracy across multiple benchmark datasets. To support further research and application in the field, we have open-sourced the Hibou models, which can be accessed at https://github.com/HistAI/hibou.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Slide-level classification | Camelyon16 | -- | 52 | |
| Cancer Subtyping | cohort of lung cancer H1 (internal) | Mean AUC0.9197 | 46 | |
| Cancer Grading | Gastric Cancer H4 (external cohort) | Mean AUC0.8483 | 23 | |
| HER2 Status Prediction | Gastric Cancer Internal Cohort H1+H3+H4 (test) | Mean AUC0.5671 | 23 | |
| Intestinal Metaplasia Classification | gastric cancer H7 (internal cohort) | Mean AUC0.9569 | 23 | |
| Vascular Invasion Detection | Gastric Cancer Cohort H3 (external) | Mean AUC72.79 | 23 | |
| Perineural Invasion Detection | Gastric Cancer Internal Cohort H1 (evaluation) | Mean AUC0.9237 | 23 | |
| S-100 Status Prediction | Gastric Cancer Internal Cohort H1+H3+H4 (test) | Mean AUC0.8172 | 23 | |
| Cancer Grading | Gastric Cancer H3 external cohort | Mean AUC0.8312 | 23 | |
| Estrogen Receptor (ER) status prediction | Breast cancer H2 (internal cohort) | Mean AUC0.8919 | 23 |