Multiple Instance Captioning: Learning Representations from Histopathology Textbooks and Articles
About
We present ARCH, a computational pathology (CP) multiple instance captioning dataset to facilitate dense supervision of CP tasks. Existing CP datasets focus on narrow tasks; ARCH on the other hand contains dense diagnostic and morphological descriptions for a range of stains, tissue types and pathologies. Using intrinsic dimensionality estimation, we show that ARCH is the only CP dataset to (ARCH-)rival its computer vision analog MS-COCO Captions. We conjecture that an encoder pre-trained on dense image captions learns transferable representations for most CP tasks. We support the conjecture with evidence that ARCH representation transfers to a variety of pathology sub-tasks better than ImageNet features or representations obtained via self-supervised or multi-task learning on pathology images alone. We release our best model and invite other researchers to test it on their CP tasks.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Colon Tissue Classification | Kather (test) | Accuracy83.9 | 14 | |
| Gleason Scoring Classification | Arvaniti (test) | Accuracy79.4 | 7 | |
| Head & Neck Tissue Classification | Shaban (test) | Accuracy75.2 | 7 | |
| Lung Tissue Classification | Alsubaie (test) | Accuracy87 | 7 | |
| Lymphoma Tissue Classification | Janowczyk (test) | Accuracy47.3 | 7 | |
| Meningioma Tissue Classification | Qureshi (test) | Accuracy84.8 | 7 | |
| Nuclear Atypia Classification | Roux (test) | Accuracy80.2 | 7 | |
| Nuclear Mitosis Classification | Veta (test) | Accuracy70.4 | 7 | |
| Ovary Tissue Classification | Kobel (test) | Accuracy77 | 7 | |
| Breast Tissue Classification | Litjens (test) | Accuracy0.905 | 7 |