Multiple Instance Captioning: Learning Representations from Histopathology Textbooks and Articles

About

We present ARCH, a computational pathology (CP) multiple instance captioning dataset to facilitate dense supervision of CP tasks. Existing CP datasets focus on narrow tasks; ARCH on the other hand contains dense diagnostic and morphological descriptions for a range of stains, tissue types and pathologies. Using intrinsic dimensionality estimation, we show that ARCH is the only CP dataset to (ARCH-)rival its computer vision analog MS-COCO Captions. We conjecture that an encoder pre-trained on dense image captions learns transferable representations for most CP tasks. We support the conjecture with evidence that ARCH representation transfers to a variety of pathology sub-tasks better than ImageNet features or representations obtained via self-supervised or multi-task learning on pathology images alone. We release our best model and invite other researchers to test it on their CP tasks.

Jevgenij Gamper, Nasir Rajpoot• 2021

Related benchmarks

Task	Dataset	Result
Colon Tissue Classification	Kather (test)	Accuracy83.9	14
Gleason Scoring Classification	Arvaniti (test)	Accuracy79.4	7
Head & Neck Tissue Classification	Shaban (test)	Accuracy75.2	7
Lung Tissue Classification	Alsubaie (test)	Accuracy87	7
Lymphoma Tissue Classification	Janowczyk (test)	Accuracy47.3	7
Meningioma Tissue Classification	Qureshi (test)	Accuracy84.8	7
Nuclear Atypia Classification	Roux (test)	Accuracy80.2	7
Nuclear Mitosis Classification	Veta (test)	Accuracy70.4	7
Ovary Tissue Classification	Kobel (test)	Accuracy77	7
Breast Tissue Classification	Litjens (test)	Accuracy0.905	7

Showing 10 of 11 rows

Other info

Follow for update

@wizwand_team Discord