Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

PathDiff: Histopathology Image Synthesis with Unpaired Text and Mask Conditions

About

Diffusion-based generative models have shown promise in synthesizing histopathology images to address data scarcity caused by privacy constraints. Diagnostic text reports provide high-level semantic descriptions, and masks offer fine-grained spatial structures essential for representing distinct morphological regions. However, public datasets lack paired text and mask data for the same histopathological images, limiting their joint use in image generation. This constraint restricts the ability to fully exploit the benefits of combining both modalities for enhanced control over semantics and spatial details. To overcome this, we propose PathDiff, a diffusion framework that effectively learns from unpaired mask-text data by integrating both modalities into a unified conditioning space. PathDiff allows precise control over structural and contextual features, generating high-quality, semantically accurate images. PathDiff also improves image fidelity, text-image alignment, and faithfulness, enhancing data augmentation for downstream tasks like nuclei segmentation and classification. Extensive experiments demonstrate its superiority over existing methods.

Mahesh Bhosale, Abdul Wasi, Yuanhao Zhai, Yunjie Tian, Samuel Border, Nan Xi, Pinaki Sarder, Junsong Yuan, David Doermann, Xuan Gong• 2025

Related benchmarks

TaskDatasetResultRank
Mask-to-Image FaithfulnessBLCA TCGA (test)
Faithfulness Score78.56
10
Mask-to-Image FaithfulnessBRCA TCGA (test)
Faithfulness Score79.12
10
Mask-to-Image FaithfulnessGBMLGG TCGA (test)
Faithfulness Score77.34
10
Mask-to-Image FaithfulnessLUAD TCGA (test)
Faithfulness Score80.15
10
Image-text similarityBLCA 20x magnification TCGA (test)
PLIP Cosine Similarity22.48
3
Image-text similarityBRCA 20x magnification TCGA (test)
PLIP Cosine Similarity21.35
3
Image-text similarityGBMLGG 20x magnification TCGA (test)
PLIP Cosine Similarity21.92
3
Image-text similarityLUAD 20x magnification TCGA (test)
PLIP Cosine Similarity22.15
3
Image-text similarityBLCA 5x magnification TCGA (test)
PLIP Cosine Similarity18.34
3
Image-text similarityBRCA 5x magnification TCGA (test)
PLIP Cosine Similarity19.45
3
Showing 10 of 12 rows

Other info

Follow for update