CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning

About

Recent advancements in Contrastive Language-Image Pre-training (CLIP) have demonstrated notable success in self-supervised representation learning across various tasks. However, the existing CLIP-like approaches often demand extensive GPU resources and prolonged training times due to the considerable size of the model and dataset, making them poor for medical applications, in which large datasets are not always common. Meanwhile, the language model prompts are mainly manually derived from labels tied to images, potentially overlooking the richness of information within training samples. We introduce a novel language-image Contrastive Learning method with an Efficient large language model and prompt Fine-Tuning (CLEFT) that harnesses the strengths of the extensive pre-trained language and visual models. Furthermore, we present an efficient strategy for learning context-based prompts that mitigates the gap between informative clinical diagnostic data and simple class labels. Our method demonstrates state-of-the-art performance on multiple chest X-ray and mammography datasets compared with various baselines. The proposed parameter efficient framework can reduce the total trainable model size by 39% and reduce the trainable language model to only 4% compared with the current BERT encoder.

Yuexi Du, Brian Chang, Nicha C. Dvornek• 2024

Related benchmarks

Task	Dataset	Result
Medical Image Classification	COVID	Accuracy84.18	91
Classification	RSNA Pneumonia	Accuracy78.25	67
Classification	RSNA	Accuracy80.36	38
Classification	CheXpert 5x200 1.0	Accuracy66.74	27
Image Classification	NIH ChestX-ray	Accuracy78.95	21
Image-Text Retrieval	MIMIC 5x200	Precision@152.75	15
Phrase grounding	MS-CXR	Atelectasis Accuracy0.6871	15
Classification	MIMIC-5 × 200	Accuracy75.47	15
BI-RADS Classification	EMBED (test)	Accuracy33.29	10
Density Classification	EMBED (test)	Accuracy74.95	10

Showing 10 of 10 rows

Other info

Code

Follow for update

@wizwand_team Discord