
Enhancing Single-Image Facial Demorphing using Multimodal Large Language Models

About

Face recognition systems are increasingly vulnerable to morphing attacks, in which a composite image is crafted to match multiple identities, enabling unauthorized access and identity fraud. Existing detection methods can flag morphed images but cannot recover the constituent images or identities, which limits their forensic utility. This paper presents a reference-free facial demorphing framework that uses Multimodal Large Language Models (MLLMs) to guide a coupled diffusion-based reconstruction process. The key innovation is to extract semantic embeddings from intermediate MLLM layers and use them to condition the demorphing, providing high-level reasoning about facial attributes and identity cues that complements low-level pixel information. Demorphing is formulated as a coupled conditional generation problem: both constituent faces are synthesized jointly by a denoising diffusion model operating directly in the RGB domain, which ensures inter-identity consistency while preserving fine-grained perceptual details. Prior approaches rely on compressed latent representations or assume identity overlap between training and testing sets. In contrast, this method uses MLLM hidden states directly as conditioning signals, bypassing the lossy cycle of generating text and re-encoding it, so the denoising network can attend to subtle visual cues such as hair, background, and facial textures. Ablation studies further show that middle MLLM layers encode more identity-discriminative representations, that RGB-domain demorphing outperforms latent-space approaches by 30-40% at strict operating points, and that full MLLM embeddings offer substantial advantages over raw ViT features, owing to the enhanced semantic structuring that multimodal pretraining provides.
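The conditioning idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy `hidden_states` tensor, the helper names `select_layer` and `cross_attend`, and the use of single-head scaled dot-product attention are all assumptions made for illustration. The sketch only shows two things the abstract describes: taking token embeddings from a middle layer rather than the final one, and letting two output branches (one per constituent face) attend to the same conditioning tokens.

```python
import math

def select_layer(hidden_states, index=None):
    """Pick one layer's token embeddings; defaults to the middle layer."""
    if index is None:
        index = len(hidden_states) // 2
    return hidden_states[index]

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def cross_attend(query, tokens):
    """Scaled dot-product attention of one query vector over conditioning tokens."""
    scale = math.sqrt(len(query))
    scores = [sum(q * t for q, t in zip(query, tok)) / scale for tok in tokens]
    weights = softmax(scores)
    context = [sum(w * tok[d] for w, tok in zip(weights, tokens))
               for d in range(len(tokens[0]))]
    return context, weights

# Hypothetical stand-in for MLLM outputs: 5 layers, 3 tokens, 2 dimensions each.
hidden_states = [[[float(layer + tok), float(layer - tok)] for tok in range(3)]
                 for layer in range(5)]

# Conditioning tokens come from the middle layer (index 2 here).
cond = select_layer(hidden_states)

# Two queries, one per constituent face, share the same conditioning tokens.
ctx_a, w_a = cross_attend([1.0, 0.0], cond)
ctx_b, w_b = cross_attend([0.0, 1.0], cond)
```

In a real system the queries would come from the denoiser's image features at each diffusion step, and the layer index would be chosen empirically, as the ablation on layer depth suggests.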

Nitish Shukla, Arun Ross • 2026

Related benchmarks

Task              Dataset    Metric                 Result   Rank
Image Demorphing  AMSL       Restoration Accuracy   99.98    26
Image Demorphing  opencv     Restoration Accuracy   100      26
Image Demorphing  FMorph     Restoration Accuracy   100      26
Image Demorphing  StyleGAN   Restoration Accuracy   67.23    26
Image Demorphing  Wmorph     Restoration Accuracy   99.91    26
Image Demorphing  MorDIFF    Restoration Accuracy   99.79    26
Face Demorphing   AMSL       PSNR                   18.36    6
Face Demorphing   opencv     PSNR                   18.34    6
Face Demorphing   FMorph     PSNR                   18.26    6
Face Demorphing   Wmorph     PSNR                   17.99    6
Showing 10 of 13 rows
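For readers unfamiliar with the PSNR metric reported for Face Demorphing, the sketch below computes it from its standard definition, 10 * log10(MAX^2 / MSE). The function name `psnr`, the flat-list pixel representation, and the default peak value of 255 (8-bit images) are assumptions for illustration; the listing does not state how the benchmark computes the metric.

```python
import math

def psnr(reference, estimate, max_value=255.0):
    """Peak signal-to-noise ratio between two equal-length pixel sequences."""
    if not reference or len(reference) != len(estimate):
        raise ValueError("inputs must be non-empty and of equal length")
    # Mean squared error over all pixel values.
    mse = sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)
    if mse == 0:
        return math.inf  # identical images: PSNR is unbounded
    return 10.0 * math.log10(max_value ** 2 / mse)
```

Higher values indicate a reconstruction closer to the reference image.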
