Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

From Pixels to Semantics: A Multi-Stage AI Framework for Structural Damage Detection in Satellite Imagery

About

Rapid and accurate structural damage assessment following natural disasters is critical for effective emergency response and recovery. However, remote sensing imagery often suffers from low spatial resolution, contextual ambiguity, and limited semantic interpretability, reducing the reliability of traditional detection pipelines. In this work, we propose a novel hybrid framework that integrates AI-based super-resolution, deep learning object detection, and Vision-Language Models (VLMs) for comprehensive post-disaster building damage assessment. First, we enhance pre- and post-disaster satellite imagery using a Video Restoration Transformer (VRT) to upscale images from 1024x1024 to 4096x4096 resolution, improving structural detail visibility. Next, a YOLOv11-based detector localizes buildings in pre-disaster imagery, and cropped building regions are analyzed using VLMs to semantically assess structural damage across four severity levels. To ensure robust evaluation in the absence of ground-truth captions, we employ CLIPScore for reference-free semantic alignment and introduce a multi-model VLM-as-a-Jury strategy to reduce individual model bias in safety-critical decision making. Experiments on subsets of the xBD dataset, including the Moore Tornado and Hurricane Matthew events, demonstrate that the proposed framework enhances the semantic interpretation of damaged buildings. In addition, our framework provides helpful recommendations to first responders for recovery based on damage analysis.

Bijay Shakya, Catherine Hoier, Khandaker Mamun Ahmed• 2026

Related benchmarks

TaskDatasetResultRank
Building Damage AssessmentxBD Moore Tornado (cropped building images, upscaled)
Average CLIPScore59.66
4
Building Damage AssessmentxBD Hurricane Matthew (cropped building images, upscaled)
Average CLIPScore58.11
4
Building damage classificationMoore Tornado subset
Accuracy87.1
4
Damage AssessmentxBD Moore Tornado subset cropped buildings
VLM-As-A-Jury88.6
4
Damage AssessmentxBD Matthew Hurricane cropped buildings
VLM-As-A-Jury87.64
4
Damage AssessmentxBD Moore Tornado upscaled full images v1.0
Average CLIPScore63.34
4
Damage AssessmentxBD Matthew Hurricane upscaled full images v1.0
Average CLIPScore62.42
4
Damage AssessmentxBD Moore Tornado upscaled full image
VLM-As-A-Jury93.93
4
Damage AssessmentxBD Hurricane Matthew upscaled full image
VLM-As-A-Jury90.22
4
Damage AssessmentxBD general upscaled full images v1.0--
2
Showing 10 of 10 rows

Other info

Follow for update