Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Prototype-Based Low Altitude UAV Semantic Segmentation

About

Semantic segmentation of low-altitude UAV imagery presents unique challenges due to extreme scale variations, complex object boundaries, and limited computational resources on edge devices. Existing transformer-based segmentation methods achieve remarkable performance but incur high computational overhead, while lightweight approaches struggle to capture fine-grained details in high-resolution aerial scenes. To address these limitations, we propose PBSeg, an efficient prototype-based segmentation framework tailored for UAV applications. PBSeg introduces a novel prototype-based cross-attention (PBCA) that exploits feature redundancy to reduce computational complexity while maintaining segmentation quality. The framework incorporates an efficient multi-scale feature extraction module that combines deformable convolutions (DConv) with context-aware modulation (CAM) to capture both local details and global semantics. Experiments on two challenging UAV datasets demonstrate the effectiveness of the proposed approach. PBSeg achieves 71.86\% mIoU on UAVid and 80.92\% mIoU on UDD6, establishing competitive performance while maintaining computational efficiency. Code is available at https://github.com/zhangda1018/PBSeg.

Da Zhang, Gao Junyu, Zhao Zhiyuan• 2026

Related benchmarks

TaskDatasetResultRank
Semantic segmentationUAVid (test)
mIoU71.86
47
Semantic segmentationUDD6 (test)
Facade Score0.7667
11
Semantic segmentationUDD6 (val)
Façade IoU76.67
11
Semantic segmentationUAVid (val)
Building IoU92.13
10
Showing 4 of 4 rows

Other info

Follow for update