SARATR-X: Toward Building A Foundation Model for SAR Target Recognition
About
Despite the remarkable progress in synthetic aperture radar automatic target recognition (SAR ATR), recent efforts have concentrated on detecting and classifying a specific category, e.g., vehicles, ships, airplanes, or buildings. One of the fundamental limitations of the top-performing SAR ATR methods is that the learning paradigm is supervised, task-specific, limited-category, closed-world learning, which depends on massive amounts of accurately annotated samples that are expensively labeled by expert SAR analysts and have limited generalization capability and scalability. In this work, we make the first attempt towards building a foundation model for SAR ATR, termed SARATR-X. SARATR-X learns generalizable representations via self-supervised learning (SSL) and provides a cornerstone for label-efficient model adaptation to generic SAR target detection and classification tasks. Specifically, SARATR-X is trained on 0.18 M unlabelled SAR target samples, which are curated by combining contemporary benchmarks and constitute the largest publicly available dataset till now. Considering the characteristics of SAR images, a backbone tailored for SAR ATR is carefully designed, and a two-step SSL method endowed with multi-scale gradient features was applied to ensure the feature diversity and model scalability of SARATR-X. The capabilities of SARATR-X are evaluated on classification under few-shot and robustness settings and detection across various categories and scenes, and impressive performance is achieved, often competitive with or even superior to prior fully supervised, semi-supervised, or self-supervised algorithms. Our SARATR-X and the curated dataset are released at https://github.com/waterdisappear/SARATR-X to foster research into foundation models for SAR image interpretation.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| SAR Object Detection | SSDD | mAP5097.3 | 27 | |
| SAR Classification | MSTAR SOCs (Standard operating conditions) 10-way | Accuracy95.9 | 25 | |
| Change Detection | SkyCap SAR Capella X-band 1.0 (test) | F1 Score40.03 | 18 | |
| SAR Classification | MSTAR EOCs (Depression angle variations) 4-way | Accuracy98.9 | 12 | |
| SAR Classification | MSTAR EOCs (Target configuration variations) 4-way | Accuracy78.3 | 12 | |
| SAR Classification | MSTAR EOCs (Target version variations) 4-way | Accuracy82.8 | 12 | |
| Horizontal Object Detection | SARDet-100K (test) | mAP0.573 | 8 | |
| Horizontal Object Detection | SSDD (test) | mAP67.5 | 7 | |
| Object Detection | SARDet-100K | mAP57.3 | 6 | |
| Aircraft detection | SAR-Aircraft | mAP5086.1 | 5 |