SATS: Self-Attention Transfer for Continual Semantic Segmentation
About
Continually learning to segment more and more types of image regions is a desired capability for many intelligent systems. However, such continual semantic segmentation suffers from the same catastrophic forgetting issue as in continual classification learning. While multiple knowledge distillation strategies originally for continual classification have been well adapted to continual semantic segmentation, they only consider transferring old knowledge based on the outputs from one or more layers of deep fully convolutional networks. Different from existing solutions, this study proposes to transfer a new type of information relevant to knowledge, i.e. the relationships between elements (Eg. pixels or small local regions) within each image which can capture both within-class and between-class knowledge. The relationship information can be effectively obtained from the self-attention maps in a Transformer-style segmentation model. Considering that pixels belonging to the same class in each image often share similar visual properties, a class-specific region pooling is applied to provide more efficient relationship information for knowledge transfer. Extensive evaluations on multiple public benchmarks support that the proposed self-attention transfer method can further effectively alleviate the catastrophic forgetting issue, and its flexible combination with one or more widely adopted strategies significantly outperforms state-of-the-art solutions.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Semantic segmentation | Pascal VOC 15-1 setting 2012 (val) | mIoU (all)76.61 | 88 | |
| Semantic segmentation | Pascal VOC 15-5 setting 2012 (val) | mIoU (All)78.72 | 82 | |
| Semantic segmentation | Pascal VOC 10-1 protocol 2012 (val) | mIoU (0-10)76.21 | 46 | |
| Semantic segmentation | Pascal VOC 5-3 protocol 2012 (val) | mIoU (Classes 0-5)75.58 | 29 | |
| Continual Semantic Segmentation | ADE20k 100-10 (6 tasks) (val) | mIoU (101-150)0.1909 | 24 | |
| Class-Incremental Semantic Segmentation | PASCAL VOC 1-1 2012 | mIoU0.1797 | 9 | |
| Class-Incremental Semantic Segmentation | PASCAL VOC 2012 (5-1) | mIoU37.86 | 9 | |
| Class-Incremental Semantic Segmentation | PASCAL VOC 2012 (2-2) | mIoU39.74 | 9 | |
| Class-Incremental Semantic Segmentation | PASCAL VOC 2012 (2-1) | mIoU5.89 | 9 | |
| Semantic segmentation | ADE20K 100-10 protocol (test) | mIoU (Classes 0-100)41.55 | 8 |