Inversion Circle Interpolation: Diffusion-based Image Augmentation for Data-scarce Classification
About
Data Augmentation (DA), i.e., synthesizing faithful and diverse samples to expand the original training set, is a prevalent and effective strategy to improve the performance of various data-scarce tasks. With the powerful image generation ability, diffusion-based DA has shown strong performance gains on different image classification benchmarks. In this paper, we analyze today's diffusion-based DA methods, and argue that they cannot take account of both faithfulness and diversity, which are two critical keys for generating high-quality samples and boosting classification performance. To this end, we propose a novel Diffusion-based DA method: Diff-II. Specifically, it consists of three steps: 1) Category concepts learning: Learning concept embeddings for each category. 2) Inversion interpolation: Calculating the inversion for each image, and conducting circle interpolation for two randomly sampled inversions from the same category. 3) Two-stage denoising: Using different prompts to generate synthesized images in a coarse-to-fine manner. Extensive experiments on various data-scarce image classification tasks (e.g., few-shot, long-tailed, and out-of-distribution classification) have demonstrated its effectiveness over state-of-the-art diffusion-based DA methods.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | Aircraft (AirC) 5-shot | Accuracy54.9 | 30 | |
| Image Classification | CUB 5-shot | Accuracy70.41 | 9 | |
| Long-tail Classification | Flower-LT IF=10 | Accuracy97.62 | 7 | |
| Long-tail Classification | Flower-LT IF=100 | Accuracy (Many)99.82 | 7 | |
| Long-tail Classification | Flower-LT IF=50 | Accuracy95.2 | 7 | |
| Long-tail Classification | CUB-LT IF=100 | Accuracy (Many)87.54 | 7 | |
| Long-tail Classification | CUB-LT IF=50 | Accuracy64.52 | 7 | |
| Long-tail Classification | CUB-LT IF=10 | Accuracy70.28 | 7 |