Revisiting Fine-tuning for Few-shot Learning
About
Few-shot learning is the process of learning novel classes using only a few examples and it remains a challenging task in machine learning. Many sophisticated few-shot learning algorithms have been proposed based on the notion that networks can easily overfit to novel examples if they are simply fine-tuned using only a few examples. In this study, we show that in the commonly used low-resolution mini-ImageNet dataset, the fine-tuning method achieves higher accuracy than common few-shot learning algorithms in the 1-shot task and nearly the same accuracy as that of the state-of-the-art algorithm in the 5-shot task. We then evaluate our method with more practical tasks, namely the high-resolution single-domain and cross-domain tasks. With both tasks, we show that our method achieves higher accuracy than common few-shot learning algorithms. We further analyze the experimental results and show that: 1) the retraining process can be stabilized by employing a low learning rate, 2) using adaptive gradient optimizers during fine-tuning can increase test accuracy, and 3) test accuracy can be improved by updating the entire network when a large domain-shift exists between base and novel classes.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Category-agnostic Pose Estimation | MP-100 (split1-5) | Split 1 Performance71.67 | 14 | |
| Category-agnostic Pose Estimation | MP-100 (test) | Split 1 Performance71.67 | 14 | |
| 5-way Classification | Low-resolution Single-domain 5-shot | Accuracy74.5 | 9 | |
| 5-way Classification | Low-resolution Single-domain 1-shot | Accuracy54.9 | 9 | |
| Category-agnostic Pose Estimation | MP-100 (Split 1) | PCK@0.270.6 | 8 | |
| Category-agnostic Pose Estimation | MP-100 (Split 2) | PCK@0.257.04 | 8 | |
| Category-agnostic Pose Estimation | MP-100 (Split 3) | PCK@0.266.06 | 8 | |
| Category-agnostic Pose Estimation | MP-100 (Split 4) | PCK@0.265 | 8 | |
| Category-agnostic Pose Estimation | MP-100 (Split 5) | PCK@0.259.2 | 8 | |
| Category-agnostic Pose Estimation | MP-100 (Average) | PCK@0.263.58 | 8 |