Learning to Count Anything: Reference-less Class-agnostic Counting with Weak Supervision
About
Current class-agnostic counting methods can generalise to unseen classes but usually require reference images to define the type of object to be counted, as well as instance annotations during training. Reference-less class-agnostic counting is an emerging field that identifies counting as, at its core, a repetition-recognition task. Such methods facilitate counting on a changing set composition. We show that a general feature space with global context can enumerate instances in an image without a prior on the object type present. Specifically, we demonstrate that regression from vision transformer features without point-level supervision or reference images is superior to other reference-less methods and is competitive with methods that use reference images. We show this on the current standard few-shot counting dataset FSC-147. We also propose an improved dataset, FSC-133, which removes errors, ambiguities, and repeated images from FSC-147 and demonstrate similar performance on it. To the best of our knowledge, we are the first weakly-supervised reference-less class-agnostic counting method.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Object Counting | FSC-147 (test) | MAE17.12 | 297 | |
| Crowd Counting | ShanghaiTech Part A (test) | MAE240.1 | 227 | |
| Object Counting | FSC-147 (val) | MAE17.49 | 211 | |
| Crowd Counting | ShanghaiTech Part B (test) | MAE66.6 | 191 | |
| Car Object Counting | CARPK (test) | MAE9.21 | 116 | |
| Object Counting | FSC-147 1.0 (val) | MAE17.49 | 50 | |
| Object Counting | FSC-147 1.0 (test) | MAE17.12 | 50 | |
| Counting | FSC-133 (val) | MAE19.84 | 11 | |
| Counting | FSC-133 (test) | MAE14.23 | 11 | |
| Object Counting and Detection | FSCD147 19 (val) | MAE17.49 | 7 |