Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Dynamic Routing Between Capsules

About

A capsule is a group of neurons whose activity vector represents the instantiation parameters of a specific type of entity such as an object or an object part. We use the length of the activity vector to represent the probability that the entity exists and its orientation to represent the instantiation parameters. Active capsules at one level make predictions, via transformation matrices, for the instantiation parameters of higher-level capsules. When multiple predictions agree, a higher level capsule becomes active. We show that a discrimininatively trained, multi-layer capsule system achieves state-of-the-art performance on MNIST and is considerably better than a convolutional net at recognizing highly overlapping digits. To achieve these results we use an iterative routing-by-agreement mechanism: A lower-level capsule prefers to send its output to higher level capsules whose activity vectors have a big scalar product with the prediction coming from the lower-level capsule.

Sara Sabour, Nicholas Frosst, Geoffrey E Hinton• 2017

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-100 (test)--
3518
Image ClassificationCIFAR-10 (test)--
3381
Image ClassificationCIFAR-10 (test)
Accuracy89.4
906
Image ClassificationMNIST (test)
Accuracy99.75
882
Image ClassificationFashion MNIST (test)
Accuracy93.65
568
Image ClassificationCIFAR-10--
507
Image ClassificationMNIST--
395
Image ClassificationSVHN (test)
Accuracy95.7
362
Image ClassificationMNIST
Accuracy99.75
263
Long-Tailed Image ClassificationImageNet-LT (test)
Top-1 Acc (Overall)46.5
220
Showing 10 of 21 rows

Other info

Follow for update