Going Deeper with Convolutions

About

We propose a deep convolutional neural network architecture codenamed "Inception", which was responsible for setting the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC 2014). The main hallmark of this architecture is the improved utilization of the computing resources inside the network. This was achieved by a carefully crafted design that allows for increasing the depth and width of the network while keeping the computational budget constant. To optimize quality, the architectural decisions were based on the Hebbian principle and the intuition of multi-scale processing. One particular incarnation used in our submission for ILSVRC 2014 is called GoogLeNet, a 22 layers deep network, the quality of which is assessed in the context of classification and detection.

Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, Andrew Rabinovich• 2014

Related benchmarks

Task	Dataset	Result
Image Classification	CIFAR-100 (test)	--	3518
Image Classification	CIFAR-10 (test)	--	3381
Image Classification	ImageNet-1k (val)	Top-1 Accuracy69.8	1498
Image Classification	ImageNet (val)	Top-1 Acc68.7	1206
Image Classification	ImageNet 1k (test)	--	939
Image Classification	Tiny ImageNet (test)	Accuracy46	859
Person Re-Identification	MSMT17	mAP0.23	546
Person Re-Identification	MSMT17 (test)	Rank-1 Acc47.6	517
Image Classification	ImageNet	Top-1 Accuracy30.2	431
Image Classification	ImageNet (val)	Top-1 Accuracy83.1	354

Showing 10 of 80 rows

...

Other info

Follow for update

@wizwand_team Discord