ImageNet Classification with Deep Convolutional Neural Networks (Krizhevsky et al.'s 2012 paper)
Introduction:
To recognize objects in realistic settings, it is necessary to use much larger training sets. In this study, ImageNet was used, which has 15 million labeled images in over 22,000 categories.
But given the complexity of the recognition task, a large dataset alone is not enough. Convolutional neural networks (CNNs) are a convenient choice because their learning capacity can be controlled by varying their depth and breadth, they make strong and mostly correct assumptions about the nature of images, and they have far fewer connections and parameters than comparable feedforward networks, which makes them easier to train.
The model was trained on the subsets of ImageNet used in the ILSVRC-2010 and ILSVRC-2012 competitions and achieved, at the time the paper was published, the best results reported on these datasets.
Procedures:
Pre-processing of the images:
The AlexNet system requires a constant input dimensionality, so the images were down-sampled to a fixed resolution of 256 x 256: the shorter side is rescaled to 256 and the central 256 x 256 patch is cropped out.
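Here is a minimal sketch of that kind of preprocessing using PIL, assuming the interpretation above (shorter side to 256, central crop); the paper also subtracts the mean activity over the training set, which is not reproduced here.

```python
from PIL import Image
import numpy as np

def preprocess(path, size=256):
    """Rescale the shorter side to `size`, then take the central size x size crop."""
    img = Image.open(path).convert("RGB")
    w, h = img.size
    scale = size / min(w, h)
    img = img.resize((round(w * scale), round(h * scale)), Image.BILINEAR)
    w, h = img.size
    left, top = (w - size) // 2, (h - size) // 2
    img = img.crop((left, top, left + size, top + size))
    # Mean subtraction would go here, using the training-set mean image.
    return np.asarray(img, dtype=np.float32)
```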
Architecture:
The network has 60 million parameters and 650,000 neurons. It consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully connected layers with a final 1000-way softmax.
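For reference, here is a simplified PyTorch sketch of that layer stack (five conv layers, some with max-pooling, then three fully connected layers and a 1000-way output). It omits the local response normalization layers, the two-GPU split, and dropout from the original, so treat it as an approximation rather than the exact model; the layer sizes follow the commonly cited configuration for 224 x 224 input crops.

```python
import torch.nn as nn

# AlexNet-style stack: 5 conv layers (some followed by max-pooling)
# and 3 fully connected layers ending in a 1000-way classifier.
alexnet_like = nn.Sequential(
    nn.Conv2d(3, 96, kernel_size=11, stride=4, padding=2), nn.ReLU(),
    nn.MaxPool2d(kernel_size=3, stride=2),
    nn.Conv2d(96, 256, kernel_size=5, padding=2), nn.ReLU(),
    nn.MaxPool2d(kernel_size=3, stride=2),
    nn.Conv2d(256, 384, kernel_size=3, padding=1), nn.ReLU(),
    nn.Conv2d(384, 384, kernel_size=3, padding=1), nn.ReLU(),
    nn.Conv2d(384, 256, kernel_size=3, padding=1), nn.ReLU(),
    nn.MaxPool2d(kernel_size=3, stride=2),
    nn.Flatten(),
    nn.Linear(256 * 6 * 6, 4096), nn.ReLU(),
    nn.Linear(4096, 4096), nn.ReLU(),
    nn.Linear(4096, 1000),  # softmax is applied inside the loss (e.g. CrossEntropyLoss)
)
```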
Reducing over-fitting:
The authors used data augmentation, which I guess is the oldest trick in the industry for this purpose, maybe? Well, it's effective.
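If you want to see what that augmentation looks like in code, here is a hedged sketch using torchvision.transforms: the paper extracts random 224 x 224 crops (and their horizontal reflections) from the 256 x 256 images; it also applies a PCA-based color perturbation, which is omitted here.

```python
from torchvision import transforms

# Random 224x224 crops of the 256x256 images plus horizontal flips,
# roughly as described in the paper; the PCA color jitter is omitted.
train_augmentation = transforms.Compose([
    transforms.RandomCrop(224),
    transforms.RandomHorizontalFlip(p=0.5),
    transforms.ToTensor(),
])
```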
They also used dropout in the first two fully connected layers, which they found very useful, noting that without it AlexNet "exhibits substantial over-fitting".
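As a rough picture of what dropout does, here is a tiny NumPy sketch: during training each unit's activation is zeroed with probability 0.5. The paper instead multiplies the outputs by 0.5 at test time; the "inverted" scaling below is the common modern equivalent and is an assumption on my part, not the paper's exact formulation.

```python
import numpy as np

def dropout(activations, p=0.5, training=True):
    """Zero each unit with probability p during training; pass through at test time."""
    if not training:
        return activations
    mask = (np.random.rand(*activations.shape) >= p).astype(activations.dtype)
    return activations * mask / (1.0 - p)  # scale so expected activation is unchanged
```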
Results:
Krizhevsky's net achieved top-1 and top-5 test error rates of 37.5% and 17.0%; compared to the best previously published results, 45.7% and 25.7% respectively, that's quite an improvement.
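For readers unfamiliar with the metric: top-5 error counts a prediction as wrong only if the true label is not among the model's five highest-scoring classes. A small sketch of how one might compute both rates from raw scores (my own illustration, not code from the paper):

```python
import numpy as np

def top_k_error(scores, labels, k):
    """Fraction of examples whose true label is not among the k highest scores.
    `scores` has shape (N, num_classes); `labels` has shape (N,)."""
    topk = np.argsort(scores, axis=1)[:, -k:]      # indices of the k best classes
    hits = (topk == labels[:, None]).any(axis=1)   # is the true label among them?
    return 1.0 - hits.mean()

# top_k_error(scores, labels, 1) -> top-1 error; top_k_error(scores, labels, 5) -> top-5 error
```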
Conclusions:
- Deep CNNs can achieve good results on a big dataset using just supervised learning.
- The network's performance degrades if any single convolutional layer is removed.
Personal notes:
Whenever I read an article about deep convolutional networks, 80% of the time it mentions AlexNet, so, pretty much, the improvements it presents are considered a groundbreaking achievement and earned it the state-of-the-art title.