The University of Sydney Page 1
Convolutional
Neural Networks
Dr Chang Xu
School of Computer Science
History of CNNs
Neocognitron (Kunihiko Fukushima, 1980)
History of CNNs
LeNet-5 (LeCun et al, 1998)
– Built the modern framework of CNNs: Convolutional Layer, Pooling
Layer, and Fully-Connected Layer
– Trained with the backpropagation algorithm
– Classifies handwritten digits; however, it cannot perform well on more
complex problems, e.g., large-scale image and video classification
History of CNNs
Linear classifier: 8% ~ 12% error
K-nearest-neighbor: 1.x% ~ 5% error
Support Vector Machine: 0.6% ~ 1.4% error
(Conventional) Neural Nets: 1% ~ 5% error
“The MNIST Database”
History of CNNs
AlexNet (Krizhevsky et al, 2012)
– Significant improvements on the image classification task, ImageNet 2012
– The network achieved a top-5 error of 15.3%, more than 10.8 percentage
points ahead of the runner-up.
– Basic framework of CNNs with a deeper structure
– Benefit from ImageNet dataset, GPUs, ReLU, Dropout …
5 convolutional layers and 3 fully connected layers
Today, CNNs are everywhere
– Image classification, Image segmentation, Pose estimation, Style
transfer, Image detection, Image caption …
(Krizhevsky et al, 2012) (Shaoli et al, 2017) (Jianfeng et al, 2017) (Xinyuan et al, 2018)
Basic CNN Components
A general CNN
– Convolutional Layer
– Pooling
– Fully-connected Layer
(https://leonardoaraujosantos.gitbooks.io)
A toy example
https://github.com/pytorch/examples/blob/master/mnist/main.py
https://pytorch.org/docs/stable/nn.html#convolution-layers
https://pytorch.org/docs/stable/nn.html#linear
Convolution layers in PyTorch
https://pytorch.org/docs/stable/nn.html#convolution-layers
Convolutional Layer
– Give a simple example: take a grayscale image as input
Grayscale image: $X$; filter: $K$; output: feature map
(figure: a $3\times3$ neighbourhood $x_{1,1}, x_{1,2}, \cdots, x_{3,3}$ of the image is weighted by the filter to produce one value of the feature map)
Convolutional Layer
– Convolution
Input (6×6):
1 2 0 1 0 1
2 1 1 0 0 1
1 0 0 2 1 0
2 0 0 0 2 1
0 1 1 2 0 2
1 0 1 0 1 1
Filter (3×3):
1 0 -1
-1 0 0
0 0 1
Output (first value): -1
(The filter then slides across the input one step at a time; successive slides fill in the remaining output values one by one.)
Convolutional Layer
– Convolution
Input (6×6):
1 2 0 1 0 1
2 1 1 0 0 1
1 0 0 2 1 0
2 0 0 0 2 1
0 1 1 2 0 2
1 0 1 0 1 1
Filter (3×3):
1 0 -1
-1 0 0
0 0 1
Output (4×4):
-1 2 0 0
0 1 3 -2
0 0 -1 4
3 -1 -2 -2
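As a check, the whole worked example can be reproduced in a few lines of plain Python. This is an illustrative re-implementation, not the lecture's code; in PyTorch, `nn.Conv2d` performs the same sliding-window sum of products, with learned weights:

```python
def conv2d(image, kernel):
    """Valid 2-D cross-correlation (what deep-learning libraries call convolution)."""
    kh, kw = len(kernel), len(kernel[0])
    return [[sum(image[i + m][j + n] * kernel[m][n]
                 for m in range(kh) for n in range(kw))
             for j in range(len(image[0]) - kw + 1)]
            for i in range(len(image) - kh + 1)]

image = [[1, 2, 0, 1, 0, 1],
         [2, 1, 1, 0, 0, 1],
         [1, 0, 0, 2, 1, 0],
         [2, 0, 0, 0, 2, 1],
         [0, 1, 1, 2, 0, 2],
         [1, 0, 1, 0, 1, 1]]
kernel = [[1, 0, -1],
          [-1, 0, 0],
          [0, 0, 1]]

feature_map = conv2d(image, kernel)
# → [[-1, 2, 0, 0], [0, 1, 3, -2], [0, 0, -1, 4], [3, -1, -2, -2]]
```

Each output value is the sum of the element-wise products between the filter and one 3×3 window of the input.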
Convolutional Layer
– Stride
Input (6×6):
1 2 0 1 0 1
2 1 1 0 0 1
1 0 0 2 1 0
2 0 0 0 2 1
0 1 1 2 0 2
1 0 1 0 1 1
Filter (3×3):
1 0 -1
-1 0 0
0 0 1
Stride = 1: the first two output values are -1 and 2.
The stride size is defined by how much you want to shift your filter at each step.
Convolutional Layer
– Stride
Input (6×6):
1 2 0 1 0 1
2 1 1 0 0 1
1 0 0 2 1 0
2 0 0 0 2 1
0 1 1 2 0 2
1 0 1 0 1 1
Filter (3×3):
1 0 -1
-1 0 0
0 0 1
Stride = 3: the first two output values are -1 and 0.
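The stride simply changes which window positions are evaluated. A small sketch, reusing the example's input and filter, reproduces the stride-3 values shown on the slides:

```python
def conv2d_stride(image, kernel, stride=1):
    """Valid 2-D cross-correlation, shifting the filter `stride` pixels per step."""
    kh, kw = len(kernel), len(kernel[0])
    return [[sum(image[i + m][j + n] * kernel[m][n]
                 for m in range(kh) for n in range(kw))
             for j in range(0, len(image[0]) - kw + 1, stride)]
            for i in range(0, len(image) - kh + 1, stride)]

image = [[1, 2, 0, 1, 0, 1],
         [2, 1, 1, 0, 0, 1],
         [1, 0, 0, 2, 1, 0],
         [2, 0, 0, 0, 2, 1],
         [0, 1, 1, 2, 0, 2],
         [1, 0, 1, 0, 1, 1]]
kernel = [[1, 0, -1],
          [-1, 0, 0],
          [0, 0, 1]]

# Stride 1 visits every window and gives the 4x4 map from the previous slides;
# stride 3 evaluates only every third window, giving a 2x2 map.
conv2d_stride(image, kernel, stride=3)  # → [[-1, 0], [3, -2]]
```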
Convolutional Layer
– Zero padding (pad = 1)
(figure: the 6×6 input surrounded by a one-pixel border of zeros, giving an 8×8 padded input)
By doing this you can apply the filter to every element of your input matrix.
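Zero padding can be sketched as wrapping the input in a border of zeros before convolving; with pad = 1 and a 3×3 filter, the output keeps the 6×6 input size. An illustrative sketch, not the lecture's code:

```python
def pad(image, p):
    """Surround a 2-D map with a border of p zeros."""
    width = len(image[0]) + 2 * p
    border = [[0] * width for _ in range(p)]
    return border + [[0] * p + row + [0] * p for row in image] + border

def conv2d(image, kernel):
    """Valid 2-D cross-correlation."""
    kh, kw = len(kernel), len(kernel[0])
    return [[sum(image[i + m][j + n] * kernel[m][n]
                 for m in range(kh) for n in range(kw))
             for j in range(len(image[0]) - kw + 1)]
            for i in range(len(image) - kh + 1)]

image = [[1, 2, 0, 1, 0, 1],
         [2, 1, 1, 0, 0, 1],
         [1, 0, 0, 2, 1, 0],
         [2, 0, 0, 0, 2, 1],
         [0, 1, 1, 2, 0, 2],
         [1, 0, 1, 0, 1, 1]]
kernel = [[1, 0, -1],
          [-1, 0, 0],
          [0, 0, 1]]

padded = pad(image, 1)         # 8x8: every original pixel now has a full window
same = conv2d(padded, kernel)  # 6x6 output: "same" spatial size as the input
```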
Convolutional Layer
– Output Size
With input size $W$, filter size $F$, padding $P$, and stride $S$:
$$\text{Output Size} = \frac{W - F + 2P}{S} + 1$$
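A one-line check of the formula (floor division mirrors how frameworks discard incomplete windows); the example numbers below come from the earlier slides:

```python
def output_size(w, f, p, s):
    """Output size of a convolution: (W - F + 2P) // S + 1."""
    return (w - f + 2 * p) // s + 1

assert output_size(w=6, f=3, p=0, s=1) == 4  # the 4x4 map from the example
assert output_size(w=6, f=3, p=0, s=3) == 2  # the stride-3 slide
assert output_size(w=6, f=3, p=1, s=1) == 6  # pad = 1 keeps the "same" size
```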
Learn multiple filters
Input (6×6):
1 2 0 1 0 1
2 1 1 0 0 1
1 0 0 2 1 0
2 0 0 0 2 1
0 1 1 2 0 2
1 0 1 0 1 1
Filter 1 (3×3):
1 0 -1
-1 0 0
0 0 1
Feature map 1 (4×4):
-1 2 0 0
0 1 3 -2
0 0 -1 4
3 -1 -2 -2
Filter 2 (3×3; its exact entries are garbled in the extracted slides) produces a second feature map in the same way:
Feature map 2 (4×4):
3 2 4 -1
1 0 1 4
1 2 4 1
-1 0 3 4
…
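Stacking feature maps from several filters can be sketched as below. Filter 2's entries are not legible in the extracted slides, so a stand-in second filter is used here purely to show the mechanics of building one map per filter:

```python
def conv2d(image, kernel):
    """Valid 2-D cross-correlation."""
    kh, kw = len(kernel), len(kernel[0])
    return [[sum(image[i + m][j + n] * kernel[m][n]
                 for m in range(kh) for n in range(kw))
             for j in range(len(image[0]) - kw + 1)]
            for i in range(len(image) - kh + 1)]

image = [[1, 2, 0, 1, 0, 1],
         [2, 1, 1, 0, 0, 1],
         [1, 0, 0, 2, 1, 0],
         [2, 0, 0, 0, 2, 1],
         [0, 1, 1, 2, 0, 2],
         [1, 0, 1, 0, 1, 1]]

filters = [
    [[1, 0, -1], [-1, 0, 0], [0, 0, 1]],  # filter 1 from the slides
    [[0, 2, 1], [0, 1, -1], [-1, 1, 0]],  # stand-in for filter 2 (original garbled)
]

# One 4x4 feature map per filter; together they form a stack of feature maps.
feature_maps = [conv2d(image, k) for k in filters]
```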
Convolutional Layer
– Above, we have only considered a 2-D image as input
– When the input has depth (e.g. RGB images), the
convolution ops should be…
$%×&%×’%
(&×(‘×’%
$#×&#
Convolutional Layer
Two filters
Stride=2
Zero-padding=1
(figure, animated over several slides: the two filters step across the zero-padded input with stride 2, each producing its own output map)
Convolutional Layer
– Suppose the stride is $(S_w, S_h)$ and the pad is $(P_w, P_h)$:
$$W_2 = \frac{W_1 - F + 2P_w}{S_w} + 1 \qquad H_2 = \frac{H_1 - F + 2P_h}{S_h} + 1$$
Input: $W_1 \times H_1 \times D_1$; filters: $F \times F \times D_1$; output: $W_2 \times H_2$.
Convolution as a matrix operation
– If the input $x^{l-1}$ and output $x^{l}$ were to be unrolled into vectors, the convolution could be represented as a sparse matrix $C^{l-1}$ where the non-zero elements are the elements $w_{i,j}$ of the kernel:
$$C^{l-1}\,\big(x^{l-1}_{1,1}, \cdots, x^{l-1}_{4,4}\big)^{\top} = \big(x^{l}_{1,1}, \cdots, x^{l}_{2,2}\big)^{\top}$$
(figure: a $3\times3$ kernel $w$ applied to a $4\times4$ input $x^{l-1}$; each row of $C^{l-1}$ contains the kernel weights $w_{1,1}, \cdots, w_{3,3}$ in the columns of the corresponding input window and zeros elsewhere)
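The unrolled form can be verified mechanically: build the sparse matrix $C$ from the kernel, multiply it with the flattened input, and compare against the direct sliding-window result. An illustrative sketch for a 3×3 kernel on a 4×4 input:

```python
def conv2d(image, kernel):
    """Valid 2-D cross-correlation."""
    kh, kw = len(kernel), len(kernel[0])
    return [[sum(image[i + m][j + n] * kernel[m][n]
                 for m in range(kh) for n in range(kw))
             for j in range(len(image[0]) - kw + 1)]
            for i in range(len(image) - kh + 1)]

def conv_matrix(kernel, in_h, in_w):
    """Sparse matrix C such that C @ vec(x) == vec(conv2d(x, kernel))."""
    kh, kw = len(kernel), len(kernel[0])
    out_h, out_w = in_h - kh + 1, in_w - kw + 1
    C = [[0] * (in_h * in_w) for _ in range(out_h * out_w)]
    for i in range(out_h):
        for j in range(out_w):
            for m in range(kh):
                for n in range(kw):
                    # kernel weight w[m][n] multiplies input pixel (i+m, j+n)
                    C[i * out_w + j][(i + m) * in_w + (j + n)] = kernel[m][n]
    return C

x = [[1, 2, 0, 1], [2, 1, 1, 0], [1, 0, 0, 2], [2, 0, 0, 0]]  # 4x4 input
w = [[1, 0, -1], [-1, 0, 0], [0, 0, 1]]                       # 3x3 kernel

C = conv_matrix(w, 4, 4)  # 4 output pixels x 16 input pixels
vec_x = [v for row in x for v in row]
vec_y = [sum(c * v for c, v in zip(row, vec_x)) for row in C]  # C @ vec(x)
direct = [v for row in conv2d(x, w) for v in row]
assert vec_y == direct  # matrix form agrees with the sliding window
```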
Back-propagation in convolutional layer
– Forward pass (matrix form): $C^{l-1}\,\big(x^{l-1}_{1,1}, \cdots, x^{l-1}_{4,4}\big)^{\top} = \big(x^{l}_{1,1}, \cdots, x^{l}_{2,2}\big)^{\top}$
– Backward pass, gradient of the loss w.r.t. the kernel weights:
$$\frac{\partial \mathrm{Loss}}{\partial w_{i,j}} = \sum_{h,w} \frac{\partial \mathrm{Loss}}{\partial x^{l}_{h,w}}\,\frac{\partial x^{l}_{h,w}}{\partial w_{i,j}}, \qquad \frac{\partial x^{l}_{h,w}}{\partial w_{i,j}} = x^{l-1}_{h+i-1,\,w+j-1}$$
– Backward pass, gradient of the loss w.r.t. the input:
$$\frac{\partial \mathrm{Loss}}{\partial x^{l-1}} = \big(C^{l-1}\big)^{\top}\,\frac{\partial \mathrm{Loss}}{\partial x^{l}}$$
(Note that $x^{l}_{u}$ represents the $u$-th element of the unrolled vector $x^{l}$; here $u = (h-1)\times W + w$.)
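The weight-gradient formula can be sanity-checked numerically. With a toy loss that just sums the output (so every $\partial \mathrm{Loss}/\partial x^{l}_{h,w} = 1$), the analytic gradient reduces to sums of input windows, which finite differences should match. An illustrative sketch with zero-based indices:

```python
def conv2d(image, kernel):
    """Valid 2-D cross-correlation."""
    kh, kw = len(kernel), len(kernel[0])
    return [[sum(image[i + m][j + n] * kernel[m][n]
                 for m in range(kh) for n in range(kw))
             for j in range(len(image[0]) - kw + 1)]
            for i in range(len(image) - kh + 1)]

def loss(image, kernel):
    """Toy loss: sum of all outputs, so dLoss/dx^l = 1 everywhere."""
    return sum(v for row in conv2d(image, kernel) for v in row)

X = [[0.5, -1.0, 2.0, 0.0],
     [1.5, 0.0, -0.5, 1.0],
     [0.0, 2.0, 1.0, -1.0],
     [1.0, 0.5, 0.0, 2.0]]
W = [[1.0, 0.0, -1.0], [-1.0, 0.0, 0.0], [0.0, 0.0, 1.0]]

# Analytic: dLoss/dw[m][n] = sum over output positions (i, j) of X[i+m][j+n]
analytic = [[sum(X[i + m][j + n] for i in range(2) for j in range(2))
             for n in range(3)] for m in range(3)]

# Numerical check by central finite differences
eps = 1e-6
for m in range(3):
    for n in range(3):
        W[m][n] += eps
        up = loss(X, W)
        W[m][n] -= 2 * eps
        down = loss(X, W)
        W[m][n] += eps  # restore the original weight
        assert abs((up - down) / (2 * eps) - analytic[m][n]) < 1e-5
```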
Receptive Field
Receptive Field
– The receptive field in Convolutional Neural Networks (CNN) is the
region of the input space that affects a particular unit of the network.
– In this example, we use a convolution filter $k$ of size $3\times3$,
padding $p = 1$, stride $s = 2\times2$.
Receptive Field
– From the left column it is hard to tell the receptive field size,
especially for deep CNNs.
– The right column shows the fixed-sized CNN visualization, which
solves the problem by keeping the size of all feature maps constant
and equal to the input map. Each feature is then marked at the center
of its receptive field location.
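The receptive field of a deep stack can also be computed rather than visualized. A standard recurrence (an illustrative helper, not from the slides): each layer adds $(k - 1) \times \text{jump}$ input pixels, where jump is the product of all earlier strides:

```python
def receptive_field(layers):
    """layers: list of (kernel_size, stride); returns the receptive field,
    in input pixels, of one unit after the last layer."""
    rf, jump = 1, 1  # jump: input-pixel distance between adjacent units
    for k, s in layers:
        rf += (k - 1) * jump
        jump *= s
    return rf

receptive_field([(3, 2)])          # → 3 (one 3x3 conv sees a 3x3 patch)
receptive_field([(3, 2), (3, 2)])  # → 7 (stacking grows the field quickly)
```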
Convolution layers in PyTorch
https://pytorch.org/docs/stable/nn.html#convolution-layers
Dilated Convolution
– In simple terms, a dilated convolution is just a convolution applied to the input with defined gaps.
– Dilation: spacing between kernel elements. Default: 1.
– D = 2 means one pixel is skipped between sampled inputs.
– The receptive field grows exponentially while the number of parameters grows linearly.
(Yu et al, 2015)
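The exponential-growth claim can be checked with the same receptive-field recurrence: a size-$k$ kernel with dilation $d$ covers $k + (k-1)(d-1)$ input pixels, so stacking 3×3 convolutions with dilations 1, 2, 4 (as in Yu et al, 2015) reaches a 15-pixel receptive field while each layer still has only 9 weights. An illustrative sketch:

```python
def effective_kernel(k, d):
    """A size-k kernel with dilation d spans k + (k - 1) * (d - 1) input pixels."""
    return k + (k - 1) * (d - 1)

def receptive_field(layers):
    """layers: list of (kernel_size, stride, dilation)."""
    rf, jump = 1, 1
    for k, s, d in layers:
        rf += (effective_kernel(k, d) - 1) * jump
        jump *= s
    return rf

# Three stacked 3x3 convs, stride 1, dilations 1, 2, 4:
receptive_field([(3, 1, 1), (3, 1, 2), (3, 1, 4)])  # → 15, with only 3 x 9 weights
```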
Pooling
Pooling
Max pooling
– Filter size: (2,2)
– Stride: (2,2)
– Pooling ops: max(·)
Feature map:
-1 2 0 0
0 1 3 -2
0 0 -1 4
3 -1 -2 -2
Subsample map:
2 3
3 4
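The subsample map can be reproduced with a small helper; this is an illustrative sketch (PyTorch's `nn.MaxPool2d(2)` performs the same operation):

```python
def pool2x2(fmap, op):
    """Non-overlapping 2x2 pooling with stride 2, applying `op` to each window."""
    return [[op([fmap[i][j], fmap[i][j + 1], fmap[i + 1][j], fmap[i + 1][j + 1]])
             for j in range(0, len(fmap[0]), 2)]
            for i in range(0, len(fmap), 2)]

fmap = [[-1, 2, 0, 0],
        [0, 1, 3, -2],
        [0, 0, -1, 4],
        [3, -1, -2, -2]]

pool2x2(fmap, max)  # → [[2, 3], [3, 4]]
```

Passing `lambda w: sum(w) / len(w)` instead of `max` gives the average pooling discussed on a later slide.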
Motivation: Pooling
– Pooling helps the representation become slightly invariant to
small translations of the input
– Invariance to local translation can be a very useful property if we care
more about whether some feature is present than exactly where it is
– Taking max pooling as an example:
(figure: a row of detector outputs and the corresponding max-pooled outputs, before and after shifting the input by one pixel; most of the pooled values are unchanged)
Motivation: Pooling
– Because pooling summarizes the responses over a whole
neighbourhood, it is possible to use fewer pooling units than
detector units
– Since pooling is used for down sampling, it can be used to
handle inputs of varying sizes
(figure: six detector outputs pooled with a stride of two are summarized by three pooled outputs)
Pooling
Average pooling
– Filter size: (2,2)
– Stride: (2,2)
– Pooling ops: mean(·)
Feature map:
-1 4 1 2
0 1 3 -2
1 5 -2 6
3 -1 -2 -2
Max pooling:
4 3
5 6
Average pooling:
1 1
2 0
Pooling
$\ell_2$ norm pooling
– Filter size (Gaussian kernel size): (2,2)
– Stride: (2,2)
– Pooling ops: $y_k = \big(\sum_i g_i\, x_{k,i}^{2}\big)^{1/2}$, where the $g_i$ are the Gaussian window weights and the $x_{k,i}$ are the activations in the $k$-th window
(figure: a $4\times4$ feature map $x_{1,1}, \cdots, x_{4,4}$, a $2\times2$ Gaussian window $g_1, \cdots, g_4$, and the $2\times2$ output $y_1, \cdots, y_4$)
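A minimal sketch of weighted $\ell_p$ pooling over one window, assuming the Gaussian-weighted root-sum-square form above (the weights are placeholders; with $p = 2$ and uniform weights this is plain $\ell_2$ pooling):

```python
def lp_pool(window, weights, p=2):
    """Weighted l_p pooling of one window: (sum_i g_i * |x_i|**p) ** (1/p).
    p = 1 gives a weighted sum; large p approaches max pooling."""
    return sum(g * abs(x) ** p for g, x in zip(weights, window)) ** (1.0 / p)

# With uniform weights and p = 2, a window holding 3 and 4 pools to 5:
lp_pool([3, 4], [1, 1], p=2)  # → 5.0
```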
Pooling
– Other pooling
– $L_p$ pooling (preserves the class-specific spatial/geometric information in the pooled features)
$$y = \Big(\frac{1}{N}\sum_{i=1}^{N} x_i^{\,p}\Big)^{1/p}$$
– Mixed pooling (addresses the over-fitting problem)
$$y = \lambda\,\max(x_1, \ldots, x_N) + (1-\lambda)\,\mathrm{mean}(x_1, \ldots, x_N)$$
– Stochastic pooling (hyper-parameter free, regularizes large CNNs)
$$y = x_k, \quad k \sim P(p_1, \cdots, p_N), \quad p_i = \frac{x_i}{\sum_j x_j}$$
– Spectral pooling (preserves considerably more information per parameter than other pooling strategies): take $F = \mathcal{F}(x) \in \mathbb{C}^{H\times W}$, crop the low-frequency submatrix $\hat F \in \mathbb{C}^{\hat H \times \hat W}$, and invert: $\hat x = \mathcal{F}^{-1}(\hat F)$
– …
Why CNNs?
Motivation: convolution
– Problems of fully connected neural networks
– Every output unit interacts with every input unit
– The number of weights grows rapidly with the size of the input image
– Distant pixels are less correlated
Motivation: convolution
– Locally connected neural net
– Sparse connectivity: a hidden unit is only connected to a local patch
– It is inspired by biological systems, where a cell is sensitive to a small sub-region, called a receptive field
– Here, the receptive field can also be called a filter or kernel
Motivation: convolution
– Problems of locally connected neural nets
– The learned filter is a spatially local pattern
– A hidden node at a higher layer has a larger receptive field in the input
– Stacking many such layers leads to "filters" (no longer linear) that become increasingly "global" (Ranzato, CVPR'13)
Motivation: convolution
– Shared weights
– Translation invariance: capture statistics in local patches, independent of their location
– Hidden nodes at different locations share the same weights.
It greatly reduces the number of parameters to learn.
Example: 1000×1000 image, 1 filter of size 10×10 → 100 parameters (Ranzato, CVPR'13)
Motivation: convolution
– Multiple filters
– Multiple filters make it possible to detect the spatial distributions of multiple visual patterns
– One filter builds one feature map; multiple filters build a stack of feature maps
Example: 1000×1000 image, 100 filters of size 10×10 → 10k parameters (Ranzato, CVPR'13)
Motivation: convolution
– Multiple filters: intuitive examples
(figure: the same input processed by different filters: image blurring, edge detection, image enhancement, vertical-edge detection)
Visualize features
– Why do CNNs work so well? Hierarchical convolution and nonlinear operations (ReLU, max pooling, …). What happens inside the hidden layers?
(figure: an input image mapped through the network to class scores, 1000 numbers)
Visualize features
– Give insight into the function of intermediate feature layers and the operation of the classifier (Zeiler and Fergus, 2014)
Thank you!