CS计算机代考程序代写 python deep learning IOS cuda GPU flex android c++ Keras algorithm Deep Learning - COSC2779 - Deep Learning Hardware and software

Deep Learning – COSC2779 – Deep Learning Hardware and software

Deep Learning – COSC2779
Deep Learning Hardware and software

Dr. Ruwan Tennakoon

July 26, 2021

Lecture 2 (Part 2) Deep Learning – COSC2779 July 26, 2021 1 / 19

Why Now?

Big Data
Larger Data sets.

Easier collection and storage.

Computation
Graphic Processing Units.
Massively parallelizable.

Software
Improved Algorithms

Widely available open source
frameworks.

Lecture 2 (Part 2) Deep Learning – COSC2779 July 26, 2021 2 / 19

Computation

Most neural network operations can be represented as matrix manipulations.




h1
h2
h3
h4


 =




w11 w12 w13
w21 w22 w23
w31 w32 w33
w41 w42 w43





x1x2




How can we do such operations faster?

Lecture 2 (Part 2) Deep Learning – COSC2779 July 26, 2021 3 / 19

CPU vs GPU

A GPU is a specialized processor with dedicated memory that conventionally perform floating
point operations required for rendering graphics

Image: Fast.ai

Good article on CPU vs GPU

Lecture 2 (Part 2) Deep Learning – COSC2779 July 26, 2021 4 / 19

https://blogs.nvidia.com/blog/2009/12/16/whats-the-difference-between-a-cpu-and-a-gpu/

Simple Comparison – CPU vs GPU

Simple matrix multiplication.
The code was run on a colab
instance with Nvidia Tesla
K80 GPU. Speedup of
≈ 175x

Numpy: 3.8s

import numpy as np
import timeit
A = np.random.rand(5000, 5000).astype(np.float32)
B = np.random.rand(5000, 5000).astype(np.float32)

timer = timeit.Timer(“numpy.dot(A, B)”,
“import numpy; from __main__ import A, B”)

numpy_times_list = timer.repeat(10, 1)
print(‘Numpy mean time (s); ‘, np.mean(numpy_times_list))

Tensorflow: 0.02s

import tensorflow as tf

A = tf.convert_to_tensor(A)
B = tf.convert_to_tensor(B)
timer = timeit.Timer(“tensorflow.matmul(A, B)”,

setup=”import tensorflow; from __main__ import A, B”)
tensorflow_times_list = timer.repeat(10, 1)

print(‘TensorFlow mean time (s); ‘, np.mean(tensorflow_times_list))

Lecture 2 (Part 2) Deep Learning – COSC2779 July 26, 2021 5 / 19

CPU vs GPU in practice

Data from: https://github.com/jcjohnson/cnn-benchmarks
∗ cuDNN can optimize CPU performance somewhat.

Lecture 2 (Part 2) Deep Learning – COSC2779 July 26, 2021 6 / 19

https://github.com/jcjohnson/cnn-benchmarks

GPU Programming

CUDA
NVIDIA GPUs only.
Write C-like code that runs directly on
the GPU.

OpenCL
Any GPU type.

C code:

void linear_serial(int n, float a, float *x, float *y)
for(int i = 0; i < n; i++) y[i] = a*x[i] + y[i]; } // Invoke serial function linear_serial(n, 2.0, x, y); CUDA Code: __global__ linear_parallel(int n, float a, float *x, float *y){ int i = blockIdx.x*blockDim.x + threadIdx.x; if(i>>(n, 2.0, x, y)

Lecture 2 (Part 2) Deep Learning – COSC2779 July 26, 2021 7 / 19

Deep Learning Frameworks

Tensorflow: Google
PyTorch: Facebook, NYU
Caffe: UC Berkeley
PaddlePaddle: Baidu
CNTK: Microsoft
MXNet: Amazon

We will be using TensorFlow with Keras.

Lecture 2 (Part 2) Deep Learning – COSC2779 July 26, 2021 8 / 19

Why TensorFlow?

“Python like” coding With
TensorFlow 2.0 (eager execution).
Keras (now integrated to
TensorFlow) is a very easy to use
framework ideal for those who are
just starting out.
Ability to run models on mobile
platforms like iOS and Android.
Backed by Google which indicate
that it will stay around for a
while.
Most popular in industry.

Image: https://twitter.com/karpathy/status/972295865187512320

Lecture 2 (Part 2) Deep Learning – COSC2779 July 26, 2021 9 / 19

Unique mentions of deep learning frameworks in arxiv papers (full text) over time, based on 43K ML papers over last 6 years. So far TF mentioned in 14.3% of all papers, PyTorch 4.7%, Keras 4.0%, Caffe 3.8%, Theano 2.3%, Torch 1.5%, mxnet/chainer/cntk <1%. (cc @fchollet) pic.twitter.com/YOYAvc33iN

— Andrej Karpathy (@karpathy) March 10, 2018

Why TensorFlow?

Tensorflow 2.0 supports
dynamic graphs.
Easy to use multi GPU
for model and data
parallelization.
Also has a c++ front
end.

Lecture 2 (Part 2) Deep Learning – COSC2779 July 26, 2021 10 / 19

How to Build a Simple Model

Dense 1
128, ’ReLu’

Input
[28 × 28] Dropout

Dense 2
10, ’SoftM’ Class

TensorFlow supports multiple ways of
building models.

Keras Sequential API is the easiest way to
define models. Not so flexible.

Good explanation in article: Three ways to create a Keras model

with TensorFlow 2.0

Keras Sequential API

model = tf.keras.models.Sequential([
tf.keras.layers.Flatten(input_shape=(28, 28)),
tf.keras.layers.Dense(128, activation=’relu’),
tf.keras.layers.Dropout(0.2),
tf.keras.layers.Dense(10, activation=’softmax’)

])

model = tf.keras.models.Sequential()
model.add(tf.keras.layers.Flatten(input_shape=(28, 28)))
model.add(tf.keras.layers.Dense(128, activation=’relu’))
model.add(tf.keras.layers.Dropout(0.2))
model.add(tf.keras.layers.Dense(10, activation=’softmax’))

Lecture 2 (Part 2) Deep Learning – COSC2779 July 26, 2021 11 / 19

3 ways to create a Keras model with TensorFlow 2.0 (Sequential, Functional, and Model Subclassing)

How to Build a Simple Model

Dense 1
128, ’ReLu’

Input
[28 × 28] Dropout

Dense 2
10, ’SoftM’ Class

Keras Functional API is more flexible.
Can create more complex models with
branches easily.
Build directed acyclic graphs (DAGs).
Share layers inside the architecture.

Good explanation in article: Three ways to create a Keras model

with TensorFlow 2.0

Keras Functional API

inputs = tf.keras.layers.Input(shape=(28,28))

x = tf.keras.layers.Flatten()(inputs)
x = tf.keras.layers.Dense(128, activation=’relu’)(x)
x = tf.keras.layers.Dropout(0.2)(x)
x = tf.keras.layers.Dense(10, activation=’softmax’)(x)

model = Model(inputs, x, name=”simpleNet”)

Lecture 2 (Part 2) Deep Learning – COSC2779 July 26, 2021 12 / 19

3 ways to create a Keras model with TensorFlow 2.0 (Sequential, Functional, and Model Subclassing)

How to Build a Simple Model

Dense 1
128, ’ReLu’

Input
[28 × 28] Dropout

Dense 2
10, ’SoftM’ Class

Model sub-classing
Keras the Model class is the root class
used to define a model architecture.
We can subclass the Model class and
then insert our architecture definition.
Model sub-classing is fully-customizable
and enables to implement custom
forward-pass of the model.

Good explanation in article: Three ways to create a Keras model

with TensorFlow 2.0

Model sub-classing

class MNISTModel(Model):
def __init__(self):

super(MNISTModel, self).__init__()
self.flatten = Flatten(input_shape=(28, 28))
self.d1 = Dense(128, activation=’relu’)
self.d2 = Dense(10, activation=’softmax’)
self.drop = Dropout(0.2)

def call(self, x):
x = self.flatten(x)
x = self.d1(x)
x = self.drop(x)
return self.d2(x)

model = MNISTModel()

Lecture 2 (Part 2) Deep Learning – COSC2779 July 26, 2021 13 / 19

3 ways to create a Keras model with TensorFlow 2.0 (Sequential, Functional, and Model Subclassing)

Attaching a Loss Function & Training

Dense 1
128, ’ReLu’

Input
[28 × 28] Dropout

Dense 2
10, ’SoftM’ Class

model.compile() Configures the model for
training.
model.fit() Train the parameters of the
model.
model.evaluate() Test the model.

More on building and training simple models in labs week 2,3.

Model training

# initialize the optimizer and compile the model
model.compile(loss=”categorical_crossentropy”, optimizer=’SGD’,

metrics=[“accuracy”])
# train the network
model.fit(x_train, y_train, epochs=5)

test_loss, test_acc = model.evaluate(x_test, y_test)
print(‘Test accuracy:’, test_acc)

Lecture 2 (Part 2) Deep Learning – COSC2779 July 26, 2021 14 / 19

Lab 2 & 3

Minimal example – linear regression with single neuron
Small MLP – MNIST digit classification (“Keras way”)
Another method for doing MLP (“advanced”)
Check pointing – saving/loading models
Visualizing results – Tensorboard.
Self exercises – Fasion MNIST

most online tutorials still contain TensorFlow 1.x code, therefore be careful when using online
resources – if you see any reference to sessions then most probably that is TensorFlow 1.x code

Lecture 2 (Part 2) Deep Learning – COSC2779 July 26, 2021 15 / 19

Lab 4

Particle Physics with Deep Learning.

Understand/implementation of the key elements of deep feed forward neural
networks.

Try different activations
Try different models with varying capacities
Experiment with regularisation
Try different optimisation techniques

Will only use subset of data to save time.

Baldi, P., P. Sadowski, and D. Whiteson. “Searching for Exotic Particles in High-energy
Physics with Deep Learning.” Nature Communications 5 (July 2, 2014)

Lecture 2 (Part 2) Deep Learning – COSC2779 July 26, 2021 16 / 19

Lab 5/6

Loading Image data and Augmentation.

Learn to do data augmentation
Explore different data loading mechanisms in TensorFlow
Implement CNN
Write your own dataloader
Explore more functionality in TensorBoard

Lecture 2 (Part 2) Deep Learning – COSC2779 July 26, 2021 17 / 19

Lab 5/6

Loading Image data.

Read all the images, labels to memory and feed to .fit() function.
tf.data API: Allows you to do advanced operations like augmentation.
Can pipe to create advanced data pipelines.
(easy to use) Keras Image generator. Simple easy to use framework to
read images. Has a relatively large set of advanced functionality including
augmentation.
(advanced) Write your own data loader sub-class. Most flexible option.
Can use when you have multi-input or multi-output models.

Examples:
Face pose example (single output classification).

Lecture 2 (Part 2) Deep Learning – COSC2779 July 26, 2021 18 / 19

Lab 5/6

Data Augmentation.

Simple data augmentation with tf.data API.
(easy to use) Augmentation with Keras Image generator. Simple easy to
use framework to read images.

Examples:
MNIST example
Face pose example.

Lecture 2 (Part 2) Deep Learning – COSC2779 July 26, 2021 19 / 19

Related Posts