Programming assignment 09: Deep Learning
In [1]:
import numpy as np
import matplotlib.pyplot as plt
import tensorflow as tf
%matplotlib inline
Your Task
In this notebook, the skeleton for learning a feed-forward neural network is given. Your task is to complete the functions where required.
Submission
Submit your notebook by January 6, 2019 as a .html file with your code and the respective cell outputs.
You can also take part in our challenge to fit the best model to the cell phone data by submitting your predictions for the test data. The winners of the challenge will be announced after the Christmas break! See the bottom of this file for more information.
Load and Preprocess Data
In this assignment we will work with a dataset containing accelerometer and gyroscope sensor signals of a cell phone. Given the sensor signals, the task is to predict the activity that the cell-phone user is doing, e.g., running, walking, biking, or moving upstairs.
The data consists of 8032 samples. Each feature vector contains 6 measurements from the cell-phone sensors at 20 consecutive time steps, so each feature vector is 120-dimensional. The task is to classify these instances into the following classes: “Walking”, “Standing”, “Sitting”, “Running”, “Upstairs”, “Downstairs”.
More details can be found here: https://becominghuman.ai/deep-learning-for-sensor-based-human-activity-recognition-970ff47c6b6b
Before you start, download the data from https://syncandshare.lrz.de/dl/fiXnRq4MRd8fGJBXH2jQmNcN/homework_09_data.npz and place the file in the same directory as this notebook.
In [2]:
loader = np.load("homework_09_data.npz")
train_data = loader['train_data']
train_labels = loader['train_labels']
val_data = loader['val_data']
val_labels = loader['val_labels']
test_data = loader['test_data']
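As a quick, optional sanity check (assuming the shapes described above, i.e. 120 = 6 sensor channels x 20 time steps and one-hot labels with 6 classes), you can print the array shapes:
# Expected: data arrays with 120 columns, label arrays with 6 columns (one-hot).
print(train_data.shape, train_labels.shape)
print(val_data.shape, val_labels.shape)
print(test_data.shape)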
The skeleton of the class FeedForwardNet is provided in the following. This class implements a feed-forward neural network in TensorFlow. Your task is to complete the parts where it says ### YOUR CODE HERE ###.
To complete the code properly, make sure that you build the computation graph based on the placeholders self.X and self.Y. These two placeholders are created in the build function; you don’t need to create them, only use them.
- self.X: a placeholder of shape [None, D], where the None dimension will be replaced by the number of instances and D is the number of features.
- self.Y: a placeholder of shape [None, K], where the None dimension will be replaced by the number of instances and K is the number of classes.
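For illustration, this is how the two placeholders are typically fed at run time once build() has been completed (a sketch; model and session stand for a built FeedForwardNet instance and an active tf.Session):
# Hypothetical usage: map each placeholder to a NumPy array of matching shape.
feed = {model.X: val_data, model.Y: val_labels}
val_loss_value = session.run(model.loss, feed_dict=feed)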
In [3]:
def batch_data(num_data, batch_size):
    """
    Yield batches with indices until epoch is over.

    Parameters
    ----------
    num_data: int
        The number of samples in the dataset.
    batch_size: int
        The batch size used during training.

    Returns
    -------
    batch_ixs: np.array of ints with shape [batch_size,]
        Yields arrays of indices of size of the batch size until the epoch is over.
    """
    data_ixs = np.random.permutation(np.arange(num_data))
    ix = 0
    while ix + batch_size < num_data:
        batch_ixs = data_ixs[ix:ix+batch_size]
        ix += batch_size
        yield batch_ixs


class FeedForwardNet:
    """
    Simple feed forward neural network class.
    """

    def __init__(self, hidden_sizes, layer_types, name, l2_reg=0.0):
        """
        FeedForwardNet constructor.

        Parameters
        ----------
        hidden_sizes: list of ints
            The sizes of the hidden layers of the network.
        layer_types: list of callables
            The activation functions of the hidden layers.
        name: str
            The name of the network (used for a VariableScope)
        l2_reg: float
            The strength of L2 regularization (0 means no regularization)
        """
        self.hidden_sizes = hidden_sizes
        self.layer_types = layer_types
        self.name = name
        self.dropout = tf.placeholder_with_default(0.0, shape=(), name="dropout")
        self.l2_reg = l2_reg
        self.weights = []
        self.biases = []

    def build(self, data_dim, num_classes):
        """
        Construct the model.

        Parameters
        ----------
        data_dim: int
            The dimensionality of the data samples.
        num_classes: int
            The number of classes.

        Returns
        -------
        None
        """
        self.X = tf.placeholder(shape=[None, data_dim], dtype=tf.float32, name="data")  # [N x D]
        self.Y = tf.placeholder(shape=[None, num_classes], dtype=tf.float32, name="labels")  # [N x K]

        with tf.variable_scope(self.name):
            hidden = self.X
            for ix, hidden_size in enumerate(self.hidden_sizes):
                ### YOUR CODE HERE ###
                hidden = ### YOUR CODE HERE ###

            ### YOUR CODE HERE ###
            self.logits = ### YOUR CODE HERE ###
            self.l2_norm = ### YOUR CODE HERE ###
            self.cross_entropy_loss = ### YOUR CODE HERE ###
            self.accuracy = ### YOUR CODE HERE ###
            self.loss = ### YOUR CODE HERE ###

            self.optimizer = tf.train.AdamOptimizer()
            self.opt_op = self.optimizer.minimize(self.loss, var_list=[*self.weights, *self.biases])

    def train(self, train_data, train_labels, val_data, val_labels, epochs=20,
              dropout=0.0, batch_size=512):
        """
        Train the feed forward neural network.

        Parameters
        ----------
        train_data: np.array, dtype float32, shape [N, D]
            The training data. N corresponds to the number of training samples,
            D to the dimensionality of the data samples.
        train_labels: np.array, shape [N, K]
            The labels of the training data, where K is the number of classes.
        val_data: np.array, dtype float32, shape [N_val, D]
            The validation data. N_val corresponds to the number of validation samples,
            D to the dimensionality of the data samples.
        val_labels: np.array, shape [N_val, K]
            The labels of the validation data, where K is the number of classes.
        epochs: int
            The number of epochs to train for.
        dropout: float
            The dropout rate used during training. 0 corresponds to no dropout.
        batch_size: int
            The batch size used for training.

        Returns
        -------
        None
        """
        train_losses = []
        train_accs = []
        val_losses = []
        val_accs = []

        self.session = tf.Session()
        session = self.session
        with session.as_default():
            session.run(tf.global_variables_initializer())

            tr_loss, tr_acc = session.run(### YOUR CODE HERE ###)
            val_loss, val_acc = session.run(### YOUR CODE HERE ###)
            train_losses.append(tr_loss)
            train_accs.append(tr_acc)
            val_losses.append(val_loss)
            val_accs.append(val_acc)

            for epoch in range(epochs):
                if (epoch + 1) % 25 == 0:
                    print(f"Epoch {epoch+1}/{epochs}")
                for batch_ixs in batch_data(len(train_data), batch_size):
                    _ = session.run(### YOUR CODE HERE ###)
                tr_loss, tr_acc = session.run(### YOUR CODE HERE ###)
                val_loss, val_acc = session.run(### YOUR CODE HERE ###)
                train_losses.append(tr_loss)
                train_accs.append(tr_acc)
                val_losses.append(val_loss)
                val_accs.append(val_acc)

        self.hist = {'train_loss': np.array(train_losses),
                     'train_accuracy': np.array(train_accs),
                     'val_loss': np.array(val_losses),
                     'val_accuracy': np.array(val_accs)}
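For reference, here is one possible way to fill in the placeholders above. This is a sketch, not necessarily the intended solution: it assumes plain fully connected layers with a Glorot-uniform initializer, interprets self.dropout as a drop rate (so keep_prob = 1 - rate), uses softmax cross-entropy on the logits, and evaluates loss and accuracy on the full training and validation sets. Other choices are equally valid.
# --- inside the loop over self.hidden_sizes in build() ---
W = tf.get_variable(f"W_{ix}", shape=[int(hidden.shape[1]), hidden_size],
                    initializer=tf.glorot_uniform_initializer())
b = tf.get_variable(f"b_{ix}", shape=[hidden_size], initializer=tf.zeros_initializer())
self.weights.append(W)
self.biases.append(b)
hidden = self.layer_types[ix](tf.matmul(hidden, W) + b)
hidden = tf.nn.dropout(hidden, keep_prob=1.0 - self.dropout)

# --- output layer and losses, after the loop in build() ---
W_out = tf.get_variable("W_out", shape=[int(hidden.shape[1]), num_classes],
                        initializer=tf.glorot_uniform_initializer())
b_out = tf.get_variable("b_out", shape=[num_classes], initializer=tf.zeros_initializer())
self.weights.append(W_out)
self.biases.append(b_out)
self.logits = tf.matmul(hidden, W_out) + b_out
self.l2_norm = tf.add_n([tf.nn.l2_loss(W) for W in self.weights])
self.cross_entropy_loss = tf.reduce_mean(
    tf.nn.softmax_cross_entropy_with_logits_v2(labels=self.Y, logits=self.logits))
self.accuracy = tf.reduce_mean(
    tf.cast(tf.equal(tf.argmax(self.logits, 1), tf.argmax(self.Y, 1)), tf.float32))
self.loss = self.cross_entropy_loss + self.l2_reg * self.l2_norm

# --- the session.run calls in train() ---
tr_loss, tr_acc = session.run([self.loss, self.accuracy],
                              feed_dict={self.X: train_data, self.Y: train_labels})
val_loss, val_acc = session.run([self.loss, self.accuracy],
                                feed_dict={self.X: val_data, self.Y: val_labels})
_ = session.run(self.opt_op,
                feed_dict={self.X: train_data[batch_ixs],
                           self.Y: train_labels[batch_ixs],
                           self.dropout: dropout})
Note that appending the created variables to self.weights and self.biases matters here, because both the optimizer's var_list and the L2 term are built from these lists.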
Building a Feed-forward Neural Network
In this part, specify the FFNN. To do so, you can set the following fields in the next cell:
- hidden_sizes: a list that contains the number of hidden neurons in the different layers.
- layer_types: a list containing the activation functions of the layers.
For instance, the values in the following cell specify an FFNN with 3 ReLU layers and a softmax output layer. Note that we do not explicitly mention ‘softmax’, because we know that for a classification task the last layer is a softmax layer. Moreover, we do not specify D and K in the variable hidden_sizes, because we know that the first layer has D neurons and the last layer has K neurons.
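As a concrete illustration (a small sketch; D = 120 and K = 6 come from the dataset described above, not from hidden_sizes):
# The full chain of layer widths implied by an example configuration.
D, K = 120, 6
example_hidden_sizes = [64, 32, 16]
print([D] + example_hidden_sizes + [K])  # -> [120, 64, 32, 16, 6]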
You can change the configuration of the network. The sample solution is built with the following configuration.
Let’s start without any regularization. You can set the values epochs and batch_size in the following cell.
In [4]:
# You can change layer types and the number of neurons by changing the following variables.
layer_types = [tf.nn.relu, tf.nn.relu, tf.nn.relu]
hidden_sizes = [64, 32, 16]
epochs = 250
batch_size = 512
Training the Network Using Different Regularizations
In this part, we train the neural network in three different settings:
- Without any regularization
- With ℓ2 regularization
- With dropout
For each case, we are going to see how the training and validation losses change during training.
In [5]:
NN_no_regularization = FeedForwardNet(hidden_sizes, layer_types, "no_regularization")
NN_no_regularization.build(train_data.shape[1], num_classes=train_labels.shape[1])
In [6]:
NN_no_regularization.train(train_data, train_labels, val_data, val_labels, epochs, batch_size=batch_size)
Epoch 25/250
Epoch 50/250
Epoch 75/250
Epoch 100/250
Epoch 125/250
Epoch 150/250
Epoch 175/250
Epoch 200/250
Epoch 225/250
Epoch 250/250
Plot the training and validation losses over the epochs. What do you notice?
In [ ]:
plt.figure(figsize=(10,5))
plt.plot(NN_no_regularization.hist['train_loss'][5::], label="Training")
plt.plot(NN_no_regularization.hist['val_loss'][5::], label="Validation")
plt.xlabel("Epoch", fontsize=20)
plt.ylabel("Loss", fontsize=20)
plt.legend()
plt.show()
Now plot the training and validation accuracies over the epochs.
In [ ]:
plt.figure(figsize=(10,5))
plt.plot(NN_no_regularization.hist['train_accuracy'])
plt.plot(NN_no_regularization.hist['val_accuracy'])
plt.xlabel("Epoch", fontsize=20)
plt.ylabel("Accuracy", fontsize=20)
plt.show()
ℓ2 regularization
Afterwards, we use ℓ2 regularization, and we investigate the training and validation loss. Set the regularization parameter to 0.01.
In [9]:
NN_L2_regularization = FeedForwardNet(hidden_sizes, layer_types, "L2_regularization", l2_reg=1e-2)
NN_L2_regularization.build(train_data.shape[1], num_classes=train_labels.shape[1])
In [10]:
NN_L2_regularization.train(train_data, train_labels, val_data, val_labels, epochs, batch_size=batch_size)
Epoch 25/250
Epoch 50/250
Epoch 75/250
Epoch 100/250
Epoch 125/250
Epoch 150/250
Epoch 175/250
Epoch 200/250
Epoch 225/250
Epoch 250/250
Dropout
Finally, we train a model using dropout. Use a dropout rate of 0.5.
In [11]:
NN_dropout_regularization = FeedForwardNet(hidden_sizes, layer_types, "dropout_regularization")
NN_dropout_regularization.build(train_data.shape[1], num_classes=train_labels.shape[1])
In [12]:
NN_dropout_regularization.train(train_data, train_labels, val_data, val_labels, epochs, batch_size=batch_size, dropout=0.5)
Epoch 25/250
Epoch 50/250
Epoch 75/250
Epoch 100/250
Epoch 125/250
Epoch 150/250
Epoch 175/250
Epoch 200/250
Epoch 225/250
Epoch 250/250
Comparing the models
Now, compare the final training and validation accuracies achieved by the different models.
In [13]:
train_acc_noreg = NN_no_regularization.hist['train_accuracy'][-1]
val_acc_noreg = NN_no_regularization.hist['val_accuracy'][-1]
train_acc_L2reg = NN_L2_regularization.hist['train_accuracy'][-1]
val_acc_L2reg = NN_L2_regularization.hist['val_accuracy'][-1]
train_acc_dropoutreg = NN_dropout_regularization.hist['train_accuracy'][-1]
val_acc_dropoutreg = NN_dropout_regularization.hist['val_accuracy'][-1]
In [2]:
print(f"Training accuracy without regularization: {train_acc_noreg:.3f}") print(f"Validation accuracy without regularization: {val_acc_noreg:.3f}") print() print(f"Training accuracy with L2 regularization: {train_acc_L2reg:.3f}") print(f"Validation accuracy with L2 regularization: {val_acc_L2reg:.3f}") print() print(f"Training accuracy with dropout regularization: {train_acc_dropoutreg:.3f}") print(f"Validation accuracy with dropout regularization: {val_acc_dropoutreg:.3f}")
Training accuracy without regularization: XXX
Validation accuracy without regularization: XXX
Training accuracy with L2 regularization: XXX
Validation accuracy with L2 regularization: XXX
Training accuracy with dropout regularization: XXX
Validation accuracy with dropout regularization: XXX
Plot the losses and accuracies of the models in one plot to compare them.
In [ ]:
plt.figure(figsize=(10,5)) plt.plot(NN_no_regularization.hist['train_loss'][5::], label="Training (no regularization)", color="darkgreen") plt.plot(NN_no_regularization.hist['val_loss'][5::], label="Validation (no regularization)", color="darkgreen", linestyle="--") plt.plot(NN_L2_regularization.hist['train_loss'][5::], label="Training (L2 regularization)", color="royalblue") plt.plot(NN_L2_regularization.hist['val_loss'][5::], label="Validation (L2 regularization)", color="royalblue", linestyle="--") plt.plot(NN_dropout_regularization.hist['train_loss'][5::], label="Training (dropout regularization)", color="purple") plt.plot(NN_dropout_regularization.hist['val_loss'][5::], label="Validation (dropout regularization)", color="purple", linestyle="--") plt.xlabel("Epoch", fontsize=20) plt.ylabel("Loss", fontsize=20) plt.legend() plt.show()
In [ ]:
plt.figure(figsize=(10,5))
plt.plot(NN_no_regularization.hist['train_accuracy'], label="Training (no regularization)", color="darkgreen")
plt.plot(NN_no_regularization.hist['val_accuracy'], label="Validation (no regularization)", color="darkgreen", linestyle="--")
plt.plot(NN_L2_regularization.hist['train_accuracy'], label="Training (L2 regularization)", color="royalblue")
plt.plot(NN_L2_regularization.hist['val_accuracy'], label="Validation (L2 regularization)", color="royalblue", linestyle="--")
plt.plot(NN_dropout_regularization.hist['train_accuracy'], label="Training (dropout regularization)", color="purple")
plt.plot(NN_dropout_regularization.hist['val_accuracy'], label="Validation (dropout regularization)", color="purple", linestyle="--")
plt.xlabel("Epoch", fontsize=20)
plt.ylabel("Accuracy", fontsize=20)
plt.legend()
plt.show()
Challenge
Notice that we also have the variable test_data. Get creative and build a model yourself!
Submit your predictions of the test data to Moodle to enter the leaderboard. We will announce the winning team after the Christmas break!
You can output the predictions of the test data using:
In [ ]:
test_preds = YOUR_NN_MODEL.logits.eval({YOUR_NN_MODEL.X: test_data}, session=YOUR_NN_MODEL.session).argmax(1)
We will only consider submissions as plain text files with exactly the same formatting as our sample_submission.txt.
In [53]:
string = "" with open("sample_submission.txt", "r") as f: string = f.read() print(string[:19]) print("...")
5
3
0
0
4
5
1
2
4
3
...
You can use the following command to save your predictions:
In [ ]:
np.savetxt("your_submission.txt", test_preds, fmt='%i')
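As an optional final check (a small sketch using the file name from the command above), you can verify that the saved file contains exactly one integer label per test sample:
# Reload the submission and compare its length to the number of test samples.
check = np.loadtxt("your_submission.txt", dtype=int)
assert check.shape[0] == test_data.shape[0]
print(check[:10])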