Introduction To Deep Learning
Assignment 0
September 2023
Note: This assignment is not going to be graded. However, you do need to form groups of three people
and submit your work on Brightspace.
Your submission should consist of a report in PDF format (at most 3 pages) and the neatly organized code
(Jupyter notebooks) that you used in your experiments, so that we can reproduce your results. You can find
more information about report writing in the Brightspace → Assignments section. Do not compress the
files (.zip/.7z/.rar); submit every file individually.
This assignment is mainly intended to get you accustomed to the requirements of this course and to Brightspace.
If your report has any clear issues, we will provide feedback so that you are better prepared for the graded
assignments.
Introduction
The objective of this assignment is to develop and evaluate several algorithms for classifying images of
handwritten digits, and to design your own neural network from scratch for the XOR problem. You will
work with a simplified version of the famous MNIST data set: a collection of 2707 digits, each a 16x16 image
represented as a 256-dimensional vector. The data is split into a training set (1707 images) and a test set
(1000 images). These data sets are stored in 4 files: train_in.csv, train_out.csv, test_in.csv,
test_out.csv, where in and out refer to the input records (images) and the corresponding digits (class
labels), respectively. These files are stored in data.zip.
You can find more information about the original problem of handwritten digit recognition, more data
sets, and an overview of the accuracies of the best classifiers (about 99.7%!) at
http://yann.lecun.com/exdb/mnist/.
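As a starting point, here is a minimal loading sketch with NumPy; the data/ folder name, the comma delimiter, and the absence of header rows are assumptions about how you unpack data.zip, not something prescribed by the assignment:

    import numpy as np

    # Images: one 256-dimensional row per digit; labels: one integer per row.
    X_train = np.loadtxt("data/train_in.csv", delimiter=",")
    y_train = np.loadtxt("data/train_out.csv", delimiter=",").astype(int)
    X_test = np.loadtxt("data/test_in.csv", delimiter=",")
    y_test = np.loadtxt("data/test_out.csv", delimiter=",").astype(int)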
3. Use the class centers (means) computed in part 1 to implement a Nearest Mean classifier. Apply your
classifier to all points from the training set and calculate the percentage of correctly classified digits.
Do the same with the test set, using the centers that were calculated from the training set. (A minimal
sketch of this classifier is given after this list.)
4. A less naive distance-based approach is the KNN (K-Nearest-Neighbor) classifier (you can either implement
it yourself or use the one from the sklearn package). Repeat the same procedure as in part 3
using this method. Then, for both classifiers, generate a confusion matrix, which should provide deeper
insight into the classes that are difficult to separate. A confusion matrix is here a 10-by-10 matrix
(c_ij), where c_ij contains the percentage (or count) of digits i that are classified as j. Which digits are
most difficult to classify correctly? Again, for calculating and visualising confusion matrices you may
use the sklearn package. Describe your findings, and compare the performance of your classifiers on
the train and test sets. (A sketch of the KNN baseline follows the nearest mean sketch below.)
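A minimal sketch of the nearest mean classifier from part 3, assuming the NumPy arrays X_train, y_train, X_test, y_test from the loading sketch above:

    import numpy as np

    def fit_centers(X, y, n_classes=10):
        """Compute the mean image (center) of each digit class."""
        return np.stack([X[y == c].mean(axis=0) for c in range(n_classes)])

    def nearest_mean_predict(X, centers):
        """Assign each image to the class whose center is closest in Euclidean distance."""
        # dists[i, c] = distance from image i to center c
        dists = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        return dists.argmin(axis=1)

    # Usage (with the arrays from the loading sketch):
    # centers = fit_centers(X_train, y_train)
    # train_acc = (nearest_mean_predict(X_train, centers) == y_train).mean()
    # test_acc = (nearest_mean_predict(X_test, centers) == y_test).mean()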
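Similarly, a sketch of the KNN baseline and confusion matrix from part 4, using scikit-learn with the same (assumed) arrays; k = 5 is only an illustrative choice:

    from sklearn.neighbors import KNeighborsClassifier
    from sklearn.metrics import confusion_matrix, ConfusionMatrixDisplay

    # Fit KNN on the training images (k is a hyperparameter you may want to vary).
    knn = KNeighborsClassifier(n_neighbors=5)
    knn.fit(X_train, y_train)

    # Rows correspond to the true digit, columns to the predicted digit.
    pred = knn.predict(X_test)
    cm = confusion_matrix(y_test, pred)
    ConfusionMatrixDisplay(cm).plot()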
Task 2: Implement a Multi-Class Perceptron Training Algorithm
Implement (from scratch) a multi-class perceptron training algorithm ("perceptron learning rule" from slide
34 of the second lecture) and use it to train a single-layer perceptron with 10 nodes (one per digit), each node
having 256+1 inputs (the 256 pixel values plus a bias) and 1 output. Train your network on the train set and
evaluate it on both the train and the test set, in the same way as you did in the previous task. As your algorithm
is non-deterministic (results depend on how you initialize the weights), repeat your experiments a few times
to get a feeling for the reliability of your accuracy estimates.
Try to make your code efficient. In particular, try to limit the number of loops by using matrix multiplication
whenever possible. For example, append to your train and test data a column of ones that will represent
the bias. The weights of your network can then be stored in a matrix W of size 257x10, and the output of the
network on all inputs is just the dot product of the two matrices Train and W, where Train denotes the matrix
of all input vectors (one per row), augmented with 1's (biases). To find the output node with the strongest
activation, use the numpy argmax() function. An efficient implementation of your algorithm shouldn't take
more than a few seconds to converge on the training set (yes, the training set consists of patterns that are
linearly separable, so the perceptron algorithm will converge). A minimal sketch is given below.
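One possible implementation sketch, assuming the same NumPy arrays as before with integer labels; the initialization scale, maximum number of epochs, and stopping criterion are illustrative assumptions, not prescribed by the assignment:

    import numpy as np

    rng = np.random.default_rng()

    def train_perceptron(X, y, n_classes=10, max_epochs=100):
        """Multi-class perceptron: one weight column per digit, bias as an extra input."""
        Xb = np.hstack([X, np.ones((X.shape[0], 1))])   # append bias column -> shape (N, 257)
        W = rng.normal(scale=0.01, size=(Xb.shape[1], n_classes))  # random initialization
        for epoch in range(max_epochs):
            errors = 0
            for xi, yi in zip(Xb, y):
                pred = np.argmax(xi @ W)                # strongest activation wins
                if pred != yi:
                    W[:, yi] += xi                      # push the correct class up
                    W[:, pred] -= xi                    # push the wrongly predicted class down
                    errors += 1
            if errors == 0:                             # reached because the data are linearly separable
                break
        return W

    def perceptron_predict(X, W):
        Xb = np.hstack([X, np.ones((X.shape[0], 1))])
        return np.argmax(Xb @ W, axis=1)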
How does the accuracy of this single-layer multi-class perceptron compare to that of the distance-based
methods in Task 1?
Task 3: Implement the XOR network and the Gradient Descent Algorithm
This is probably the last time in your life that you are asked to implement a neural network from scratch –
therefore, have fun! Proceed as follows:
1. Implement the function xor_net(inputs, weights) that simulates a network with two inputs, two hidden
nodes and one output node. The vector weights denotes the 9 weights (tunable parameters): each non-input
node has three incoming weights, one from the bias node (which has constant value 1) and two
from the nodes of the preceding layer (the two input nodes for a hidden node, the two hidden nodes
for the output node). Assume that all non-input nodes use the sigmoid activation function.
2. Implement the error function mse(weights), which returns the mean squared error made by your network on the 4
possible input vectors (0, 0), (0, 1), (1, 0), (1, 1) and the corresponding targets 0, 1, 1, 0.
3. Implement the gradient of the mse(weights) function, grdmse(weights). Note that the vector of values
returned by grdmse(weights) should have the same length as the input vector weights: it should be the
vector of partial derivatives of the mse function with respect to each element of the weights vector.
4. Finally, implement the gradient descent algorithm:
(a) Initialize weights to some random values,
(b) Iterate: weights = weights − η · grdmse(weights),
where η is a small positive constant (called the "step size" or "learning rate"). A combined sketch of
these four steps is given after this list.
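A combined sketch of the four steps above, as one possible NumPy implementation; the gradient is approximated here with finite differences as a stand-in (deriving the analytic gradient is part of the exercise), and the learning rate and number of iterations are arbitrary example values:

    import numpy as np

    rng = np.random.default_rng()

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def xor_net(inputs, weights):
        """2-2-1 network; weights has 9 entries: 3 per hidden node, 3 for the output node."""
        x = np.append(inputs, 1.0)                  # two inputs plus the bias
        h1 = sigmoid(x @ weights[0:3])
        h2 = sigmoid(x @ weights[3:6])
        h = np.array([h1, h2, 1.0])                 # hidden activations plus the bias
        return sigmoid(h @ weights[6:9])

    XOR_INPUTS = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
    XOR_TARGETS = np.array([0, 1, 1, 0])

    def mse(weights):
        preds = np.array([xor_net(x, weights) for x in XOR_INPUTS])
        return np.mean((preds - XOR_TARGETS) ** 2)

    def grdmse(weights, eps=1e-6):
        """Finite-difference approximation of the gradient (the analytic version is the exercise)."""
        grad = np.zeros_like(weights)
        for i in range(len(weights)):
            w_plus, w_minus = weights.copy(), weights.copy()
            w_plus[i] += eps
            w_minus[i] -= eps
            grad[i] = (mse(w_plus) - mse(w_minus)) / (2 * eps)
        return grad

    # Gradient descent loop (eta and the number of iterations are arbitrary examples).
    weights = rng.normal(size=9)
    eta = 1.0
    for step in range(10000):
        weights = weights - eta * grdmse(weights)

    misclassified = sum((xor_net(x, weights) > 0.5) != t for x, t in zip(XOR_INPUTS, XOR_TARGETS))
    print(mse(weights), misclassified)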
Use your program to train the network on the XOR data. During training, monitor two values: the
MSE obtained by your network on the training set, and the number of misclassified inputs. (The network
returns a value between 0 and 1; we may agree that values bigger than 0.5 are interpreted as "1", otherwise
as "0".) Run your program several times, using various initialization strategies and values of the learning
rate. Additionally, try the "lazy approach": just keep generating random weights for the network, testing whether it
computes the XOR function, and stop as soon as you have found such weights. To get an idea of how many
sets of weights have to be tried before finding a good one, repeat this experiment several times. Describe
your work and findings in the report. (A small sketch of the lazy approach is given below.)
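A small sketch of the lazy approach, reusing xor_net, XOR_INPUTS, XOR_TARGETS and rng from the sketch above; the sampling range for the random weights is an arbitrary assumption:

    def computes_xor(weights):
        """Check whether the network classifies all four XOR patterns correctly (threshold 0.5)."""
        return all((xor_net(x, weights) > 0.5) == bool(t) for x, t in zip(XOR_INPUTS, XOR_TARGETS))

    tries = 0
    while True:
        tries += 1
        candidate = rng.uniform(-10, 10, size=9)    # arbitrary sampling range
        if computes_xor(candidate):
            break
    print(f"found a working set of weights after {tries} tries")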
You may also experiment with alternative activation functions, e.g., the hyperbolic tangent (tanh) or a linear
rectifier, relu(x) = max(0, x). How do they affect the training process of your network, and how would you
explain these differences?
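If you try these, the alternative activations are one-liners in NumPy, intended as drop-in replacements for sigmoid inside xor_net:

    import numpy as np

    def tanh(z):
        """Hyperbolic tangent activation; note its output range is (-1, 1)."""
        return np.tanh(z)

    def relu(z):
        """Linear rectifier: max(0, z), applied element-wise."""
        return np.maximum(0.0, z)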