Implementing a Neural Network from Scratch
Neural networks are powerful algorithms for classification tasks.
Dataset: Iris dataset
Link: http://scikit-learn.org/stable/auto_examples/datasets/plot_iris_dataset.html
Import required libraries
In [ ]: from sklearn import datasets #for dataset
import numpy as np #for maths
import matplotlib.pyplot as plt #for plotting
Get Dataset
In [ ]: iris = datasets.load_iris() #load the dataset
data = iris.data #get features
target = iris.target #get labels
shape = data.shape #shape of data
#convert into numpy array
data = np.array(data).reshape(shape[0],shape[1])
target = np.array(target).reshape(shape[0],1)
#print shape
print("Data Shape = {}".format(data.shape))
print("Target Shape = {}".format(target.shape))
print('Classes : {}'.format(np.unique(target)))
print('Sample data : {} , Target = {}'.format(data[70],target[70]))
Define Parameters and Hyperparameters
A neural network with one hidden layer:
Input Units = 4
Hidden Units = 8
Output Units = 3
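With these sizes the network has 4*8 = 32 hidden-layer weights plus 8 biases, and 8*3 = 24 output-layer weights plus 3 biases, i.e. 67 trainable values in total.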
In [ ]: #HYPERPARAMETERS
#num of target labels
num_classes = len(np.unique(target))
#define layer_neurons
input_units = 4 #neurons in input layer
hidden_units = 8 #neurons in hidden layer
output_units = 3 #neurons in output layer
#define hyper-parameters
learning_rate = 0.03
#regularization parameter
beta = 0.00001
#num of iterations
iters = 4001
Dimensions of Parameters
Shape of layer1_weights (Wxh) = (4,8)
Shape of layer1_biases (Bh) = (8,1)
Shape of layer2_weights (Why) = (8,3)
Shape of layer2_biases (By) = (3,1)
In [ ]: #PARAMETERS
#initialize parameters, i.e. the weights and biases
def initialize_parameters():
    #initial weights should have zero mean and a small (0.03) standard deviation
mean = 0 #mean of parameters
std = 0.03 #standard deviation
    layer1_weights = np.random.normal(mean,std,(input_units,hidden_units))
layer1_biases = np.ones((hidden_units,1))
    layer2_weights = np.random.normal(mean,std,(hidden_units,output_units))
layer2_biases = np.ones((output_units,1))
parameters = dict()
parameters['layer1_weights'] = layer1_weights
parameters['layer1_biases'] = layer1_biases
parameters['layer2_weights'] = layer2_weights
parameters['layer2_biases'] = layer2_biases
return parameters
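As a quick sanity check (a sketch, not part of the original notebook), the returned shapes can be printed and compared against the table above:
In [ ]: #print the shape of every parameter; should match the table above
params = initialize_parameters()
for name,value in params.items():
    print(name,value.shape)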
Activation Function
Sigmoid
[Figure: sigmoid curve rising from 0 to 1 over the range −6 to 6]
In [ ]: #activation function
def sigmoid(X):
return 1/(1+np.exp((-1)*X))
#softmax function for output
def softmax(X):
exp_X = np.exp(X)
exp_X_sum = np.sum(exp_X,axis=1).reshape(-1,1)
exp_X = (exp_X/exp_X_sum)
return exp_X
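One caveat worth noting: np.exp can overflow for large logits. That is not an issue at this scale, but a numerically stable variant (a sketch, not used in the original notebook) subtracts the row-wise maximum before exponentiating, which leaves the softmax output unchanged:
In [ ]: #numerically stable softmax (hypothetical alternative)
def softmax_stable(X):
    shifted = X - np.max(X,axis=1).reshape(-1,1) #subtract row max; softmax is shift-invariant
    exp_X = np.exp(shifted)
    return exp_X/np.sum(exp_X,axis=1).reshape(-1,1)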
Define Utility Functions
1. Forward Propagation
---- logits1 = matmul(X,Wxh) + Bh
---- A = sigmoid(logits1)
---- logits2 = matmul(A,Why) + By
---- output = softmax(logits2)
Store output and A in a cache so they can be reused during backward propagation
2. Backward Propagation
---- error_output = output - train_labels
---- error_activation = matmul(error_output,Why.T) * A * (1-A), where A(1-A) is the sigmoid derivative
---- dWhy = matmul(A.T,error_output)/m
---- dWxh = matmul(train_dataset.T,error_activation)/m
m = len(train_dataset)
Store derivatives in derivatives dict
3. Update Parameters
---- Wxh = Wxh - learning_rate * (dWxh + beta * Wxh)
---- Why = Why - learning_rate * (dWhy + beta * Why)
4. Calculate Loss and Accuracy
---- Loss = (-1 * sum(Y * log(predictions) + (1-Y) * log(1-predictions)) + beta * (sum(Wxh^2) + sum(Why^2)))/m
---- Accuracy = sum(argmax(Y) == argmax(predictions))/m
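The compact form error_output = output - train_labels in step 2 is a standard result, not an approximation: for a softmax output with one-hot targets, differentiating the cross-entropy term -sum(Y * log(output)) with respect to the output logits gives exactly output - Y, so no separate softmax derivative is needed in the backward pass.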
In [ ]: #forward propagation
def forward_propagation(train_dataset,parameters):
    cache = dict() #to store the intermediate values for backward propagation
m = len(train_dataset) #number of training examples
#get the parameters
layer1_weights = parameters['layer1_weights']
layer1_biases = parameters['layer1_biases']
layer2_weights = parameters['layer2_weights']
layer2_biases = parameters['layer2_biases']
#forward prop
    logits = np.matmul(train_dataset,layer1_weights) + layer1_biases.T #(m,8) + (1,8) broadcasts across rows
    activation1 = np.array(sigmoid(logits)).reshape(m,hidden_units)
    activation2 = np.array(np.matmul(activation1,layer2_weights) + layer2_biases.T).reshape(m,output_units)
output = np.array(softmax(activation2)).reshape(m,num_classes)
#fill in the cache
cache['output'] = output
cache['activation1'] = activation1
return cache,output
#backward propagation
def backward_propagation(train_dataset,train_labels,parameters,cache):
derivatives = dict() #to store the derivatives
#get stuff from cache
output = cache['output']
activation1 = cache['activation1']
#get parameters
layer1_weights = parameters['layer1_weights']
layer2_weights = parameters['layer2_weights']
#calculate errors
error_output = output - train_labels
error_activation1 = np.matmul(error_output,layer2_weights.T)
error_activation1 = np.multiply(error_activation1,activation1)
error_activation1 = np.multiply(error_activation1,1-activation1)
#calculate partial derivatives
    partial_derivatives2 = np.matmul(activation1.T,error_output)/len(train_dataset)
    partial_derivatives1 = np.matmul(train_dataset.T,error_activation1)/len(train_dataset)
#store the derivatives
derivatives['partial_derivatives1'] = partial_derivatives1
derivatives['partial_derivatives2'] = partial_derivatives2
return derivatives
#update the parameters
def update_parameters(derivatives,parameters):
#get the parameters
layer1_weights = parameters['layer1_weights']
layer2_weights = parameters['layer2_weights']
#get the derivatives
partial_derivatives1 = derivatives['partial_derivatives1']
partial_derivatives2 = derivatives['partial_derivatives2']
    #update the weights (note: the biases are not updated in this implementation)
    layer1_weights -= (learning_rate*(partial_derivatives1 + beta*layer1_weights))
    layer2_weights -= (learning_rate*(partial_derivatives2 + beta*layer2_weights))
#update the dict
parameters['layer1_weights'] = layer1_weights
parameters['layer2_weights'] = layer2_weights
return parameters
#calculate the loss and accuracy
def cal_loss_accuracy(train_labels,predictions,parameters):
#get the parameters
layer1_weights = parameters['layer1_weights']
layer2_weights = parameters['layer2_weights']
#cal loss and accuracy
    loss = (-1*np.sum(np.multiply(np.log(predictions),train_labels) +
                      np.multiply(np.log(1-predictions),1-train_labels)) +
            beta*(np.sum(np.square(layer1_weights)) + np.sum(np.square(layer2_weights))))/len(train_labels)
    accuracy = np.sum(np.argmax(train_labels,axis=1)==np.argmax(predictions,axis=1))
    accuracy /= len(train_labels) #use the labels passed in, not a global
return loss,accuracy
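With the utilities in place, a quick smoke test (hypothetical, not in the original notebook) can confirm that one forward/backward pass on random data yields arrays of the expected shapes:
In [ ]: #smoke test: one pass on random data, checking shapes only
X_demo = np.random.normal(size=(5,input_units)) #5 fake samples
Y_demo = np.eye(output_units)[np.random.randint(0,output_units,5)] #fake one-hot labels
demo_params = initialize_parameters()
demo_cache,demo_output = forward_propagation(X_demo,demo_params)
demo_derivatives = backward_propagation(X_demo,Y_demo,demo_params,demo_cache)
print(demo_output.shape) #expect (5, 3)
print(demo_derivatives['partial_derivatives1'].shape) #expect (4, 8)
print(demo_derivatives['partial_derivatives2'].shape) #expect (8, 3)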
Train Function
1. Initialize Parameters
2. Forward Propagation
3. Backward Propagation
4. Calculate Loss and Accuracy
5. Update the parameters
Repeat the steps 2-5 for the given number of iterations
In [ ]: #implementation of the 3-layer (input, hidden, output) neural network
#training function
def train(train_dataset,train_labels,iters=2):
#To store loss after every iteration.
J = []
#WEIGHTS
global layer1_weights
global layer1_biases
global layer2_weights
global layer2_biases
#initialize the parameters
parameters = initialize_parameters()
layer1_weights = parameters['layer1_weights']
layer1_biases = parameters['layer1_biases']
layer2_weights = parameters['layer2_weights']
layer2_biases = parameters['layer2_biases']
    #to store final predictions after training
final_output = []
for j in range(iters):
#forward propagation
cache,output = forward_propagation(train_dataset,parameters)
#backward propagation
        derivatives = backward_propagation(train_dataset,train_labels,parameters,cache)
#calculate the loss and accuracy
        loss,accuracy = cal_loss_accuracy(train_labels,output,parameters)
#update the parameters
parameters = update_parameters(derivatives,parameters)
#append loss
J.append(loss)
#update final output
final_output = output
#print accuracy and loss
if(j%500==0):
print("Step %d"%j)
print("Loss %f"%loss)
print("Accuracy %f%%"%(accuracy*100))
return J,final_output
In [ ]: #shuffle the dataset
z = list(zip(data,target))
np.random.shuffle(z)
data,target = zip(*z)
#make train_dataset and train_labels
train_dataset = np.array(data).reshape(-1,4)
train_labels = np.zeros([train_dataset.shape[0],num_classes])
#one-hot encoding
for i,label in enumerate(target):
train_labels[i,label] = 1
#normalize each feature to zero mean and unit variance
for i in range(input_units):
mean = train_dataset[:,i].mean()
std = train_dataset[:,i].std()
train_dataset[:,i] = (train_dataset[:,i]-mean)/std
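To verify the normalization (a quick check, not in the original), each feature column should now have mean close to 0 and standard deviation close to 1:
In [ ]: #verify the normalization
print(train_dataset.mean(axis=0).round(3)) #expect values near 0
print(train_dataset.std(axis=0).round(3)) #expect values near 1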
In [ ]: #train data
J,final_output = train(train_dataset,train_labels,iters=4001)
The network reaches an accuracy of about 97% on the training data.
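That figure can be recomputed directly from the stored outputs (a sketch, assuming the train() call above has run):
In [ ]: #recompute the final training accuracy from final_output
predicted = np.argmax(final_output,axis=1)
actual = np.argmax(train_labels,axis=1)
print("Training accuracy : {:.1f}%".format(100*np.mean(predicted==actual)))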
Plot the loss vs iteration graph
In [ ]: #plot loss graph
plt.plot(list(range(1,len(J))),J[1:])
plt.xlabel('Iterations')
plt.ylabel('Loss')
plt.title('Iterations VS Loss')
plt.show()