Unit-II
Artificial Neural Networks
Introduction
Why Artificial Neural Networks?
There are two basic reasons why we are interested in
building artificial neural networks (ANNs):
• Technical viewpoint: Some problems such as
character recognition or the prediction of future
states of a system require massively parallel and
adaptive processing.
• Biological viewpoint: ANNs can be used to
replicate and simulate components of the human
(or animal) brain, thereby giving us insight into
natural information processing.
• Artificial Neural Networks (ANNs) are algorithms inspired by the functioning of the brain and are
used to model complex patterns and solve prediction problems.
• The development of ANN was the result of an attempt to replicate the workings of
the human brain. The workings of ANN are extremely similar to those of
biological neural networks, although they are not identical.
• An artificial neural network consists of a pool of simple processing units which
communicate by sending signals to each other over a large number of weighted
connections.
• Artificial Neural Networks are a robust method for approximating real-valued,
discrete-valued, and vector-valued target functions.
• They are most effective when used against real-world sensor data.
• They stem from a biological metaphor.
What is Artificial Neural Network?
• An Artificial Neural Network (ANN) is a computational model inspired by the
human brain’s neural structure. It consists of interconnected nodes (neurons)
organized into layers. Information flows through these nodes, and the network
adjusts the connection strengths (weights) during training to learn from data,
enabling it to recognize patterns, make predictions, and solve various tasks in
machine learning and artificial intelligence.
Artificial Neural Networks
• The “building blocks” of neural networks are the
neurons.
• In technical systems, we also refer to them as units or nodes.
• Basically, each neuron
– receives input from many other neurons,
– changes its internal state (activation) based on the current input, and
– sends one output signal to many other neurons, possibly including its input neurons (recurrent network).
How do ANNs work?
An artificial neural network (ANN) is either a hardware
implementation or a computer program which strives to
simulate the information processing capabilities of its biological
exemplar. ANNs are typically composed of a great number of
interconnected artificial neurons. The artificial neurons are
simplified models of their biological counterparts.
ANN is a technique for solving problems by constructing software
that works like our brains.
How do our brains work?
The brain is a massively parallel information processing system.
• It has about 10¹¹ neurons, each connected to about 10⁴ other neurons.
• Each neuron has inputs which are called dendrites and one output
which is called the axon.
• The switching time is about 0.001 second. It takes around 0.1
second for the brain to recognize an image, which implies about
100 inference steps.
• So, the brain must do some parallel processing on highly
distributed data.
How do our brains work?
• A neuron is connected to other
neurons through about 10,000
synapses
• A neuron receives input from other
neurons. Inputs are combined.
• Once input exceeds a critical level,
the neuron discharges a spike ‐ an
electrical pulse that travels from the
body, down the axon, to the next
neuron(s)
• The axon endings almost touch the
dendrites or cell body of the next
neuron.
• Transmission of an electrical signal from one neuron to the next is
effected by neurotransmitters
• Neurotransmitters are chemicals which are released from the first
neuron and which bind to the second neuron.
• This link is called a synapse. The strength of the signal that reaches the
next neuron depends on factors such as the amount of neurotransmitter
available
• Dendrites: Input
• Cell body: Processor
• Synapse: Link
• Axon: Output
How do ANNs work?
An artificial neuron is an imitation of a human neuron
How do ANNs work?
• Now, let us have a look at the model of an artificial neuron.
How do ANNs work?
[Figure: model of a simple artificial neuron. Inputs x1, x2, …, xm feed a summing (processing) unit ∑, which produces the output y.]
y = x1 + x2 + … + xm
How do ANNs work?
Not all inputs are equal
[Figure: the same neuron with weights. Each input xi is first multiplied by its weight wi before the summation.]
y = x1w1 + x2w2 + … + xmwm
How do ANNs work?
The signal is not passed down to the next neuron verbatim.
[Figure: the weighted sum is passed through a transfer function f(vk), also called the activation function, before it becomes the output.]
vk = x1w1 + x2w2 + … + xmwm
y = f(vk)
The output is thus a function of the inputs, the weights, and the transfer (activation) function.
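This computation can be written directly in code. Below is a minimal Python sketch of a single artificial neuron; the step transfer function, the function names, and the example values are illustrative assumptions, not part of the original notes.

# A minimal artificial neuron: weighted sum of inputs passed through a transfer function.
def step(v, threshold=0.0):
    """Step transfer (activation) function: fire (1) if v exceeds the threshold, else 0."""
    return 1 if v > threshold else 0

def neuron_output(inputs, weights, transfer=step):
    """Compute y = f(x1*w1 + x2*w2 + ... + xm*wm)."""
    v = sum(x * w for x, w in zip(inputs, weights))  # weighted sum (processing step)
    return transfer(v)                               # transfer/activation step

# Example: three inputs with different weights
print(neuron_output([1, 0, 1], [0.4, 0.9, 0.2]))  # weighted sum = 0.6 -> output 1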
APPROPRIATE PROBLEMS FOR NEURAL NETWORK LEARNING
• Instances are represented by many attribute-value pairs. These input attributes may be highly
correlated or independent of one another. Input values can be any real values.
• The target function output may be discrete-valued, real-valued, or a vector of several real- or
discrete-valued attributes.
• The training examples may contain errors. ANN learning methods are quite robust to noise in the
training data.
• Long training times are acceptable. Network training algorithms typically require longer training
times than, say, decision tree learning algorithms.
• Fast evaluation of the learned target function may be required. Although ANN learning times are
relatively long, evaluating the learned network, in order to apply it to a subsequent instance, is
typically very fast.
• The ability of humans to understand the learned target function is not important. The weights
learned by neural networks are often difficult for humans to interpret. Learned neural networks are
less easily communicated to humans than learned rules.
PERCEPTRON
• A Perceptron is an Artificial Neuron. It is the simplest possible Neural Network.
• It was introduced by Frank Rosenblatt in 1957.
• The brain cells (neurons) receive input from our senses as electrical signals. The neurons, in turn,
use electrical signals to store information and to make decisions based on previous input.
Rosenblatt's idea was that the Perceptron could simulate these brain principles, with the ability to
learn and make decisions.
• It is the simplest type of feedforward neural network, consisting of a single layer of input nodes
that are fully connected to a layer of output nodes. The original Perceptron was designed to take a
number of binary inputs, and produce one binary output (0 or 1).
• The idea was to use different weights to represent the importance of each input, and that the sum of
the values should be greater than a threshold value before making a decision like yes or no (true or
false) (0 or 1).
Definition
• A perceptron takes a vector of real-valued inputs, calculates a linear combination
of these, and outputs 1 if the result is greater than some threshold and -1 otherwise.
Perceptron decision function
• A decision function φ(z) of the Perceptron is defined to take a linear combination of the input vector x and the weight vector w.
• The value z in the decision function is given by:
z = w1x1 + w2x2 + … + wmxm = wᵀx
• The decision function is +1 if z is greater than a threshold θ, and it is -1 otherwise:
φ(z) = +1 if z ≥ θ, and -1 otherwise.
Bias Unit
• For simplicity, the threshold θ can be brought to the left-hand side and represented as w0x0, where w0 = -θ and x0 = 1.
• The value w0 is called the bias unit.
• The decision function then becomes:
φ(z) = +1 if z ≥ 0, and -1 otherwise, where z = w0x0 + w1x1 + … + wmxm = wᵀx.
Output
• The figure shows how the decision function squashes wᵀx to either +1 or -1, and how it can be used to discriminate between two linearly separable classes.
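As a concrete illustration, here is a small Python sketch of the decision function with a bias unit; the function name and the example weights are illustrative assumptions.

# Perceptron decision function with a bias unit w0 (x0 = 1 is implicit).
def decision(x, w, w0):
    """Return +1 if w0 + w.x >= 0, else -1."""
    z = w0 + sum(wi * xi for wi, xi in zip(w, x))  # z = w0*x0 + w1*x1 + ... + wm*xm
    return 1 if z >= 0 else -1

# Example: with w0 = -0.5 the neuron fires only when the weighted inputs reach 0.5
print(decision([1, 1], [0.3, 0.3], w0=-0.5))  # z = 0.1  -> +1
print(decision([1, 0], [0.3, 0.3], w0=-0.5))  # z = -0.2 -> -1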
Types of Perceptron
• Single-Layer Perceptron: This type of perceptron is limited to learning linearly separable patterns; it is
effective for tasks where the data can be divided into distinct categories by a straight line.
• Multilayer Perceptron: Multilayer perceptrons possess enhanced processing capabilities as they
consist of two or more layers, adept at handling more complex patterns and relationships within the
data.
Basic Components of Perceptron
• Input Features: The perceptron takes multiple input features, each input feature represents a characteristic or
attribute of the input data.
• Weights: Each input feature is associated with a weight, determining the significance of each input feature in
influencing the perceptron’s output. During training, these weights are adjusted to learn the optimal values.
• Summation Function: The perceptron calculates the weighted sum of its inputs using the summation function.
The summation function combines the inputs with their respective weights to produce a weighted sum.
• Activation Function: The weighted sum is then passed through an activation function. The perceptron uses the
Heaviside step function, which takes the summed value as input, compares it with the threshold, and produces an
output of 0 or 1.
• Output: The final output of the perceptron is determined by the activation function’s result. For example, in
binary classification problems, the output might represent a predicted class (0 or 1).
• Bias: A bias term is often included in the perceptron model. The bias allows the model to make adjustments
that are independent of the input. It is an additional parameter that is learned during training.
• Learning Algorithm (Weight Update Rule): During training, the perceptron learns by adjusting its weights and
bias based on a learning algorithm. A common approach is the perceptron learning algorithm, which updates
weights based on the difference between the predicted output and the true output.
Activation function
• The activation function applies a step rule (converting the numerical output into +1 or -1) to check whether the output of the weighting function is greater than zero.
Example
• For example:
If ∑wixi> 0 => then final output “o” = 1 (issue bank loan)
Else, final output “o” = -1 (deny bank loan)
• The step function gets triggered above a certain value of the
neuron output; otherwise it outputs zero. The sign function outputs
+1 or -1 depending on whether the neuron output is greater
than zero. The sigmoid is an S-shaped curve that outputs a
value between 0 and 1.
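For reference, the three activation functions mentioned above can be sketched in Python as follows; the threshold value and function names are illustrative assumptions.

import math

def step(v, threshold=0.0):
    """Step function: 1 above the threshold, 0 otherwise."""
    return 1 if v > threshold else 0

def sign(v):
    """Sign function: +1 if the neuron output is greater than zero, -1 otherwise."""
    return 1 if v > 0 else -1

def sigmoid(v):
    """Sigmoid (S-curve): squashes any real value into the range (0, 1)."""
    return 1.0 / (1.0 + math.exp(-v))

print(step(0.7), sign(-0.2), round(sigmoid(0.0), 2))  # -> 1 -1 0.5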
How does Perceptron work?
1. All the inputs x are multiplied with their weights w.
2. Add all the multiplied values and call them Weighted Sum.
3. Apply the activation function to that weighted sum.
Perceptron training rule:
• 1. Evaluate the network according to the equation: Σ (i = 0 … n) wi·xi + b.
• 2. If the result of step 1 is greater than zero, output O = 1; if it is less than zero, O = 0.
• 3. If the current output O is already equal to the desired output t, repeat step 1 with a
different set of inputs. If the current output is different from the desired output, proceed
to step 4.
• 4. Adjust the current weights according to:
wi ← wi + Δwi, where Δwi = η (t - o) xi
• Here t is the target output for the current training example, o is the output generated by
the perceptron, and η is a positive constant called the learning rate. The role of the
learning rate is to moderate the degree to which weights are changed at each step. It is
usually set to some small value (e.g., 0.1) and is sometimes made to decay as the number
of weight-tuning iterations increases.
• 5. Repeat the algorithm from step 1 until O = t for every vector pair.
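A minimal Python sketch of this training rule follows; the dataset, learning rate, epoch limit, and function names are illustrative assumptions, while the update Δwi = η(t - o)xi matches the rule above.

def train_perceptron(examples, n_inputs, eta=0.1, epochs=100):
    """Perceptron training rule: w_i <- w_i + eta * (t - o) * x_i, and likewise for the bias."""
    w = [0.0] * n_inputs
    b = 0.0
    for _ in range(epochs):
        converged = True
        for x, t in examples:                       # t is the desired (target) output
            z = sum(wi * xi for wi, xi in zip(w, x)) + b
            o = 1 if z > 0 else 0                   # current output O
            if o != t:                              # adjust weights only on a mistake
                converged = False
                w = [wi + eta * (t - o) * xi for wi, xi in zip(w, x)]
                b = b + eta * (t - o)
        if converged:                               # O = t for every vector pair
            break
    return w, b

# Example: learning the logical AND function (linearly separable)
data = [([0, 0], 0), ([0, 1], 0), ([1, 0], 0), ([1, 1], 1)]
print(train_perceptron(data, n_inputs=2))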
Imagine a perceptron (in your brain). The perceptron tries to decide if you should go
to a concert. Is the artist good? Is the weather good? What weights should these facts
have?
• Criteria            Input          Weight
• Artist is Good      x1 = 0 or 1    w1 = 0.7
• Weather is Good     x2 = 0 or 1    w2 = 0.6
• Friend will Come    x3 = 0 or 1    w3 = 0.5
• Food is Served      x4 = 0 or 1    w4 = 0.3
• Drinks are Served   x5 = 0 or 1    w5 = 0.4
• The Perceptron Algorithm:
• 1. Set a threshold value: Threshold = 1.5
• 2. Multiply all inputs by their weights:
• x1 * w1 = 1 * 0.7 = 0.7
• x2 * w2 = 0 * 0.6 = 0
• x3 * w3 = 1 * 0.5 = 0.5
• x4 * w4 = 0 * 0.3 = 0
• x5 * w5 = 1 * 0.4 = 0.4
• 3. Sum all the results:
• 0.7 + 0 + 0.5 + 0 + 0.4 = 1.6 (The Weighted Sum)
• 4. Activate the Output:
• Return true if the sum > 1.5 ("Yes I will go to the Concert")
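The same worked example in Python, using exactly the values listed above; only the function name is an illustrative assumption.

def will_go_to_concert(inputs, weights, threshold=1.5):
    """Return True if the weighted sum of the criteria exceeds the threshold."""
    weighted_sum = sum(x * w for x, w in zip(inputs, weights))
    return weighted_sum > threshold

inputs  = [1, 0, 1, 0, 1]                    # artist good, weather bad, friend comes, no food, drinks served
weights = [0.7, 0.6, 0.5, 0.3, 0.4]
print(will_go_to_concert(inputs, weights))   # weighted sum = 1.6 > 1.5 -> True ("Yes I will go to the Concert")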
Examples
1. AND
• If both inputs are TRUE (+1), the output of the Perceptron is positive, which amounts to
TRUE.
• x1= 1 (TRUE), x2= 1 (TRUE)
• w0 = -.8, w1 = 0.5, w2 = 0.5
• => o(x1, x2) => -.8 + 0.5*1 + 0.5*1 = 0.2 > 0
2. OR
• If either of the two inputs is TRUE (+1), the output of the Perceptron is positive, which
amounts to TRUE.
• x1 = 1 (TRUE), x2 = 0 (FALSE)
• w0 = -.3, w1 = 0.5, w2 = 0.5
• => o(x1, x2) => -.3 + 0.5*1 + 0.5*0 = 0.2 > 0
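The AND and OR examples can be checked with a short Python sketch using the weights given above; the function name is an illustrative assumption.

def perceptron(x1, x2, w0, w1, w2):
    """Output +1 (TRUE) if w0 + w1*x1 + w2*x2 > 0, else -1 (FALSE)."""
    return 1 if w0 + w1 * x1 + w2 * x2 > 0 else -1

# AND: fires only when both inputs are 1 (the weighted sum must exceed 0.8)
print(perceptron(1, 1, w0=-0.8, w1=0.5, w2=0.5))  # -0.8 + 0.5 + 0.5 = 0.2 > 0 -> +1
# OR: fires when at least one input is 1 (the weighted sum must exceed 0.3)
print(perceptron(1, 0, w0=-0.3, w1=0.5, w2=0.5))  # -0.3 + 0.5 + 0.0 = 0.2 > 0 -> +1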
Limitations of perceptron training rule
• It can be proven that this procedure will converge in finite
time if the training examples are linearly separable and the
learning rate is sufficiently small.
• If the data are not linearly separable then convergence is not
assured.
Perceptron at a Glance
• Perceptron has the following characteristics:
– Perceptron is an algorithm for supervised learning of a
single-layer binary linear classifier.
– Optimal weight coefficients are automatically learned.
– Weights are multiplied with the input features, and a decision
is made as to whether the neuron fires or not.
– Activation function applies a step rule to check if the
output of the weighting function is greater than zero.
– Linear decision boundary is drawn enabling the distinction
between the two linearly separable classes +1 and -1.
– If the sum of the input signals exceeds a certain threshold,
it outputs a signal; otherwise, there is no output.
Gradient descent and delta rule
Limitation of gradient descent
• If the learning rate is too large, gradient descent is likely to overshoot the minimum or may fail to
converge at all.
• If it is too small, convergence will take much longer.
• If the number of inputs is large, this becomes even more problematic. Finally,
gradient descent might never find the global minimum.
Stochastic gradient descent
• One common variation on gradient descent intended to alleviate these difficulties is called
incremental gradient descent, or alternatively stochastic gradient descent.
• Whereas the gradient descent training rule presented in Equation (4.7) computes weight updates
after summing over all the training examples in D, the idea behind stochastic gradient descent is to
approximate this gradient descent search by updating the weights incrementally, following the
calculation of the error for each individual training example.
Gradient descent vs. stochastic gradient descent:
• Gradient descent sums the error over all training examples before updating the weights; stochastic gradient descent updates the weights upon examining each individual training example.
• Gradient descent requires more computation per weight update; stochastic gradient descent comparatively less.
• Gradient descent uses a larger step size per weight update; stochastic gradient descent a comparatively smaller one.
• Gradient descent may get stuck in a local minimum; stochastic gradient descent can sometimes avoid falling into local minima.
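To make the difference concrete, here is a small Python sketch contrasting batch gradient descent with stochastic (incremental) gradient descent for a single linear unit trained with the delta rule; the data, learning rate, and function names are illustrative assumptions.

def batch_gradient_descent(examples, eta=0.05, epochs=100):
    """Delta rule, batch form: sum the error gradient over all examples, then update once."""
    w, b = 0.0, 0.0
    for _ in range(epochs):
        dw, db = 0.0, 0.0
        for x, t in examples:
            o = w * x + b                  # linear unit output
            dw += eta * (t - o) * x        # accumulate updates over the whole training set D
            db += eta * (t - o)
        w, b = w + dw, b + db              # one weight update per pass over D
    return w, b

def stochastic_gradient_descent(examples, eta=0.05, epochs=100):
    """Delta rule, incremental form: update the weights after each individual example."""
    w, b = 0.0, 0.0
    for _ in range(epochs):
        for x, t in examples:
            o = w * x + b
            w += eta * (t - o) * x         # immediate update per example
            b += eta * (t - o)
    return w, b

data = [(0.0, 1.0), (1.0, 3.0), (2.0, 5.0)]   # target function t = 2x + 1
print(batch_gradient_descent(data))
print(stochastic_gradient_descent(data))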
Limitations of Perceptron:
The perceptron model has some limitations that can make it unsuitable for certain
types of problems:
• Limited to linearly separable problems (illustrated with XOR in the sketch below).
• Convergence issues with non-separable data
• Requires labelled data
• Sensitivity to input scaling
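As an illustration of the first limitation, no single perceptron can represent XOR, because XOR is not linearly separable. The brute-force search below, whose grid resolution and helper names are illustrative assumptions, confirms that no weight setting classifies all four XOR cases correctly.

import itertools

# XOR truth table: not linearly separable, so no single perceptron can represent it.
xor_data = [([0, 0], 0), ([0, 1], 1), ([1, 0], 1), ([1, 1], 0)]

def classify(x, w, b):
    """Threshold unit: output 1 if w1*x1 + w2*x2 + b > 0, else 0."""
    return 1 if w[0] * x[0] + w[1] * x[1] + b > 0 else 0

# Try a grid of weights and biases: no combination gets all four XOR cases right.
grid = [i / 10 for i in range(-20, 21)]
solutions = [
    (w1, w2, b)
    for w1, w2, b in itertools.product(grid, grid, grid)
    if all(classify(x, (w1, w2), b) == t for x, t in xor_data)
]
print(len(solutions))  # -> 0: no linear decision boundary separates the XOR classes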