Deep Learning

UNIT-I
 Neural Networks: History of Deep Learning,
 Deep Learning Success Stories
 McCulloch Pitts Neuron
 Thresholding Logic
 Perceptrons
 Perceptron Learning Algorithm
 Multilayer Perceptrons (MLPs)
 Representation Power of MLPs
 Sigmoid Neurons
 Gradient Descent
History of Deep Learning

• Deep Learning is transforming the way machines understand, learn, and interact with complex data.
• Deep learning mimics the neural networks of the human brain.
• It enables computers to autonomously uncover patterns and make informed decisions from vast amounts of unstructured data.
Historical trends in deep learning

https://medium.com/@lmpo/a-brief-history-of-ai-with-deep-learning-26f7948bc87b
Early Beginnings (1940s - 1960s)
• 1943: The journey began with Warren McCulloch and Walter Pitts' model of artificial neurons, the McCulloch-Pitts neuron, which laid the foundation for neural network theory.
• 1957: Frank Rosenblatt introduced the Perceptron, an early neural network model capable of learning and recognizing patterns.

The Winter of AI (1970s - 1980s)
• Despite early enthusiasm, neural networks faced challenges, including computational limitations and the inability to train multi-layer networks, leading to reduced interest in the field, known as the "AI winter."
• 1974: Paul Werbos developed backpropagation, a key algorithm for training neural networks, but it remained largely unnoticed until the mid-1980s.
Revival and Growth (1980s - 1990s)
• 1986: Geoffrey Hinton, David Rumelhart, and Ronald Williams popularized backpropagation, reviving interest in neural networks.
• 1989: Yann LeCun applied backpropagation to handwritten digit recognition, leading to the development of Convolutional Neural Networks (CNNs).
The Emergence of Deep Learning (2000s)
• 2006: Hinton and his colleagues introduced the concept of deep belief networks (DBNs), marking the formal beginning of deep learning.
• 2009: Fei-Fei Li's ImageNet project provided a large-scale dataset for training deep learning models, fueling advancements in computer vision.
 Breakthroughs and Dominance (2010s)
• 2012: Alex Krizhevsky, Ilya Sutskever, and Hinton won the
ImageNet competition with AlexNet, a deep CNN, demonstrating
the power of deep learning in image recognition.
• 2014: The introduction of Generative Adversarial Networks
(GANs) by Ian Goodfellow opened new possibilities in generative
modeling.
• 2015–2016: Google's DeepMind developed AlphaGo, which defeated world Go champion Lee Sedol in 2016, showcasing deep learning's potential in complex strategy games.
• 2015 onward: Open-source frameworks such as TensorFlow (2015) and PyTorch (2016) made deep learning more accessible to researchers and practitioners.
 Recent Advances and Future Directions (2020s)
• 2020: OpenAI's GPT-3, a language model with 175
billion parameters, demonstrated the capabilities of deep
learning in natural language processing.
• Ongoing Research: Deep learning continues to evolve
with advancements in areas like reinforcement learning,
unsupervised learning, and multimodal learning.
Deep Learning Success Stories
1. Healthcare: Detecting Diseases with AI
Example: Google’s DeepMind and AI in Medical Imaging
 Faster and more accurate diagnoses
 Early disease detection saves lives
 Reduces workload for doctors

2. Self-Driving Cars: Tesla & Waymo


Example: Tesla Autopilot & Waymo’s AI Driver
 Safer driving with fewer accidents
 Reducing traffic congestion
 Potential for fully autonomous transportation
3. Natural Language Processing: ChatGPT
& Google Translate
Example: OpenAI’s ChatGPT, Google Translate
 Revolutionizing human-computer interaction
 Automating content creation and summarization
 Breaking language barriers worldwide
4. Gaming: AlphaGo & OpenAI Five
Example: AlphaGo defeating human Go champions
DeepMind’s AlphaGo beat world champion Go players
using deep reinforcement learning. Similarly, OpenAI
Five defeated professional Dota 2 players.
 AI mastering complex strategy games
 Advancing reinforcement learning techniques
 Applications in real-world decision-making
5. Finance: Fraud Detection & Algorithmic
Trading
Example: AI in Fraud Detection
 Preventing financial fraud
 Enhancing security in banking
 Improving automated trading strategies
6. Entertainment: Deepfake Technology &
Personalized Recommendations
Example: Netflix & Spotify AI Recommendations
 Increased user engagement
 Better content discovery
 Improved customer retention for businesses

7. Climate Science: AI for Weather Forecasting


Example: NVIDIA’s FourCastNet
 Faster and more precise weather forecasting
 Better disaster preparedness
 Climate change research advancements
McCulloch-Pitts Neuron Model
 The McCulloch-Pitts Neuron is the first
computational model of a neuron.
 It can be divided into two parts:
 1. Aggregation: The neuron aggregates multiple
Boolean inputs (0 or 1).
 2. Threshold Decision: Based on the aggregated
value, the neuron makes a decision using a
threshold function.
The first computational model of a neuron was proposed by Warren McCulloch (neuroscientist) and Walter Pitts (logician) in 1943.
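A minimal sketch of the two parts in Python (the function name and threshold values are illustrative, not from the original slides):

```python
def mcp_neuron(inputs, threshold):
    """McCulloch-Pitts neuron: aggregate Boolean inputs, then apply a threshold."""
    g = sum(inputs)                      # 1. Aggregation of Boolean (0/1) inputs
    return 1 if g >= threshold else 0    # 2. Threshold decision

# With two inputs, threshold=2 behaves like Boolean AND, threshold=1 like OR.
print(mcp_neuron([1, 1], threshold=2))   # 1
print(mcp_neuron([1, 0], threshold=2))   # 0
print(mcp_neuron([1, 0], threshold=1))   # 1
```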
Perceptron
Perceptron Learning Algorithm

• The perceptron model consists of 4 steps:
• Input from other neurons
• Weights and bias
• Net sum
• Activation function

Step Activation Function:
• The step (threshold) activation function outputs 1 if the net sum is greater than or equal to the threshold and 0 otherwise.
Perceptron Learning Boolean AND Function
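A minimal sketch in Python of the perceptron learning rule applied to the Boolean AND function (the learning rate, epoch count, and zero initialization are illustrative choices):

```python
import numpy as np

# Perceptron with a step activation trained on the Boolean AND function.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
y = np.array([0, 0, 0, 1])             # AND truth table

w = np.zeros(2)                         # weights
b = 0.0                                 # bias
lr = 0.1                                # learning rate

def step(z):
    return 1 if z >= 0 else 0           # step (threshold) activation

for epoch in range(10):                 # a few passes are enough for AND
    for xi, target in zip(X, y):
        z = np.dot(w, xi) + b           # net sum
        pred = step(z)                  # activation
        error = target - pred
        w += lr * error * xi            # perceptron learning rule
        b += lr * error

print([step(np.dot(w, xi) + b) for xi in X])   # expected: [0, 0, 0, 1]
```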
Feedforward Neural Network

• Feedforward Neural Network (FNN) is a type of artificial neural network in which information flows in a single direction—from the input layer through hidden layers to the output layer—without loops or feedback.
• It is mainly used for pattern recognition
tasks like image and speech classification.
Structure of a Feedforward
Neural Network
Feedforward Neural Networks have a structured layered
design where data flows sequentially through each layer.
• Input Layer: The input layer consists of neurons that receive
the input data. Each neuron in the input layer represents a
feature of the input data.
• Hidden Layers: One or more hidden layers are placed
between the input and output layers. These layers are
responsible for learning the complex patterns in the data.
Each neuron in a hidden layer applies a weighted sum of
inputs followed by a non-linear activation function.
• Output Layer: The output layer provides the final output of
the network. The number of neurons in this layer
corresponds to the number of classes in a classification
problem or the number of outputs in a regression problem.
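A minimal sketch of one forward pass through such a network in Python (the 4-5-3 layer sizes, random weights, and ReLU/softmax activations are illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(0)

def relu(z):
    return np.maximum(0, z)

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

x  = rng.normal(size=4)                          # input layer: 4 features
W1 = rng.normal(size=(5, 4)); b1 = np.zeros(5)   # hidden layer: 5 neurons
W2 = rng.normal(size=(3, 5)); b2 = np.zeros(3)   # output layer: 3 classes

h = relu(W1 @ x + b1)        # hidden layer: weighted sum + non-linear activation
y = softmax(W2 @ h + b2)     # output layer: one probability per class
print(y)                     # three values summing to 1
```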
Feedforward Neural Network
Activation Functions
• Activation functions introduce non-linearity into the network, enabling it to learn and model complex data patterns.
• Common activation functions include sigmoid, tanh, and ReLU:
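A minimal sketch of these functions in Python (illustrative definitions, not from the original slides):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))    # squashes values into (0, 1)

def tanh(z):
    return np.tanh(z)                   # squashes values into (-1, 1)

def relu(z):
    return np.maximum(0, z)             # 0 for negative inputs, identity otherwise

z = np.array([-2.0, 0.0, 2.0])
print(sigmoid(z), tanh(z), relu(z))
```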
Training a Feedforward Neural Network

• Training a Feedforward Neural Network involves adjusting the weights of the neurons to minimize the error between the predicted output and the actual output. This process is typically performed using backpropagation and gradient descent.
 Forward Propagation: During forward propagation the input
data passes through the network and the output is calculated.
 Loss Calculation: The loss (or error) is calculated using a loss
function such as Mean Squared Error (MSE) for regression tasks
or Cross-Entropy Loss for classification tasks.
 Backpropagation: In backpropagation the error is propagated
back through the network to update the weights. The gradient of
the loss function with respect to each weight is calculated and
the weights are adjusted using gradient descent.
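A minimal sketch of one such training step in Python for a single sigmoid neuron with MSE loss (the data, learning rate, and initial weights are illustrative):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

x = np.array([0.5, -1.0]); t = 1.0      # one training example and its target
w = np.array([0.1, 0.2]);  b = 0.0      # initial weights and bias
lr = 0.5                                 # learning rate

# Forward propagation
z = w @ x + b
y = sigmoid(z)

# Loss calculation (MSE for a single example)
loss = 0.5 * (y - t) ** 2

# Backpropagation: chain rule dL/dw = dL/dy * dy/dz * dz/dw
dL_dy = y - t
dy_dz = y * (1 - y)
grad_w = dL_dy * dy_dz * x
grad_b = dL_dy * dy_dz

# Gradient descent update
w -= lr * grad_w
b -= lr * grad_b
print(loss, w, b)
```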
Backpropagation in Neural Network

• Backpropagation, also known as "backward propagation of errors," is a method used to train neural networks.
 Its goal is to reduce the difference between the
model’s predicted output and the actual output by
adjusting the weights and biases in the network.
 It works iteratively to adjust weights and bias to
minimize the cost function.
 It often uses optimization algorithms like gradient
descent or stochastic gradient descent.
Backpropagation in Neural
Network
Backpropagation plays a critical role in how neural networks improve over time. Here's why:
• Efficient Weight Update: It computes the gradient of the loss function with respect to each weight using the chain rule, making it possible to update weights efficiently (see the formula after this list).
• Scalability: The backpropagation algorithm scales well to networks with multiple layers and complex architectures, making deep learning feasible.
• Automated Learning: With backpropagation, the learning process becomes automated and the model can adjust itself to optimize its performance.
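As a sketch of the chain-rule computation behind the weight update (notation assumed here: loss $L$, prediction $\hat{y}$, pre-activation $z$, weight $w_{ij}$, learning rate $\eta$):

$$\frac{\partial L}{\partial w_{ij}} = \frac{\partial L}{\partial \hat{y}} \cdot \frac{\partial \hat{y}}{\partial z} \cdot \frac{\partial z}{\partial w_{ij}}, \qquad w_{ij} \leftarrow w_{ij} - \eta\,\frac{\partial L}{\partial w_{ij}}$$

For weights in earlier layers, the chain simply includes additional factors, one per intermediate layer.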
Representation Power of MLPs
• The representation power of a multilayer perceptron (MLP) refers to its ability to approximate complex functions and map inputs to outputs effectively.
• Here are key aspects of its representation power:
1. Universal Approximation Theorem
 An MLP with at least one hidden layer and a nonlinear
activation function (such as ReLU, sigmoid, or tanh)
can approximate any continuous function to any desired
accuracy, given sufficient neurons in the hidden layer.
 This means that an MLP is a universal function
approximator, capable of representing highly complex
decision boundaries.
 This theorem provides a mathematical foundation
for why neural networks are capable of solving
complex problems across various domains like
image recognition, natural language
processing, and more.
2. Depth vs. Width
 Shallow MLPs (Single Hidden Layer): A single-
layer MLP with enough neurons can approximate
any function, but it may require an exponentially
large number of neurons.
 Deep MLPs (Multiple Hidden Layers): Adding
depth often allows a model to represent functions
more efficiently, requiring fewer neurons and
improving generalization.
3. Nonlinear Activations are Crucial
 Without nonlinear activation functions, an MLP is just
a linear transformation, no more powerful than a
single-layer perceptron.
 Activation functions like ReLU, tanh, or sigmoid
introduce nonlinearity, allowing the MLP to learn
complex representations.
4. Representation of Decision Boundaries
• A single-layer perceptron can only represent linearly separable functions.
• An MLP with hidden layers can model nonlinear decision boundaries, enabling it to solve problems like XOR, image recognition, and natural language processing (a minimal XOR sketch is shown below).
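A minimal sketch of the XOR case in Python, using a hand-crafted two-hidden-unit MLP with step activations (the weights are illustrative and fixed, not learned):

```python
import numpy as np

def step(z):
    return (z >= 0).astype(int)

# Hidden unit 1 computes OR, hidden unit 2 computes AND;
# the output unit fires when OR is true but AND is false, i.e. XOR.
W1 = np.array([[1, 1],       # OR unit
               [1, 1]])      # AND unit
b1 = np.array([-0.5, -1.5])
W2 = np.array([1, -1])       # OR minus AND
b2 = -0.5

for x in [(0, 0), (0, 1), (1, 0), (1, 1)]:
    h = step(W1 @ np.array(x) + b1)   # hidden layer
    y = step(W2 @ h + b2)             # output layer
    print(x, int(y))                  # prints the XOR truth table
```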
5. Expressive Power vs. Trainability
• While deep MLPs can represent complex functions, training them effectively requires:
• Sufficient data to avoid overfitting.
• Proper weight initialization to avoid vanishing/exploding gradients.
• Optimization techniques like batch normalization and dropout to improve generalization.
Sigmoid Neurons

• Sigmoid neurons, also known as logistic neurons, are a type of artificial neuron that uses the sigmoid function as its activation function.
• This function squashes the output to a range between 0 and 1, making it suitable for tasks like binary classification and as a building block for deeper neural networks.
• It is called a sigmoid neuron because the function's output forms an S-shaped curve when plotted on a graph.
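For an input vector $x$ with weights $w$ and bias $b$ (standard notation, assumed here rather than taken from the slides), the sigmoid neuron computes

$$y = \sigma(w \cdot x + b) = \frac{1}{1 + e^{-(w \cdot x + b)}}$$

Unlike the perceptron's step function, this output changes smoothly as the weights change, which is what makes gradient-based training possible.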
Gradient Descent

• Gradient Descent is an optimization algorithm used to minimize the loss function by iteratively updating the weights in the direction of the negative gradient. Common variants of gradient descent include:
• Batch Gradient Descent: Updates weights after computing the gradient over the entire dataset.
• Stochastic Gradient Descent (SGD): Updates weights for each training example individually.
• Mini-batch Gradient Descent: Updates weights after computing the gradient over a small batch of training examples.
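A minimal sketch of mini-batch gradient descent in Python for a linear model with MSE loss (the data, batch size, and learning rate are illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))                  # 100 examples, 3 features
true_w = np.array([2.0, -1.0, 0.5])
y = X @ true_w + 0.1 * rng.normal(size=100)    # noisy targets

w = np.zeros(3)
lr, batch_size = 0.1, 16

for epoch in range(50):
    idx = rng.permutation(len(X))              # shuffle examples each epoch
    for start in range(0, len(X), batch_size):
        batch = idx[start:start + batch_size]
        Xb, yb = X[batch], y[batch]
        grad = 2 * Xb.T @ (Xb @ w - yb) / len(batch)   # gradient of MSE on the batch
        w -= lr * grad                          # step along the negative gradient

print(w)   # should end up close to [2.0, -1.0, 0.5]
```

Setting the batch size to the full dataset gives batch gradient descent, and setting it to 1 gives stochastic gradient descent.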
Evaluation of a Feedforward Neural Network
 Evaluating the performance of the trained model involves several metrics:
 Accuracy: The proportion of correctly classified instances out of the total instances.
 Precision: The ratio of true positive predictions to the total predicted positives.
 Recall: The ratio of true positive predictions to the actual positives.
 F1 Score: The harmonic mean of precision and recall, providing a balance between
the two.
 Confusion Matrix: A table used to describe the performance of a classification
model, showing the true positives, true negatives, false positives, and false negatives.
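A minimal sketch of these metrics computed from confusion-matrix counts (the counts below are made up for illustration):

```python
# Illustrative confusion-matrix counts: true/false positives and negatives.
tp, tn, fp, fn = 40, 45, 5, 10

accuracy  = (tp + tn) / (tp + tn + fp + fn)   # correct predictions / all predictions
precision = tp / (tp + fp)                     # correct positives / predicted positives
recall    = tp / (tp + fn)                     # correct positives / actual positives
f1        = 2 * precision * recall / (precision + recall)   # harmonic mean

print(f"accuracy={accuracy:.2f} precision={precision:.2f} "
      f"recall={recall:.2f} f1={f1:.2f}")
# accuracy=0.85 precision=0.89 recall=0.80 f1=0.84
```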
