Lecture 4
Logistic regression
and neural networks
Machine Learning
Andrey Filchenkov
08.06.2016
Lecture plan
• Logistic regression
• Single-layer neural network
• Completeness problem of neural
networks
• Multilayer neural networks
• Backpropagation
• Modern neural networks
• The presentation is prepared using
materials from K.V. Vorontsov’s
course “Machine Learning”.
Logistic regression
We may want to talk about the probability of belonging to a class
(we will discuss it in Lecture 5 in detail).
The model maps ⟨w, x⟩ ∈ (−∞, +∞) into (0, 1):
y = 1 / (1 + exp(−⟨w, x⟩)) = σ(⟨w, x⟩),
where σ(z) is the logistic (sigmoid) function.
The weights are learned by minimizing
Q(a, T^ℓ) = Σ_{i=1}^{ℓ} ln(1 + exp(−⟨w, x_i⟩ y_i)) → min_w.
That is the logarithmic loss function.
Logarithmic loss function plot
Gradient descent
Derivative of the sigmoid:
σ′(s) = σ(s) σ(−s).
Gradient:
∇Q(w) = −Σ_{i=1}^{ℓ} y_i x_i σ(−M_i(w)),
where M_i(w) = ⟨w, x_i⟩ y_i is the margin.
Gradient step for a single object x_i (stochastic gradient):
w^(t+1) = w^(t) + µ y_i x_i σ(−M_i(w^(t))).
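A minimal NumPy sketch of this stochastic gradient update (the function name, learning rate, and toy data are illustrative, not from the slides):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def sgd_logistic(X, y, lr=0.1, epochs=100):
    """Stochastic gradient steps w <- w + lr * y_i * x_i * sigmoid(-M_i(w))."""
    w = np.zeros(X.shape[1])
    for _ in range(epochs):
        for x_i, y_i in zip(X, y):
            margin = np.dot(w, x_i) * y_i           # M_i(w) = <w, x_i> y_i
            w += lr * y_i * x_i * sigmoid(-margin)  # step on the log loss
    return w

# toy usage: labels must be in {-1, +1}
X = np.array([[1.0, 2.0], [2.0, 1.0], [-1.0, -2.0], [-2.0, -1.0]])
y = np.array([1, 1, -1, -1])
w = sgd_logistic(X, y)
```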
Smoothed Hebb’s rule
Hebb’s rule:
if −⟨w, x_i⟩ y_i > 0, then w^(t+1) = w^(t) + µ x_i y_i.
The threshold indicator [M_i < 0] and its smoothed version σ(−M_i):
Logistic regression implementation
Python (scikit-learn): LogisticRegression with different solvers
Weka: Logistic
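A usage sketch with scikit-learn (the synthetic dataset and the particular solver are only for illustration):

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# synthetic binary classification data
X, y = make_classification(n_samples=200, n_features=5, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

clf = LogisticRegression(solver="lbfgs")  # other solvers: "liblinear", "saga", ...
clf.fit(X_train, y_train)
print(clf.score(X_test, y_test))          # accuracy on the test set
print(clf.predict_proba(X_test[:3]))      # class probabilities from the sigmoid
```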
Lecture plan
• Logistic regression
• Single-layer neural network
• Completeness problem of neural
networks
• Multilayer neural networks
• Backpropagation
• Modern neural networks
Biological intuition
Neuron
Generalized McCulloch-Pitts neuron:
a(x, w) = σ(Σ_{j=1}^{n} w_j f_j(x) − w_0),
where σ is an activation function, f_j(x) are features,
w_j are weights, and w_0 is the threshold.
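A direct reading of this formula as code; the tanh activation and the example numbers are arbitrary illustrative choices:

```python
import numpy as np

def neuron(x, w, w0, activation=np.tanh):
    """Generalized neuron: a(x, w) = activation(sum_j w_j * f_j(x) - w0).
    Here the features f_j(x) are simply the components of x."""
    return activation(np.dot(w, x) - w0)

# example: a single neuron with a tanh activation
print(neuron(np.array([0.5, -1.0]), w=np.array([2.0, 1.0]), w0=0.1))
```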
Activation functions
Rosenblatt’s rule and Hebb’s rule
Rosenblatt’s rule (for {1; 0} labels) for weight
learning: for each object x_i, change the
weight vector:
w^(t+1) := w^(t) − η (a(x_i) − y_i) x_i.
Hebb’s rule (for {1; −1} labels) for weight
learning: for each object x_i, change the
weight vector:
if ⟨w, x_i⟩ y_i < 0, then w^(t+1) := w^(t) + η x_i y_i.
Delta rule
Let L(a, x_i) = ½ (⟨w, x_i⟩ − y_i)².
Delta rule for weight learning: for each object
x_i, change the weight vector:
w^(t+1) := w^(t) − η (⟨w, x_i⟩ − y_i) x_i.
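A small sketch of the delta rule as a training loop (the function name, learning rate, and toy data are illustrative):

```python
import numpy as np

def delta_rule(X, y, lr=0.01, epochs=100):
    """Delta rule: w <- w - lr * (<w, x_i> - y_i) * x_i for each object."""
    w = np.zeros(X.shape[1])
    for _ in range(epochs):
        for x_i, y_i in zip(X, y):
            w -= lr * (np.dot(w, x_i) - y_i) * x_i
    return w

# toy usage with labels in {-1, +1}
X = np.array([[1.0, 1.0], [2.0, 0.5], [-1.5, -1.0], [-0.5, -2.0]])
y = np.array([1, 1, -1, -1])
print(delta_rule(X, y))
```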
Lecture plan
• Logistic regression
• Single-layer neural network
• Completeness problem of neural
networks
• Multilayer neural networks
• Backpropagation
• Modern neural networks
Completeness problem (for neuron)
Basic idea: synthesize combinations of neurons.
Completeness problem: how rich is the family of
functions that can be represented by a neural
network?
Start with a single neuron.
Logical functions as neural networks
Logical AND
x_1 ∧ x_2 = [x_1 + x_2 − 3/2 > 0]
Logical OR
x_1 ∨ x_2 = [x_1 + x_2 − 1/2 > 0]
Logical NOT
¬x_1 = [−x_1 + 1/2 > 0]
Two ways of making it more complex
Example (Minsky): x_1 ⊕ x_2 (XOR) is not linearly separable.
Two ways of making it more complex
(see the sketch after this list):
1. Use a non-linear transformation:
x_1 ⊕ x_2 = [x_1 + x_2 − 2 x_1 x_2 − 1/2 > 0]
2. Build a superposition:
x_1 ⊕ x_2 = [(x_1 ∨ x_2) − (x_1 ∧ x_2) − 1/2 > 0]
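These threshold formulas can be checked directly in code (a sketch; the function names are only for illustration):

```python
def AND(x1, x2):
    return int(x1 + x2 - 1.5 > 0)

def OR(x1, x2):
    return int(x1 + x2 - 0.5 > 0)

def NOT(x1):
    return int(-x1 + 0.5 > 0)

def XOR(x1, x2):
    # superposition of two linear threshold neurons
    return int(OR(x1, x2) - AND(x1, x2) - 0.5 > 0)

for a in (0, 1):
    for b in (0, 1):
        print(a, b, AND(a, b), OR(a, b), XOR(a, b))
```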
Completeness problem (Boolean functions)
Completeness problem: how rich is the family of
functions that can be represented by a neural
network?
DNF Theorem:
Any Boolean function can be represented by one
and only one full disjunctive normal form.
What about all possible functions?
Gorban Theorem
Theorem (Gorban, 1998)
Let
• X be a compact space,
• C(X) be the algebra of continuous real-valued
functions on X,
• F be a linear subspace of C(X), closed with respect to
a nonlinear continuous function φ (f ∈ F ⇒ φ(f) ∈ F)
and containing the constant function (1 ∈ F),
• F separate points in X.
Then F is dense in C(X).
Lecture plan
• Logistic regression
• Single-layer neural network
• Completeness problem of neural
networks
• Multilayer neural networks
• Backpropagation
• Modern neural networks
Multilayer neural network
Multilayer neural network
Any number of layers
Any number of neurons on each layer
Any number of connections between different layers
Weights adjusting
Let us use SGD to learn the weights
w = (w_jh, w_hm):
w^(t+1) = w^(t) − η ∇L(w^(t), x_i, y_i),
where L(w, x_i, y_i) is the loss function (it depends on the
problem we are solving).
Lecture plan
• Logistic regression
• Single-layer neural network
• Completeness problem of neural
networks
• Multilayer neural networks
• Backpropagation
• Modern neural networks
Derivation of functions superposition
a^m(x) = σ_m(Σ_{h=0}^{H} w_hm u^h(x));  (output layer)
u^h(x) = σ_h(Σ_{j=0}^{n} w_jh f_j(x));  (hidden layer)
Let L_i(w) = ½ Σ_{m=1}^{M} (a^m(x_i) − y_i^m)².
Find the partial derivatives
∂L_i(w)/∂a^m and ∂L_i(w)/∂u^h.
Errors on layers
∂L_i(w)/∂a^m = a^m(x_i) − y_i^m
ε_i^m = a^m(x_i) − y_i^m is the error on the output layer.
∂L_i(w)/∂u^h = Σ_{m=1}^{M} (a^m(x_i) − y_i^m) σ′_m w_hm = Σ_{m=1}^{M} ε_i^m σ′_m w_hm
ε_i^h = Σ_{m=1}^{M} ε_i^m σ′_m w_hm is the error on the hidden layer.
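A compact NumPy sketch of one backpropagation/SGD step for a two-layer network, assuming sigmoid hidden units, linear outputs, and the squared loss above (all names, sizes, and the learning rate are illustrative):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def backprop_step(x, y, W1, W2, lr=0.1):
    """One SGD step; sigmoid hidden layer, linear output, loss 1/2 * sum (a - y)^2."""
    # forward pass
    u = sigmoid(W1 @ x)                       # hidden layer outputs u^h(x)
    a = W2 @ u                                # output layer a^m(x) (linear)
    # backward pass
    eps_out = a - y                           # errors on the output layer
    eps_hid = (W2.T @ eps_out) * u * (1 - u)  # errors on the hidden layer (sigma' = u(1-u))
    # gradient step
    W2 -= lr * np.outer(eps_out, u)
    W1 -= lr * np.outer(eps_hid, x)
    return W1, W2

# toy usage: 3 inputs, 4 hidden neurons, 2 outputs
rng = np.random.default_rng(0)
W1, W2 = rng.normal(size=(4, 3)), rng.normal(size=(2, 4))
x, y = rng.normal(size=3), rng.normal(size=2)
W1, W2 = backprop_step(x, y, W1, W2)
```

The hidden-layer errors are obtained from the output-layer errors through the same weights W2, which is exactly the backward propagation of errors described above.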
Backpropagation discussion (advantages)
Advantages:
• efficiency: the gradient is computed in time
comparable to a single forward pass of the network;
• easily applies to any activation σ and any loss L;
• can be applied in online (dynamic) learning;
• the whole sample need not be used at once;
• can be parallelized.
Backpropagation discussion
(disadvantages)
Disadvantages:
• does not always converge;
• can get stuck in local optima;
• the number of neurons in the hidden layer must be
fixed in advance;
• the more connections, the more probable overfitting is;
• “paralysis” of a single neuron or of the whole network.
Lecture plan
• Logistic regression
• Single-layer neural network
• Completeness problem of neural
networks
• Multilayer neural networks
• Backpropagation
• Modern neural networks
Plethora of neural networks
Tens or even hundreds of different neural network
types exist:
• self-organizing map
• deep learning networks
• recurrent neural networks
• radial basis function networks
• Bayesian neural networks
• modular neural networks
• …