ML Logistic Regression Module3 Final

Module 2 – Supervised Learning

Logistic Regression

▪ There are many important research topics for which the dependent variable (y) is "limited" or "categorical".
▪ For example: voting, morbidity or mortality, and participation data are not continuous or normally distributed.
▪ Binary logistic regression is a type of regression analysis where the dependent variable is a binary/dummy variable: coded 0 (failure) or 1 (success).
▪ The independent variable(s) may be of any kind.
Logistic Regression

Like multiple regression, logistic regression is a statistical analysis used to examine relationships between independent variables (predictors) and a dependent variable (criterion).

The main difference is that in logistic regression the criterion is nominal (predicting group membership). For example, do age and gender predict whether one signs up for swimming lessons (yes/no)? A minimal sketch of this setup appears below.
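As a concrete illustration, here is a minimal sketch of the swimming-lessons example using scikit-learn; the data, coefficients, and signup rule are invented purely for illustration.

```python
# Minimal sketch of the swimming-lessons example (synthetic, illustrative data).
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
n = 200
age = rng.uniform(5, 60, n)              # predictor 1: age in years
gender = rng.integers(0, 2, n)           # predictor 2: 0/1 dummy variable
# Invented rule: younger people are somewhat more likely to sign up.
p = 1 / (1 + np.exp(-(2.0 - 0.05 * age + 0.3 * gender)))
signed_up = rng.binomial(1, p)           # binary criterion: 1 = yes, 0 = no

X = np.column_stack([age, gender])
model = LogisticRegression().fit(X, signed_up)
print(model.intercept_, model.coef_)     # estimated beta_0 and beta_1, beta_2
```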


Logistic Regression
▪ Logistic regression estimates the probability of an event occurring, based on a given dataset of independent variables.
▪ It helps us understand the relationship between one or more independent variables and a target variable.
▪ The predicted value of the target/dependent variable is a probability, bounded between 0 and 1.
Logistic Regression
▪ We have a binary (dichotomous) response variable Y, defined as

Y = 1 if "success" ("yes")
Y = 0 if "failure" ("no")

▪ π = proportion of "success". We want to model the probability π that Y = 1.
▪ In ordinary regression the model predicts the mean Y for any combination of predictors. In logistic regression the model predicts the true proportion of success, π, at any predictor value.

π = (# of 1's) / (# of trials) = proportion of "success"
Maths behind Logistic Regression

Odds

π / (1 − π) = P(Yes) / P(No) is defined as the odds of "Yes":

odds = π / (1 − π)        π = odds / (1 + odds)
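A tiny sketch of these two identities in Python (the function names are ours):

```python
# Sketch of the odds <-> probability identities above.
def odds(p):
    """Odds of success: p / (1 - p)."""
    return p / (1 - p)

def prob(o):
    """Probability recovered from odds: o / (1 + o)."""
    return o / (1 + o)

p = 0.75
print(odds(p))         # 3.0 -> "3 to 1" odds
print(prob(odds(p)))   # 0.75 -> the round trip recovers p
```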
Die Rolling

Event      Prob   Odds
even #     1/2    1    [or 1:1]
X > 2      2/3    2    [or 2:1]
roll a 2   1/6    1/5  [or 1/5:1 or 1:5]
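The table values can be checked directly; a quick sketch using exact fractions:

```python
# Verify the die-rolling odds in the table with exact arithmetic.
from fractions import Fraction

events = [("even #", Fraction(1, 2)),
          ("X > 2", Fraction(2, 3)),
          ("roll a 2", Fraction(1, 6))]
for label, p in events:
    print(label, p / (1 - p))   # prints 1, 2, and 1/5
```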


Maths behind Logistic Regression
➢ The 'Odds' value is restricted to the range (0, +∞), but the linear predictor β₀ + β₁X can take values over the entire range (−∞, +∞).
➢ To incorporate this, we take the 'log of odds', which has a range of (−∞, +∞):

log(P / (1 − P)) = β₀ + β₁X

➢ Taking the exponential on both sides, we get

exp(log(P / (1 − P))) = exp(β₀ + β₁X)

P / (1 − P) = e^(β₀ + β₁X)

P = e^(β₀ + β₁X) − P·e^(β₀ + β₁X)
Maths behind Logistic Regression
➢ Dividing both sides by P, we get

1 = e^(β₀ + β₁X) / P − e^(β₀ + β₁X), i.e., 1 + e^(β₀ + β₁X) = e^(β₀ + β₁X) / P

P·[1 + e^(β₀ + β₁X)] = e^(β₀ + β₁X)

P = e^(β₀ + β₁X) / (1 + e^(β₀ + β₁X))

➢ Dividing the numerator and denominator by e^(β₀ + β₁X), we get

P = 1 / (1 + e^(−(β₀ + β₁X)))

➢ This is the SIGMOID function, which has an S-shaped curve ranging between 0 and 1 for different values of (β₀ + β₁X), as explained in the next slides.
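A quick numerical check that the two equivalent forms derived above agree (a sketch; z stands in for β₀ + β₁X):

```python
# Check that P = e^z / (1 + e^z) and P = 1 / (1 + e^-z) agree.
import numpy as np

z = np.linspace(-6, 6, 13)                    # stand-in for beta0 + beta1*X
ratio_form = np.exp(z) / (1 + np.exp(z))      # P = e^z / (1 + e^z)
sigmoid_form = 1 / (1 + np.exp(-z))           # P = 1 / (1 + e^-z)
print(np.allclose(ratio_form, sigmoid_form))  # True
print(sigmoid_form.round(3))                  # S-shaped values rising from ~0 to ~1
```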
Two forms of Binary Logistic Regression

▪ X = quantitative predictor, Y = binary response
π = proportion of success (Y = 1) at any X

▪ Equivalent forms of the logistic regression model:

Logit form:          ln(π / (1 − π)) = β₀ + β₁X
Probability form:    π = e^(β₀ + β₁X) / (1 + e^(β₀ + β₁X))
Binary Logistic Regression Model – Contd…

▪ The logistic distribution constrains the estimated probabilities to lie between 0 and 1.
▪ The estimated probability (π / P) is:

π = e^(β₀ + β₁X) / (1 + e^(β₀ + β₁X)) = 1 / (1 + e^(−(β₀ + β₁X)))

▪ If β₀ + β₁X = 0, then P = 0.50
▪ As β₀ + β₁X gets really big (→ +∞), P approaches 1
▪ As β₀ + β₁X gets really small (→ −∞), P approaches 0
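These limiting behaviours are easy to confirm numerically; a small sketch (z stands in for β₀ + β₁X):

```python
# Sigmoid behaviour at z = 0 and at the extremes.
import math

s = lambda z: 1 / (1 + math.exp(-z))
print(s(0))     # 0.5
print(s(10))    # ~0.99995 -> approaches 1 as z -> +inf
print(s(-10))   # ~0.00005 -> approaches 0 as z -> -inf
```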
Probability function
Odds Ratio

A common way to compare two groups is to look at the ratio of their odds:

Odds Ratio = OR = Odds₁ / Odds₂

Note: the odds ratio (OR) is similar to the relative risk (RR):

RR = p₁ / p₂        OR = RR × (1 − p₂) / (1 − p₁)

So when p is small, OR ≈ RR.
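A short sketch comparing OR and RR for invented probabilities, showing they nearly coincide when p is small:

```python
# Odds ratio vs relative risk for two groups (probabilities are made up).
def odds_ratio(p1, p2):
    return (p1 / (1 - p1)) / (p2 / (1 - p2))

def relative_risk(p1, p2):
    return p1 / p2

print(relative_risk(0.02, 0.01), odds_ratio(0.02, 0.01))  # 2.0 vs ~2.02: close
print(relative_risk(0.6, 0.3), odds_ratio(0.6, 0.3))      # 2.0 vs 3.5: far apart
```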
Interpreting “Slope” using Odds

When X is replaced by X + 1:

odds = e^(β₀ + β₁X)

is replaced by

odds = e^(β₀ + β₁(X + 1))

So the ratio is

e^(β₀ + β₁(X + 1)) / e^(β₀ + β₁X) = e^((β₀ + β₁(X + 1)) − (β₀ + β₁X)) = e^(β₁)

When we increase X by 1, the ratio of the new odds to the old odds is e^(β₁), i.e., the odds are multiplied by e^(β₁).
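A one-line check of this slope interpretation (the β values below are invented):

```python
# Increasing X by 1 multiplies the odds by e^beta1 (beta values are invented).
import math

b0, b1 = -2.0, 0.7
odds_at = lambda x: math.exp(b0 + b1 * x)
x = 3.0
print(odds_at(x + 1) / odds_at(x))   # ratio of new odds to old odds
print(math.exp(b1))                  # e^beta1 -- the same number
```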
Maximum Likelihood Estimation (MLE)
➢ The beta parameters (coefficients) of logistic regression are estimated using Maximum Likelihood Estimation (MLE).
➢ This method tests different values of beta through multiple iterations to optimise for the best fit of the 'log odds'.
➢ Cost function in logistic regression: the prediction ŷᵢ is a nonlinear function of the parameters, given as

ŷᵢ = σ(βᵀxᵢ)

➢ As yᵢ is binary, each label can be interpreted as a Bernoulli random variable, which follows the Bernoulli distribution given as

P(yᵢ) = p^(yᵢ) · (1 − p)^(1 − yᵢ)
Maximum Likelihood Estimation (MLE)
➢ Substituting the sigmoid function for p, this can be given as

P(yᵢ) = σ(βᵀxᵢ)^(yᵢ) · (1 − σ(βᵀxᵢ))^(1 − yᵢ)

➢ The likelihood function L(β) is given as

L(β) = ∏ᵢ₌₁ⁿ σ(βᵀxᵢ)^(yᵢ) · (1 − σ(βᵀxᵢ))^(1 − yᵢ)

➢ We need to find the value of β which maximises this likelihood function.
➢ For easier calculation, take the log of both sides to get the log-likelihood function LL(β), given as

log(L(β)) = ∑ᵢ₌₁ⁿ yᵢ · log[σ(βᵀxᵢ)] + (1 − yᵢ) · log[1 − σ(βᵀxᵢ)]

➢ In order to maximise LL(β), we can minimise −LL(β), i.e.,

max[log x] = min[−log x]
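Putting these formulas together, a minimal sketch of the negative log-likelihood in Python (the function names and toy data are ours):

```python
# Negative log-likelihood -LL(beta) for logistic regression.
import numpy as np

def sigmoid(z):
    return 1 / (1 + np.exp(-z))

def neg_log_likelihood(beta, X, y):
    """Minimising -LL(beta) is equivalent to maximising LL(beta)."""
    p = sigmoid(X @ beta)
    return -np.sum(y * np.log(p) + (1 - y) * np.log(1 - p))

# Toy data: first column of X is the intercept term.
X = np.array([[1.0, 0.5], [1.0, -1.2], [1.0, 2.0]])
y = np.array([1, 0, 1])
print(neg_log_likelihood(np.zeros(2), X, y))  # 3*log(2) ~= 2.079 at beta = 0
```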
Cost function
🠶 Derivative of the sigmoid function: σ′(z) = σ(z) · (1 − σ(z))

Derivation of Cost function

Derivative of the cost function and the resulting gradient-descent update rule:

θ_new = θ_old − α · [σ(θᵀx) − y] · xⱼ
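A minimal gradient-descent sketch of this update rule, vectorised over a toy dataset (the learning rate, iteration count, and data are invented):

```python
# Gradient descent for logistic regression: theta <- theta - alpha * grad(-LL).
import numpy as np

def sigmoid(z):
    return 1 / (1 + np.exp(-z))

def fit_logistic(X, y, alpha=0.5, n_iters=2000):
    theta = np.zeros(X.shape[1])
    for _ in range(n_iters):
        # Mean of (sigma(theta^T x) - y) * x over all samples.
        grad = X.T @ (sigmoid(X @ theta) - y) / len(y)
        theta -= alpha * grad
    return theta

# Toy usage: first column is the intercept, labels follow the sign of x.
X = np.column_stack([np.ones(4), [-2.0, -1.0, 1.0, 2.0]])
y = np.array([0.0, 0.0, 1.0, 1.0])
print(fit_logistic(X, y))   # theta roughly [0, positive slope]
```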
