Course Code: CSA3002
MACHINE LEARNING ALGORITHMS
Course Type: LPC – 2-2-3
Course Outcomes
At the end of the course, students should be able to:
1. Understand training and testing of datasets using machine learning techniques.
2. Apply optimization and parameter tuning techniques for machine learning algorithms.
3. Apply machine learning models to solve various problems using machine learning algorithms.
4. Apply machine learning algorithms to create models.
Course Objectives
• The objective of the course is to familiarize learners with the
concepts of machine learning algorithms and to develop practical skills
through experiential learning techniques.
Supervised Learning - Regression Analysis
• Regression analysis consists of a set of machine learning methods that
allow us to predict a continuous outcome variable (y) based on the value
of one or more independent variables (x).
• It predicts continuous/real values such as temperature, age, salary,
price, etc.
Terminologies
• Dependent Variable: The main factor in regression analysis that we want to
predict or understand is called the dependent variable. It is also called the
target variable.
• Independent Variable: The factors used to predict the values of the
dependent variable are called independent variables, also known as predictors.
• Outliers: An outlier is an observation with either a very low or a very
high value compared to the other observed values. An outlier can distort the
results, so it should be handled or removed.
• Underfitting and Overfitting: If our algorithm works well on the training
dataset but not on the test dataset, the problem is called overfitting.
If our algorithm does not perform well even on the training dataset, the
problem is called underfitting.
Types of Regression
• Simple Linear Regression
• One dependent variable (interval or ratio)
• One independent variable (interval or ratio or dichotomous)
• Multiple Linear Regression
• One dependent variable (interval or ratio)
• Two or more independent variables (interval or ratio or dichotomous)
• Logistic Regression
• One dependent variable (binary)
• One or more independent variables (interval or ratio or dichotomous)
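As a brief illustration (not from the slides), the three types above can be fitted with scikit-learn; the data below is randomly generated and purely hypothetical.

```python
import numpy as np
from sklearn.linear_model import LinearRegression, LogisticRegression

rng = np.random.default_rng(0)

# Simple linear regression: one continuous predictor, one continuous target
x1 = rng.uniform(0, 10, size=(50, 1))
y_simple = 2.0 + 1.5 * x1[:, 0] + rng.normal(0, 1, 50)
simple = LinearRegression().fit(x1, y_simple)

# Multiple linear regression: two predictors, one continuous target
X2 = rng.uniform(0, 10, size=(50, 2))
y_multiple = 1.0 + 0.5 * X2[:, 0] - 2.0 * X2[:, 1] + rng.normal(0, 1, 50)
multiple = LinearRegression().fit(X2, y_multiple)

# Logistic regression: the same predictors, but a binary (0/1) target
y_binary = (X2[:, 0] - X2[:, 1] + rng.normal(0, 1, 50) > 0).astype(int)
logistic = LogisticRegression().fit(X2, y_binary)

print(simple.coef_, multiple.coef_, logistic.coef_)
```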
Linear Regression
• Linear regression is a statistical regression method which is used for
predictive analysis.
• It is one of the simplest regression algorithms and models the
relationship between continuous variables.
• It is used for solving regression problems in machine learning.
• Linear regression shows the linear relationship between the
independent variable (X-axis) and the dependent variable (Y-axis),
hence the name linear regression.
• If there is only one input variable (x), it is called simple linear
regression; if there is more than one input variable, it is called
multiple linear regression.
• The mathematical equation for linear regression:
Y = a + bX
• Here, Y is the dependent variable (target variable), X is the
independent variable (predictor variable), and a and b are the linear
coefficients (a is the intercept and b is the slope).
Find the linear regression equation for the following two sets of data.
Also find the value of Y for x = 12.
For the fitted line Y = 1.5 + 0.95X, the value of Y for x = 12 is:
Y = 1.5 + (0.95 × 12) = 12.9
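As a quick illustration (not part of the original slides), the sketch below shows how such a line can be fitted by least squares with numpy; the x and y values here are assumptions chosen so that the fit reproduces the line Y = 1.5 + 0.95X used above.

```python
import numpy as np

# Assumed example data (hypothetical); chosen so the fit gives Y = 1.5 + 0.95X
x = np.array([2.0, 4.0, 6.0, 8.0])
y = np.array([3.0, 7.0, 5.0, 10.0])

# Least-squares estimates of slope (b) and intercept (a) in Y = a + bX
b = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
a = y.mean() - b * x.mean()

print(f"Fitted line: Y = {a:.2f} + {b:.2f}X")      # Y = 1.50 + 0.95X
print(f"Prediction at x = 12: {a + b * 12:.1f}")   # 12.9
```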
Problem
a) Find the regression line y=a+bx
b) Use the regression line as a model to estimate the sales of the
company in 2012.
The following data are math aptitude test scores and statistics scores
for five students.
Maths:      95  85  80  70  60
Statistics: 85  95  70  65  70
1. What linear regression equation best predicts statistics
performance, based on math aptitude scores?
2. If a student made a 75 on the math aptitude test, what grade
would we expect her to make in statistics?
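A short numpy sketch (not from the slides) showing how both questions can be answered with the least-squares formulas; the scores are taken from the table above.

```python
import numpy as np

# Math aptitude (x) and statistics (y) scores for the five students
x = np.array([95.0, 85.0, 80.0, 70.0, 60.0])
y = np.array([85.0, 95.0, 70.0, 65.0, 70.0])

# Least-squares slope (b) and intercept (a) for y = a + b*x
b = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
a = y.mean() - b * x.mean()
print(f"Regression line: y = {a:.2f} + {b:.3f}x")    # roughly y = 26.78 + 0.644x

# Expected statistics grade for a math score of 75
print(f"Prediction for x = 75: {a + b * 75:.1f}")    # roughly 75.1
```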
Logistic Regression
• Logistic regression is one of the most popular machine learning
algorithms, and it comes under the supervised learning technique. It is
used for predicting a categorical dependent variable using a given set
of independent variables.
• Logistic regression predicts the output of a categorical dependent
variable. Therefore, the outcome must be a categorical or discrete
value: Yes or No, 0 or 1, True or False, etc. However, instead of
giving the exact values 0 and 1, it gives probabilistic values that
lie between 0 and 1.
• Logistic regression is similar to linear regression except in how the
two are used: linear regression is used for solving regression
problems, whereas logistic regression is used for solving
classification problems.
• In logistic regression, instead of fitting a regression line, we fit
an "S"-shaped logistic function, whose output is bounded by the two
limiting values 0 and 1.
• Logistic regression is a significant machine learning algorithm
because it can provide probabilities and classify new data using both
continuous and discrete datasets.
Logistic Function (Sigmoid Function):
• The sigmoid function is a mathematical function used to map the
predicted values to probabilities.
• It maps any real value into another value within a range of 0 and 1.
• The output of logistic regression must lie between 0 and 1 and cannot
go beyond this limit, so it forms an "S"-shaped curve. This S-shaped
curve is called the sigmoid function or the logistic function.
• In logistic regression, we use the concept of a threshold value, which
separates the predictions of 0 and 1: values above the threshold are
mapped to 1, and values below the threshold are mapped to 0.
Applying the linear function z = a + bX inside the sigmoid function
σ(z) = 1 / (1 + e^(-z)) produces the S-shaped curve.
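A minimal numpy sketch (not from the slides) of this idea; the coefficients a and b below are made up for illustration.

```python
import numpy as np

def sigmoid(z):
    """Logistic (sigmoid) function: maps any real value into (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))

# Hypothetical linear coefficients (a = intercept, b = slope)
a, b = -4.0, 0.8

x = np.linspace(-5, 15, 9)
z = a + b * x                        # linear function, as in Y = a + bX
p = sigmoid(z)                       # squashed into probabilities in (0, 1)

for xi, pi in zip(x, p):
    label = 1 if pi >= 0.5 else 0    # threshold at 0.5
    print(f"x = {xi:5.1f}  ->  p = {pi:.3f}  ->  class {label}")
```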
Example: Email Classification
• Imagine you're working on an email system, and you want to
automatically classify emails as either "spam" or "not spam" (ham).
Logistic regression can help you build a model that assigns a
probability of an email being spam or not.
• Logistic Regression Concept:
• Logistic regression uses the logistic function (also called the sigmoid
function) to map any input into a value between 0 and 1. This value
represents the probability of an instance belonging to the positive
class (in our case, spam).
• Applying Logistic Regression to the Example:
• Here's how you might apply logistic regression to classify emails:
• Data Preparation: Collect a dataset of emails, where each email has
features like the number of words, the presence of certain keywords,
etc. Each email is labeled as either "spam" (1) or "not spam" (0).
• Feature Scaling: Normalize or scale the features so that they're on a
similar scale. This can improve the convergence of the algorithm.
• Model Training: Use logistic regression to find the parameters
(coefficients) of your model that maximize the likelihood of the
observed data. This involves finding the best-fitting sigmoid curve.
• Prediction: Once the model is trained, you can input the features of a
new email and calculate z, the linear combination of those features.
Then plug z into the logistic function to get the probability of the
email being spam.
• Thresholding: Choose a threshold (commonly 0.5) above which you
classify the email as spam, and below which you classify it as not spam,
as illustrated in the sketch after this list.
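A minimal scikit-learn sketch of these steps, assuming two made-up features (word count and number of suspicious keywords) and tiny made-up training data; it is an illustration of the workflow, not a definitive implementation.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Data preparation: each row = [word_count, suspicious_keyword_count]
# Labels: 1 = spam, 0 = not spam (ham). All values are made up.
X_train = np.array([
    [120,  8], [300, 15], [ 90,  6], [250, 12],   # spam-like emails
    [400,  0], [150,  1], [ 80,  0], [500,  2],   # ham-like emails
])
y_train = np.array([1, 1, 1, 1, 0, 0, 0, 0])

# Feature scaling + model training in one pipeline: the scaler normalizes the
# features, and logistic regression fits coefficients that maximize the
# likelihood of the observed labels.
model = make_pipeline(StandardScaler(), LogisticRegression())
model.fit(X_train, y_train)

# Prediction: probability that a new email is spam, then thresholding at 0.5
new_email = np.array([[200, 10]])
p_spam = model.predict_proba(new_email)[0, 1]
label = "spam" if p_spam >= 0.5 else "not spam"
print(f"P(spam) = {p_spam:.3f} -> classified as {label}")
```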
• Conclusion:
• Logistic regression is a powerful tool for binary classification tasks like
spam detection. It is easy to interpret because it provides the probability
that an instance belongs to a certain class. The concept of mapping a
linear combination of features to a probability using the sigmoid function
is at the core of logistic regression. In real-world applications, logistic
regression is used in a wide range of areas, from medical diagnosis to
sentiment analysis in natural language processing.