Machine Learning
Linear Regression Intuition
Mostafa S. Ibrahim
Teaching, Training and Coaching for more than a decade!
Artificial Intelligence & Computer Vision Researcher
PhD from Simon Fraser University - Canada
Bachelor / MSc from Cairo University - Egypt
Ex-(Software Engineer / ICPC World Finalist)
© 2023 All rights reserved.
Please do not reproduce or redistribute this work without permission from the author
Recall
● Supervised learning
○ We are given both the input (X) and its output (Y)
○ We want to be able to map X to Y (predict Y given the input X)
● Regression
○ The Y that we predict is a real-valued output (e.g. 0.7)
○ For example, we can predict the price of a property based on the features and/or location
○ Forget about the English meaning of "regression"; the name is historical
● Classification
○ The Y that we predict is a discrete-valued output (e.g. 3)
○ Example: is this image a cat, dog, or cow?
House Price Prediction
● One of the most common ML tasks is predicting the price of a property
● For simplicity, we will assume we have only a single factor: the house size
● Goal: Learn and Predict
● 1) Learn
○ We collect data from 'N' pairs of examples (input being the size, output being the price)
○ Learn the patterns/associations in these data
● 2) Predict
○ Given a new query of a ‘house size’, predict the ‘price’
House Price Prediction: Dataset
● N = 8 (8 training samples)
● Size is X (input)
● Price is Y (output)
● The i-th example is referred to as x(i), y(i)

ID  Size in m²  Price (target)
0   100         250,000
1   130         340,000
2   200         550,000
3   250         700,000
4   270         760,000
5   300         850,000
6   325         925,000
7   400         1,150,000
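For later experimentation, the table above can be written as two parallel Python lists (the variable names are my own, not from the slides):

```python
# House-price dataset from the table above
sizes = [100, 130, 200, 250, 270, 300, 325, 400]        # X: size in m^2
prices = [250_000, 340_000, 550_000, 700_000,           # Y: price (target)
          760_000, 850_000, 925_000, 1_150_000]

N = len(sizes)                    # 8 training samples
x_0, y_0 = sizes[0], prices[0]    # the 0-th example: x(0) = 100, y(0) = 250,000
```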
Visualization
● Visualization is critical for success
● Can you guess the price of a house of size 350 m²?
● How can we model this data such that, in the future, we can make a prediction automatically?
Modeling the data as a line
● The data seems to come from a linear equation (mx + c)
● Use 2 points to compute these 2 parameters (weights)
● With some math:
y = 3 * x - 50
● Given x = 350
y = 1000 (thousands)
● We just learned from data
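The two-point calculation above can be checked in a few lines (my choice of points; with noise-free data, any two points give the same line):

```python
# Fit y = m*x + c exactly through two points (prices in thousands, as on the slide)
x1, y1 = 100, 250
x2, y2 = 200, 550

m = (y2 - y1) / (x2 - x1)   # slope: (550 - 250) / (200 - 100) = 3.0
c = y1 - m * x1             # intercept: 250 - 3 * 100 = -50.0

price_350 = m * 350 + c     # prediction for a 350 m^2 house
print(price_350)            # 1000.0 (thousands)
```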
Real-life Data
● Sadly, real-life data has variance (e.g. houses with the same specs may sell for 250k ± 10k)
● From a numbers perspective, we can think of it as some noise added to our data
● Observe: the x and y ranges are now close to the [0, 1] range
How to model such noisy data?
● Our intuition is that this data really came from some line
● How can we find one good line that fits the data as closely as possible?
○ "Good" is a vague word!
○ How do we define the criteria?
Which line is a better fit?
● We have 6 data points (e.g. size vs price)
● 2 lines are proposed here
● Which one is a better fit?
● How did you decide so?
Criteria: The closest!
● We want the line that is closest to the points overall!
● How can we measure how close a line is to a set of data points?
● We need to use some distance metric between the ground truth and the
prediction
○ Assume our dataset has point (size=200, price = 350,000)
○ Using size=200 in a line gives us price = 350,427
○ We need to compute the distance between 350,000 and 350,427
○ There is an error from the difference between these 2 values
■ Target (ground truth) vs prediction (of our model, the line)
Distance metric between 2 values
● Linear regression typically uses
the squared error cost function
○ A cost function returns a numerical
value based on the error
● The squared error function
computes:
(target − prediction)²
○ Error: 6.75 - 4.5 = 2.25
■ Aka residual
○ Squared error = 2.25 × 2.25 = 5.0625
Mean Squared Error (MSE)
● Now, we know how to compute the error of a single point
● What about a dataset of N points?
● Simply sum the cost and average it
● Why average? To give the average squared error per example
○ Yi is the ground truth
○ We can take the square root to get the (positive) average error in the original units
● Without the 1/N, it is called the sum of squared errors (SSE)
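The definition above is a one-liner in code; a minimal sketch (function names are my own):

```python
def mse(targets, predictions):
    """Mean squared error: average of (target - prediction)^2 over the dataset."""
    n = len(targets)
    return sum((t - p) ** 2 for t, p in zip(targets, predictions)) / n

def sse(targets, predictions):
    """Sum of squared errors: the same sum, without the 1/N averaging."""
    return sum((t - p) ** 2 for t, p in zip(targets, predictions))

# Single-point example from the previous slide: residual 6.75 - 4.5 = 2.25
print(mse([6.75], [4.5]))   # 5.0625
```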
Can we use other metrics?
● Can we use other metrics such as the absolute error |T-P|? Yes
● There are many reasons for preferring squared error (easy to optimize,
differentiable everywhere, has a closed-form solution, works in practice)
● However, one important reason is related to gaussian noise
○ You can find more mathematical details in famous books, such as Bishop’s book
○ The web also has a lot of details
○ Gaussian also plays a vital role in math and statistics
● One major drawback: sensitivity to outliers
● We may return to this concern later on
○ While rare, interviewers sometimes probe the deeper differences between these metrics
○ Observe: the normal distribution formula is a function in Euclidean distance
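The outlier sensitivity is easy to see with a toy set of residuals (the numbers here are my own illustration):

```python
residuals = [1.0, 1.0, 1.0, 10.0]   # three small errors plus one outlier

squares = [r ** 2 for r in residuals]                 # [1, 1, 1, 100]
outlier_share_sq = squares[-1] / sum(squares)         # ~0.97: outlier dominates
outlier_share_abs = residuals[-1] / sum(residuals)    # ~0.77 under absolute error

# Squaring inflates the outlier's influence on the total cost
print(outlier_share_sq > outlier_share_abs)   # True
```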
Back to the 2 lines
● Now we know a formula that can assign the total error/cost of a line
○ MSE: target vs prediction
● Then we can simply choose the
line that has the smallest error
○ E.g. 5.0 (red) vs 27.9 (blue)
● The key question now is:
Given the dataset, how can
we find the parameters (m, c)
that minimize the cost function?
Code snippet
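The snippet itself is not reproduced here; a plausible sketch of what it computes, scoring two candidate lines by MSE (the data and line parameters below are made up for illustration):

```python
def mse_of_line(m, c, xs, ys):
    """MSE of the line y = m*x + c over the dataset (xs, ys)."""
    return sum((y - (m * x + c)) ** 2 for x, y in zip(xs, ys)) / len(xs)

# Toy noisy data roughly following y = 3x - 50
xs = [1.0, 2.0, 3.0, 4.0]
ys = [-47.2, -44.1, -40.8, -38.1]

better = mse_of_line(3.0, -50.0, xs, ys)   # line close to the data
worse = mse_of_line(1.0, -45.0, xs, ys)    # line far from the data
print(better < worse)                      # True: pick the line with smaller MSE
```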
Question (src)
● We have a line and 2 datasets. Which data has the higher MSE?
Question!
● Assume we learned the below line from the data (size vs price in thousands)
● Now a new query is: what is the price for a house of size 600?
Question!
● Assume all our data have the same x (e.g. multiple prices of the same house
specs)
○ E.g. (5, 3), (5, 7), (5, 50)
● What is an optimal line representing the data?
● Any line passing through the point (x, average of the ys)
○ Here: x = 5, y = 20
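This can be sanity-checked numerically: among all constant predictions at x = 5, the mean of the ys gives the smallest squared error (the check below is my own):

```python
ys = [3, 7, 50]                  # all three points share x = 5
mean_y = sum(ys) / len(ys)       # 20.0

def cost(pred):
    """Sum of squared errors when predicting `pred` for every point at x = 5."""
    return sum((y - pred) ** 2 for y in ys)

# The mean beats nearby candidate predictions
print(cost(mean_y) < cost(19.0), cost(mean_y) < cost(21.0))   # True True
```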
Relevant Materials
● Prof Andrew Ng: video1, video2
● Hesham Asem (Arabic): video1, video
“Acquire knowledge and impart it to the people.”
“Seek knowledge from the Cradle to the Grave.”