0% found this document useful (0 votes)

16 views3 pages

Lecture 4

Lecture 4 covers linear regression, including data loading, preprocessing, feature selection, model creation, and evaluation metrics such as MSE, RMSE, and R2. It discusses supervised learning, point estimates, and methods like gradient descent and ordinary least squares (OLS) for model fitting. The lecture also emphasizes the importance of standardizing datasets and cross-validation to prevent memorization in model training.

Uploaded by

Cảnh Nguyễn Hữu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views3 pages

Lecture 4

Uploaded by

Cảnh Nguyễn Hữu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 3

Lecture 4

1. Linear Regression:
a. How to load data
b. Pre-process / clean data
c. How to choose features
d. Create model
e. Model evaluation metrics
i. MSE
ii. RMSE
iii. R2
iv. R2 adj (adjuster)
v. Value of the coefficients
vi. Se of the coefficients
vii. T-statistic
viii. P – value
ix. Null hypothesis
2. Supervised:
X  feature vector (x1, x2, …)
Y  label

Use X to predict Y

f(X) to predict

f(X) = w0 + w1 * x2
True regression functions are never linear

Model:
Y = B0 + B1 * X + E
In Python, use .fit() function to predict the model (substitute X to find
function f(x)

Y = B0 + B1 * X1
- Analytical: Close form
Error = ∑ ❑ Lowest value
o Residual sum of squares (RSS) = e1^2 + e2^2 + … + en^2
o Least square approach
- Numerical:
o Gradient Descent (check below)
 Linear time
 Local Minima
 Best weights / Coefficients

3. Point Estimates:
- Sales = 10000 + (1.6) * (TV) + (2.9) * (Radio)
4. The linear regression is computed as (X'X)^-1 X'Y
5. ChatGPT: too much data, billions
6. Gradient Descent Approach:
- w(0) = initial value (guess)
- w(1) = w(0) – (Learning Rate) * d(error)/dw
- Point Estimate – Best Coefficients
- Standardize the dataset
o N(0 , 1)
7. Exact Solution (Closed Form):
- Point Estimate
o Standard Error
8. Root mean square error (RMSE)
9. MAE

----------------------

Ex: House price prediction:

X1: Size

X2: Bathroom

X3: Bedroom

Mean and variance of the price of the house (predicted from x1, x2, x3)

Ex: Temperature

With temperatures, we may predict the mean, but it’s hard to predict the
variance

10. Predicted mean  Actual mean

11. Predicted var  Actual var
% Variance explained

If I show the entire dataset to the model, and test using the same data

 Memorization
 To prevent, you cross-data

Web for datasets: Auto MPG - UCI Machine Learning Repository

12. SGD:
- Evaluation Metrics:
o 1. Score (R2) value: R2 = 1 - RSS/TSS
o RSS = (y1 – y1’)2 + (y2 – y2’)2 + … + (yn – yn’)2
o TSS = (y1 – y’)2 + (y2 – y’)2 + … + (yn – y’)2
 0<x<1
- Approximate approach
- Faster
- No guarantee of best solution
- Doesn’t give many evaluation metrics
- A good idea to standardize the data
13. OLS:
- Closed form
- Exact solution
- Cubic time complexity
- Statsmodels
- Not require to standardize the data
- Diagnostics:
o Point Estimate:
 MedHouseValue: 0.0163 + 0.4416 * MedInc + …
o T-statistic = point estimate / std.error  as large as possible

2EL1730 ML Lecture02 Linear and Logistic Regression
No ratings yet
2EL1730 ML Lecture02 Linear and Logistic Regression
65 pages
SumitBurnwal ML
No ratings yet
SumitBurnwal ML
13 pages
Linear Regression for House Pricing
No ratings yet
Linear Regression for House Pricing
113 pages
Linear Regression Techniques
No ratings yet
Linear Regression Techniques
25 pages
Linear Regression
No ratings yet
Linear Regression
130 pages
Lecture3 Supervised Learning I
No ratings yet
Lecture3 Supervised Learning I
84 pages
ML Cheatsheet PDF
100% (1)
ML Cheatsheet PDF
211 pages
ML Cheatsheet for Beginners
100% (1)
ML Cheatsheet for Beginners
211 pages
Unit 5
No ratings yet
Unit 5
18 pages
MECH4403 LR Week04
No ratings yet
MECH4403 LR Week04
25 pages
GradientDescent-Regression Slides
No ratings yet
GradientDescent-Regression Slides
26 pages
Unit II - Supervised Machine Learning Techniques
No ratings yet
Unit II - Supervised Machine Learning Techniques
131 pages
Linear Regression
No ratings yet
Linear Regression
15 pages
6 - Classification and Regression Tasks
No ratings yet
6 - Classification and Regression Tasks
115 pages
ML Unit
No ratings yet
ML Unit
23 pages
AI Lec 3
No ratings yet
AI Lec 3
36 pages
Python Data Analysis Guide
No ratings yet
Python Data Analysis Guide
171 pages
MLDAP Module2
No ratings yet
MLDAP Module2
32 pages
Lec4 Oct12 2022 PracticalNotes LinearRegression
No ratings yet
Lec4 Oct12 2022 PracticalNotes LinearRegression
34 pages
Closed Form Linear Regression
No ratings yet
Closed Form Linear Regression
7 pages
Linear Regression with Boston Housing Data
No ratings yet
Linear Regression with Boston Housing Data
14 pages
Lecture Notes 5 Linear Regression
No ratings yet
Lecture Notes 5 Linear Regression
11 pages
Linear Regression
No ratings yet
Linear Regression
38 pages
AI & ML Lab Manual - LDCE
No ratings yet
AI & ML Lab Manual - LDCE
70 pages
AI Lab7
No ratings yet
AI Lab7
13 pages
Lecture 02
No ratings yet
Lecture 02
43 pages
An Introduction To Stadistical Learning-129-140-1-8
No ratings yet
An Introduction To Stadistical Learning-129-140-1-8
8 pages
Unit-4 303-05 - Fundamentals of Machine Learning
No ratings yet
Unit-4 303-05 - Fundamentals of Machine Learning
17 pages
Boston Housing Price Prediction
No ratings yet
Boston Housing Price Prediction
33 pages
Untitled Document
No ratings yet
Untitled Document
6 pages
Week 4 Linear Regression
No ratings yet
Week 4 Linear Regression
38 pages
Linear Regression Mastry
No ratings yet
Linear Regression Mastry
6 pages
CL IV Manual
No ratings yet
CL IV Manual
108 pages
Lab Manual 05
No ratings yet
Lab Manual 05
13 pages
Linear Regression
No ratings yet
Linear Regression
5 pages
Essentials of Linear Regression in Python
No ratings yet
Essentials of Linear Regression in Python
23 pages
Machine Learning With Python
No ratings yet
Machine Learning With Python
30 pages
DSBDL - Write - Ups - 4 To 7
No ratings yet
DSBDL - Write - Ups - 4 To 7
11 pages
Linear Regression
No ratings yet
Linear Regression
19 pages
Linear Regression Guide for Data Analysts
No ratings yet
Linear Regression Guide for Data Analysts
16 pages
Linear Regression - Everything You Need To Know About Linear Regression
No ratings yet
Linear Regression - Everything You Need To Know About Linear Regression
17 pages
1.1 ID5059 1.2 Tom Kelsey - Jan 2021: February 15, 2021
No ratings yet
1.1 ID5059 1.2 Tom Kelsey - Jan 2021: February 15, 2021
43 pages
ML Week 4
No ratings yet
ML Week 4
5 pages
Revised-L3-Linear Regression
No ratings yet
Revised-L3-Linear Regression
41 pages
Training Models
No ratings yet
Training Models
13 pages
Linear Regression Concepts - A4
No ratings yet
Linear Regression Concepts - A4
6 pages
ML LN 3
No ratings yet
ML LN 3
44 pages
Linear Regression
No ratings yet
Linear Regression
91 pages
CS550 Lec2
No ratings yet
CS550 Lec2
24 pages
ML Section2
No ratings yet
ML Section2
36 pages
Practical 5
No ratings yet
Practical 5
8 pages
Everything You Need To Know About Linear Regression
No ratings yet
Everything You Need To Know About Linear Regression
19 pages
ML - UNIT 4 - Material - SVCK - CSE
No ratings yet
ML - UNIT 4 - Material - SVCK - CSE
19 pages
Regression
No ratings yet
Regression
25 pages
Lecture 3 - Linear Regression Imran 20022025 092939am
No ratings yet
Lecture 3 - Linear Regression Imran 20022025 092939am
46 pages
Notes 04
No ratings yet
Notes 04
50 pages
A Practical Approach To Linear Regression in Machine Learning - by Ashwin Raj - Towards Data Science
No ratings yet
A Practical Approach To Linear Regression in Machine Learning - by Ashwin Raj - Towards Data Science
20 pages
F 2 PDF
No ratings yet
F 2 PDF
9 pages
Database Lab: EER Diagrams
No ratings yet
Database Lab: EER Diagrams
9 pages
Poweroil: Power Gem Ep00, Ep0, Ep1 & Ep2 Extreme Pressure Greases
No ratings yet
Poweroil: Power Gem Ep00, Ep0, Ep1 & Ep2 Extreme Pressure Greases
1 page
Test 1 PDF
No ratings yet
Test 1 PDF
6 pages
Chromotherapy
100% (1)
Chromotherapy
10 pages
F1 Forecast Tech 3
No ratings yet
F1 Forecast Tech 3
3 pages
PM - I CIA
No ratings yet
PM - I CIA
5 pages
Ge 7 Morph Report
No ratings yet
Ge 7 Morph Report
19 pages
Preschool Daily Schedule
No ratings yet
Preschool Daily Schedule
1 page
Aircraft Dji Enterprise Mavic 3 Thermal
No ratings yet
Aircraft Dji Enterprise Mavic 3 Thermal
19 pages
Action Research in Education Innovation
No ratings yet
Action Research in Education Innovation
80 pages
Pers Soc Psychol Schultz
No ratings yet
Pers Soc Psychol Schultz
13 pages
WLP Q1 G11-Philosophy
No ratings yet
WLP Q1 G11-Philosophy
8 pages
Aquatic Plant Presentation
No ratings yet
Aquatic Plant Presentation
17 pages
Tank Flush Simulation Tutorial
No ratings yet
Tank Flush Simulation Tutorial
23 pages
UA5000 LMG PVMD Operations Guide
No ratings yet
UA5000 LMG PVMD Operations Guide
22 pages
Microspectrofluorimetry of Fluorescent Dyes and Brighteners On Single Textile Fibres
No ratings yet
Microspectrofluorimetry of Fluorescent Dyes and Brighteners On Single Textile Fibres
18 pages
Construction Blueprint Details
100% (1)
Construction Blueprint Details
2 pages
Product Manual 36693 (Revision D, 5/2015) : PG Base Assemblies
No ratings yet
Product Manual 36693 (Revision D, 5/2015) : PG Base Assemblies
10 pages
The Shiphandlers Guide
No ratings yet
The Shiphandlers Guide
143 pages
Java 8 Features
No ratings yet
Java 8 Features
42 pages
Compal Electronics Engineering Document
75% (4)
Compal Electronics Engineering Document
1 page
1) Segmentación: Las Bases de Segmentación Utilizada Por Claro en Sus
No ratings yet
1) Segmentación: Las Bases de Segmentación Utilizada Por Claro en Sus
5 pages
Design & Modification On Automatic and Pneumatic Jack System
No ratings yet
Design & Modification On Automatic and Pneumatic Jack System
4 pages
JCB ENGLISH Fault Finding COMPLETE PDF
97% (29)
JCB ENGLISH Fault Finding COMPLETE PDF
129 pages
Assessing Wildfire Vulnerability of Vegetated Serpentine Soils in The Balkan Peninsula
No ratings yet
Assessing Wildfire Vulnerability of Vegetated Serpentine Soils in The Balkan Peninsula
13 pages
Cos 202
No ratings yet
Cos 202
28 pages
Wycliffe's Work in South Sudan
No ratings yet
Wycliffe's Work in South Sudan
5 pages
Focus-On Opta en
No ratings yet
Focus-On Opta en
3 pages
Engine TCM
No ratings yet
Engine TCM
165 pages

Lecture 4

Uploaded by

Lecture 4

Uploaded by

Lecture 4

Ex: House price prediction:

10. Predicted mean  Actual mean

Web for datasets: Auto MPG - UCI Machine Learning Repository

You might also like