Machine Learning
Linear Regression
Quan Minh Phan & Ngoc Hoang Luong
University of Information Technology, Vietnam National University Ho Chi Minh City
October 7, 2022
New Packages
numpy → used very frequently in ML (Python)
Link: https://numpy.org/doc/stable/user/index.html#user
>> import numpy as np
matplotlib → for visualization
Link: https://matplotlib.org/stable/tutorials/index.html
>> import matplotlib.pyplot as plt
Generate A Regression Problem
>> from sklearn.datasets import make_regression
>> X, y = make_regression(n_samples=500, n_features=1,
   n_informative=1, noise=25, random_state=42)
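A quick sanity check (not on the original slide): with these arguments, make_regression returns a feature matrix of shape (500, 1) and a target vector of shape (500,).

>> X.shape, y.shape
>> ((500, 1), (500,))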
Data Visualization
>> plt.scatter(X, y, facecolor='tab:blue', edgecolor='white', s=70)
   plt.xlabel('X')
   plt.ylabel('y')
   plt.show()
Recall (Linear Regression)
Figure: The general concept of Linear Regression
Minimizing cost function with gradient descent
Cost function (Squared Error):

J(w) = \frac{1}{2} \sum_{i} \left( y^{(i)} - \hat{y}^{(i)} \right)^2    (1)

Update the weights:

w_{t+1} := w_t + \Delta w    (2)

\Delta w = -\eta \nabla J(w)    (3)

\frac{\partial J}{\partial w_j} = -\sum_{i} \left( y^{(i)} - \hat{y}^{(i)} \right) x_j^{(i)}    (4)

\Delta w_j = -\eta \frac{\partial J}{\partial w_j} = \eta \sum_{i} \left( y^{(i)} - \hat{y}^{(i)} \right) x_j^{(i)}    (5)
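For illustration (not on the original slide), one batch update implementing equations (2)-(5) can be sketched in NumPy. Here w, eta, X, and y are assumed to be defined already, with w[0] holding the intercept and w[1:] the coefficients:

>> y_hat = np.dot(X, w[1:]) + w[0]      # predictions ŷ
   diff = y - y_hat                     # (y − ŷ)
   w[0] += eta * np.sum(diff)           # intercept update
   w[1:] += eta * np.dot(X.T, diff)     # coefficient update, equation (5)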
Minimizing cost function with gradient descent (cont.)
w_j := \begin{cases} w_j + \eta \sum_{i} \left( y^{(i)} - \hat{y}^{(i)} \right) & j = 0 \\ w_j + \eta \sum_{i} \left( y^{(i)} - \hat{y}^{(i)} \right) x_j^{(i)} & j \in [1, \dots, n] \end{cases}
Pseudocode of the Training Process
Algorithm 1 Gradient Descent
1: Initialize the weights, w
2: while the stopping criterion is not satisfied do
3:     Compute the output values, ŷ
4:     Compute the difference between y and ŷ
5:     Update the weights:
6:         Update the intercept
7:         Update the coefficients
8: end while
Components
Hyperparameters
eta (float): the learning rate
max_iter (int): the maximum number of iterations
random_state (int): the random seed for weight initialization
Parameters
w (list/array): the weight values
costs (list/array): the cost values recorded over the iterations
Methods
fit(X, y)
predict(X)
Implement (code from scratch)
class LinearRegression_GD:
    def __init__(self, eta=0.001, max_iter=20, random_state=42):
        self.eta = eta                    # learning rate
        self.max_iter = max_iter          # maximum number of iterations
        self.random_state = random_state  # seed for weight initialization
        self.w = None                     # weights: w[0] is the intercept, w[1:] the coefficients
        self.costs = []                   # cost value recorded at each iteration

    def predict(self, X):
        return np.dot(X, self.w[1:]) + self.w[0]
’fit’ method
def fit(self, X, y):
    rgen = np.random.RandomState(self.random_state)
    # small random initial weights: 1 intercept + 1 weight per feature
    self.w = rgen.normal(loc=0.0, scale=0.01, size=1 + X.shape[1])
    self.costs = []
    for n_iters in range(self.max_iter):
        y_pred = self.predict(X)
        diff = y - y_pred
        self.w[0] += self.eta * np.sum(diff)        # update the intercept
        for j in range(X.shape[1]):                 # j = 0, 1, ..., X.shape[1] - 1
            delta = 0.0
            for i in range(X.shape[0]):             # i = 0, 1, ..., X.shape[0] - 1
                delta += self.eta * diff[i] * X[i][j]
            self.w[j + 1] += delta                  # update the j-th coefficient
        cost = np.sum(diff ** 2) / 2
        self.costs.append(cost)
’fit’ method (2)
def fit(self, X, y):
    rgen = np.random.RandomState(self.random_state)
    self.w = rgen.normal(loc=0.0, scale=0.01, size=1 + X.shape[1])
    self.costs = []
    for n_iters in range(self.max_iter):
        y_pred = self.predict(X)
        diff = y - y_pred
        self.w[0] += self.eta * np.sum(diff)        # update the intercept
        self.w[1:] += self.eta * np.dot(X.T, diff)  # update all coefficients at once
        cost = np.sum(diff ** 2) / 2
        self.costs.append(cost)
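Note that np.dot(X.T, diff) computes \sum_i (y^{(i)} - \hat{y}^{(i)}) x_j^{(i)} for every feature j at once, so the two nested loops of the previous version collapse into a single vectorized update that yields the same weights.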
Train Model
Gradient Descent
>> reg_GD = LinearRegression_GD(eta=0.001, max_iter=20, random_state=42)
   reg_GD.fit(X, y)
Visualize the trend in the cost values (Gradient Descent)
>> plt.plot(range(1, len(reg_GD.costs) + 1), reg_GD.costs)
   plt.xlabel('Epochs')
   plt.ylabel('Cost')
   plt.title('Gradient Descent')
   plt.show()
Visualize on Data
>> plt.scatter(X, y, facecolor='tab:blue', edgecolor='white', s=70)
   plt.plot(X, reg_GD.predict(X), color='green', lw=6, label='Gradient Descent')
   plt.xlabel('X')
   plt.ylabel('y')
   plt.legend()
   plt.show()
Weight values
>> w_GD = reg_GD.w
   w_GD
>> [-0.9794002, 63.18592509]
The first value is the fitted intercept; the second is the coefficient.
Implement (package)
Stochastic Gradient Descent
from sklearn.linear_model import SGDRegressor
Hyperparameters: eta0, max_iter, random_state
Parameters: intercept_, coef_
Methods: fit(X, y), predict(X)
Implement (package) (cont.)
Normal Equation
from sklearn.linear_model import LinearRegression
Parameters: intercept_, coef_
Methods: fit(X, y), predict(X)
Differences
Gradient Descent
w := w + \Delta w, \qquad \Delta w = \eta \sum_{i} \left( y^{(i)} - \hat{y}^{(i)} \right) x^{(i)}
Stochastic Gradient Descent
w := w + \Delta w, \qquad \Delta w = \eta \left( y^{(i)} - \hat{y}^{(i)} \right) x^{(i)}
Normal Equation
w = (X^T X)^{-1} X^T y
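For illustration (not on the original slides), the normal-equation solution can be computed directly in NumPy. X_b and w_ne below are hypothetical names; a column of ones is prepended to X so the intercept is estimated together with the coefficient:

>> X_b = np.hstack([np.ones((X.shape[0], 1)), X])   # add a bias column of ones
   w_ne = np.linalg.solve(X_b.T @ X_b, X_b.T @ y)   # solve (X^T X) w = X^T y
   w_ne                                             # [intercept, coefficient]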
Practice (cont.)
Stochastic Gradient Descent
>> from sklearn.linear_model import SGDRegressor
>> reg_SGD = SGDRegressor(eta0=0.001, max_iter=20, random_state=42, learning_rate='constant')
   reg_SGD.fit(X, y)
Normal Equation
>> from sklearn.linear_model import LinearRegression
>> reg_NE = LinearRegression()
   reg_NE.fit(X, y)
Weight Values Comparisons
Gradient Descent (ours)
>> w_GD = reg_GD.w
   w_GD
>> [-0.9794002, 63.18592509]
Stochastic Gradient Descent
>> w_SGD = np.append(reg_SGD.intercept_, reg_SGD.coef_)
   w_SGD
>> [-1.02681553, 63.08630288]
Normal Equation
>> w_NE = np.append(reg_NE.intercept_, reg_NE.coef_)
   w_NE
>> [-0.97941333, 63.18605572]
All three methods produce nearly the same weights: the from-scratch gradient-descent result is very close to the normal-equation solution, while SGD deviates slightly after 20 iterations.
Visualize on Data (all)
>> plt.scatter(X, y, facecolor='tab:blue', edgecolor='white', s=70)
   plt.plot(X, reg_GD.predict(X), color='green', lw=6, label='Gradient Descent')
   plt.plot(X, reg_SGD.predict(X), color='black', lw=4, label='Stochastic Gradient Descent')
   plt.plot(X, reg_NE.predict(X), color='orange', lw=2, label='Normal Equation')
   plt.xlabel('X')
   plt.ylabel('y')
   plt.legend()
   plt.show()
Performance Evaluation
Mean Absolute Error (MAE)

\mathrm{MAE}(y, \hat{y}) = \frac{1}{n} \sum_{i} \left| y^{(i)} - \hat{y}^{(i)} \right|    (6)

Mean Squared Error (MSE)

\mathrm{MSE}(y, \hat{y}) = \frac{1}{n} \sum_{i} \left( y^{(i)} - \hat{y}^{(i)} \right)^2    (7)

R-Squared (R^2)

R^2(y, \hat{y}) = 1 - \frac{\sum_{i} \left( y^{(i)} - \hat{y}^{(i)} \right)^2}{\sum_{i} \left( y^{(i)} - \bar{y} \right)^2}    (8)
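As a cross-check (not on the original slides), the three metrics can also be computed directly from equations (6)-(8) in NumPy; the function names below are hypothetical:

>> def mae(y, y_hat):
       return np.mean(np.abs(y - y_hat))              # equation (6)
   def mse(y, y_hat):
       return np.mean((y - y_hat) ** 2)               # equation (7)
   def r2(y, y_hat):
       return 1 - np.sum((y - y_hat) ** 2) / np.sum((y - np.mean(y)) ** 2)   # equation (8)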
Performance Evaluation
>> from sklearn.metrics import mean_absolute_error as MAE
   from sklearn.metrics import mean_squared_error as MSE
   from sklearn.metrics import r2_score as R2
>> y_pred_GD = reg_GD.predict(X)
>> y_pred_SGD = reg_SGD.predict(X)
>> y_pred_NE = reg_NE.predict(X)
Performance Evaluation (cont.)
Mean Absolute Error
>> print('MAE of GD:', round(MAE(y, y_pred_GD), 6))
   print('MAE of SGD:', round(MAE(y, y_pred_SGD), 6))
   print('MAE of NE:', round(MAE(y, y_pred_NE), 6))
Mean Squared Error
>> print('MSE of GD:', round(MSE(y, y_pred_GD), 6))
   print('MSE of SGD:', round(MSE(y, y_pred_SGD), 6))
   print('MSE of NE:', round(MSE(y, y_pred_NE), 6))
R^2 score
>> print('R2 of GD:', round(R2(y, y_pred_GD), 6))
   print('R2 of SGD:', round(R2(y, y_pred_SGD), 6))
   print('R2 of NE:', round(R2(y, y_pred_NE), 6))
Run Gradient Descent with learning rate eta = 0.005
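A minimal sketch of re-running the from-scratch trainer with this larger learning rate, assuming the class and data defined earlier (reg_GD_2 is a hypothetical name):

>> reg_GD_2 = LinearRegression_GD(eta=0.005, max_iter=20, random_state=42)
   reg_GD_2.fit(X, y)
   plt.plot(range(1, len(reg_GD_2.costs) + 1), reg_GD_2.costs)
   plt.xlabel('Epochs')
   plt.ylabel('Cost')
   plt.title('Gradient Descent (eta = 0.005)')
   plt.show()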
Polynomial Regression
Example
X = [258.0, 270.0, 294.0, 320.0, 342.0, 368.0, 396.0, 446.0, 480.0, 586.0]
y = [236.4, 234.4, 252.8, 298.6, 314.2, 342.2, 360.8, 368.0, 391.2, 390.8]
>> X = np.array([258.0, 270.0, 294.0, 320.0, 342.0, 368.0, 396.0, 446.0, 480.0, 586.0])[:, np.newaxis]
   y = np.array([236.4, 234.4, 252.8, 298.6, 314.2, 342.2, 360.8, 368.0, 391.2, 390.8])
>> plt.scatter(X, y, label='Training points')
   plt.xlabel('X')
   plt.ylabel('y')
   plt.legend()
   plt.show()
Visualize data
Experiment with Linear Regression
>> from sklearn.linear_model import LinearRegression
   lr = LinearRegression()
   lr.fit(X, y)
Experiment with Linear Regression (cont.)
Experiment with Polynomial Regression
Syntax
from sklearn.preprocessing import PolynomialFeatures
>> from sklearn.preprocessing import PolynomialFeatures
   quadratic = PolynomialFeatures(degree=2)
   X_quad = quadratic.fit_transform(X)
   pr = LinearRegression()
   pr.fit(X_quad, y)
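For intuition (not on the original slide), PolynomialFeatures(degree=2) maps each single-feature sample x to the columns [1, x, x^2], so the subsequent LinearRegression fits a quadratic in x:

>> quadratic.fit_transform(np.array([[3.0]]))
>> [[1., 3., 9.]]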
Experiment with Polynomial Regression (cont.)
>> X_test = np.arange(250, 600, 10)[:, np.newaxis]
>> y_pred_linear = lr.predict(X_test)
   y_pred_quad = pr.predict(quadratic.fit_transform(X_test))
>> plt.scatter(X, y, label='Training points')
   plt.xlabel('X')
   plt.ylabel('y')
   plt.plot(X_test, y_pred_linear, label='Linear fit', c='black')
   plt.plot(X_test, y_pred_quad, label='Quadratic fit', c='orange')
   plt.legend()
   plt.show()
Practice
Dataset: 'Boston Housing' (housing.csv), 14 attributes: 13 independent variables + 1 target variable
File: boston_housing.ipynb
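As a starting point (not prescribed by the slides), the dataset could be loaded with pandas; the column layout is an assumption here, with the last column taken as the target:

>> import pandas as pd
   df = pd.read_csv('housing.csv')      # assumes a comma-separated file with a header row
   X = df.iloc[:, :-1].values           # 13 independent variables (assumption: target is the last column)
   y = df.iloc[:, -1].values            # target variable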