0% found this document useful (0 votes)

70 views7 pages

Assignment 4

The document outlines a process for creating a linear regression model using Python libraries such as Pandas, Numpy, and Matplotlib. It includes steps for data preparation, model creation using the Polyfit function, predictions, and performance evaluation using R-squared. Additionally, it introduces the Boston Housing dataset, detailing its attributes and characteristics.

Uploaded by

Omkar Landge

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

70 views7 pages

Assignment 4

Uploaded by

Omkar Landge

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

In [1]:

# Import libraries and create alias for Pandas, Numpy and Matplotlib import pandas as pd
import numpy as np
import matplotlib.pyplot as plt

In [2]:
# Create a Dataframe with Dependent Variable(x) and independent variable y. x=np.array([95,85,80,70,60])

y=np.array([85,95,70,65,70])

In [3]:
# Create Linear Regression Model using Polyfit Function: model= np.polyfit(x, y,
1)

In [4]:
# Observe the coefficients of the model.
model
array([ 0.64383562,
26.78082192])
Out[4]:

In [5]:
# Predict the Y value for X and observe the output. predict =
np.poly1d(model)
predict(65)
68.63013698630
137
Out[5]:

In [6]:
# Predict the y_pred for all values of x. y_pred = predict(x)
y_pred
array([87.94520548, 81.50684932, 78.28767123,
71.84931507, 65.4109589 ])
Out[6]:

In [7]:
# Evaluate the performance of Model (R-Suare) from
sklearn.metrics import r2_score
r2_score(y, y_pred)
0.4803218090889
322
Out[7]:

In [8]:
# Plotting the linear regression model y_line = model[1] +
model[0]* x
plt.plot(x, y_line, c = 'r')
plt.scatter(x,y_pred)
plt.scatter(x,y,c='r')
<matplotlib.collections.PathCollection at
0x1c17e8ab490>
Out[8]:

Loading [MathJax]/extensions/Safe.js

Algorithm (Boston Dataset):

In [9]:
# Import libraries and create alias for Pandas, Numpy and Matplotlib import numpy as np

import pandas as pd
import matplotlib.pyplot as plt

In [16]:
# Import the Boston Housing dataset
from sklearn.datasets import load_boston
Boston = load_boston()

In [18]:
Boston
Loading [MathJax]/extensions/Safe.js
{'data': array([[6.3200e-03, 1.8000e+01, 2.3100e+00, ..., 1.5300e+01, 3.9690e+02, 4.9800e+00],
[2.7310e-02, 0.0000e+00, 7.0700e+00, ..., 1.7800e+01, 3.9690e+02,
9.1400e+00],
[2.7290e-02, 0.0000e+00, 7.0700e+00, ..., 1.7800e+01, 3.9283e+02,
4.0300e+00],
...,
[6.0760e-02, 0.0000e+00, 1.1930e+01, ..., 2.1000e+01, 3.9690e+02,
5.6400e+00],
[1.0959e-01, 0.0000e+00, 1.1930e+01, ..., 2.1000e+01, 3.9345e+02,
6.4800e+00],
[4.7410e-02, 0.0000e+00, 1.1930e+01, ..., 2.1000e+01, 3.9690e+02,
7.8800e+00]]), 'target': array([24. , 21.6, 34.7, 33.4, 36.2, 28.7, 22.9, 27.1, 16.5, 18.9, 15. ,
18.9, 21.7, 20.4, 18.2, 19.9, 23.1, 17.5, 20.2, 18.2, 13.6, 19.6,
15.2, 14.5, 15.6, 13.9, 16.6, 14.8, 18.4, 21. , 12.7, 14.5, 13.2,
13.1, 13.5, 18.9, 20. , 21. , 24.7, 30.8, 34.9, 26.6, 25.3, 24.7,
21.2, 19.3, 20. , 16.6, 14.4, 19.4, 19.7, 20.5, 25. , 23.4, 18.9,
35.4, 24.7, 31.6, 23.3, 19.6, 18.7, 16. , 22.2, 25. , 33. , 23.5,
19.4, 22. , 17.4, 20.9, 24.2, 21.7, 22.8, 23.4, 24.1, 21.4, 20. ,
20.8, 21.2, 20.3, 28. , 23.9, 24.8, 22.9, 23.9, 26.6, 22.5, 22.2,
23.6, 28.7, 22.6, 22. , 22.9, 25. , 20.6, 28.4, 21.4, 38.7, 43.8,
33.2, 27.5, 26.5, 18.6, 19.3, 20.1, 19.5, 19.5, 20.4, 19.8, 19.4,
21.7, 22.8, 18.8, 18.7, 18.5, 18.3, 21.2, 19.2, 20.4, 19.3, 22. ,
20.3, 20.5, 17.3, 18.8, 21.4, 15.7, 16.2, 18. , 14.3, 19.2, 19.6,
23. , 18.4, 15.6, 18.1, 17.4, 17.1, 13.3, 17.8, 14. , 14.4, 13.4,
15.6, 11.8, 13.8, 15.6, 14.6, 17.8, 15.4, 21.5, 19.6, 15.3, 19.4,
17. , 15.6, 13.1, 41.3, 24.3, 23.3, 27. , 50. , 50. , 50. , 22.7,
25. , 50. , 23.8, 23.8, 22.3, 17.4, 19.1, 23.1, 23.6, 22.6, 29.4,
23.2, 24.6, 29.9, 37.2, 39.8, 36.2, 37.9, 32.5, 26.4, 29.6, 50. ,
32. , 29.8, 34.9, 37. , 30.5, 36.4, 31.1, 29.1, 50. , 33.3, 30.3,
34.6, 34.9, 32.9, 24.1, 42.3, 48.5, 50. , 22.6, 24.4, 22.5, 24.4,
20. , 21.7, 19.3, 22.4, 28.1, 23.7, 25. , 23.3, 28.7, 21.5, 23. ,
26.7, 21.7, 27.5, 30.1, 44.8, 50. , 37.6, 31.6, 46.7, 31.5, 24.3,
31.7, 41.7, 48.3, 29. , 24. , 25.1, 31.5, 23.7, 23.3, 22. , 20.1,
22.2, 23.7, 17.6, 18.5, 24.3, 20.5, 24.5, 26.2, 24.4, 24.8, 29.6,
42.8, 21.9, 20.9, 44. , 50. , 36. , 30.1, 33.8, 43.1, 48.8, 31. ,
36.5, 22.8, 30.7, 50. , 43.5, 20.7, 21.1, 25.2, 24.4, 35.2, 32.4,
32. , 33.2, 33.1, 29.1, 35.1, 45.4, 35.4, 46. , 50. , 32.2, 22. ,
20.1, 23.2, 22.3, 24.8, 28.5, 37.3, 27.9, 23.9, 21.7, 28.6, 27.1,
20.3, 22.5, 29. , 24.8, 22. , 26.4, 33.1, 36.1, 28.4, 33.4, 28.2,
22.8, 20.3, 16.1, 22.1, 19.4, 21.6, 23.8, 16.2, 17.8, 19.8, 23.1,
21. , 23.8, 23.1, 20.4, 18.5, 25. , 24.6, 23. , 22.2, 19.3, 22.6,
19.8, 17.1, 19.4, 22.2, 20.7, 21.1, 19.5, 18.5, 20.6, 19. , 18.7,
32.7, 16.5, 23.9, 31.2, 17.5, 17.2, 23.1, 24.5, 26.6, 22.9, 24.1,
18.6, 30.1, 18.2, 20.6, 17.8, 21.7, 22.7, 22.6, 25. , 19.9, 20.8,
16.8, 21.9, 27.5, 21.9, 23.1, 50. , 50. , 50. , 50. , 50. , 13.8,
13.8, 15. , 13.9, 13.3, 13.1, 10.2, 10.4, 10.9, 11.3, 12.3, 8.8,
7.2, 10.5, 7.4, 10.2, 11.5, 15.1, 23.2, 9.7, 13.8, 12.7, 13.1,
12.5, 8.5, 5. , 6.3, 5.6, 7.2, 12.1, 8.3, 8.5, 5. , 11.9,
27.9, 17.2, 27.5, 15. , 17.2, 17.9, 16.3, 7. , 7.2, 7.5, 10.4,
8.8, 8.4, 16.7, 14.2, 20.8, 13.4, 11.7, 8.3, 10.2, 10.9, 11. ,
9.5, 14.5, 14.1, 16.1, 14.3, 11.7, 13.4, 9.6, 8.7, 8.4, 12.8,
10.5, 17.1, 18.4, 15.4, 10.8, 11.8, 14.9, 12.6, 14.1, 13. , 13.4,
15.2, 16.1, 17.8, 14.9, 14.1, 12.7, 13.5, 14.9, 20. , 16.4, 17.7,
19.5, 20.2, 21.4, 19.9, 19. , 19.1, 19.1, 20.1, 19.9, 19.6, 23.2,
Loading [MathJax]/extensions/Safe.js
29.8, 13.8, 13.3, 16.7, 12. , 14.6, 21.4, 23. , 23.7, 25. , 21.8,
20.6, 21.2, 19.1, 20.6, 15.2, 7. , 8.1, 13.6, 20.1, 21.8, 24.5,
23.1, 19.7, 18.3, 21.2, 17.5, 16.8, 22.4, 20.6, 23.9, 22. , 11.9]), 'feature_name s': array(['CRIM', 'ZN',
'INDUS', 'CHAS', 'NOX', 'RM', 'AGE', 'DIS', 'RAD', 'TAX', 'PTRATIO', 'B', 'LSTAT'], dtype='<U7'), 'DESCR': "..
_boston_dataset:\n\nB oston house prices dataset\n---------------------------\n\n**Data Set Characteristics:**
\n\n :Number of Instances: 506 \n\n :Number of Attributes: 13 numeric/categorical predictive. Median Value
(attribute 14) is usually the target.\n\n :Attribute Informa tion (in order):\n - CRIM per capita crime rate by
town\n - ZN p roportion of residential land zoned for lots over 25,000 sq.ft.\n - INDUS prop ortion of
non-retail business acres per town\n - CHAS Charles River dummy var iable (= 1 if tract bounds river; 0
otherwise)\n - NOX nitric oxides concent ration (parts per 10 million)\n - RM average number of rooms per
dwelling\n - AGE proportion of owner-occupied units built prior to 1940\n - DIS we ighted distances to five
Boston employment centres\n - RAD index of accessib ility to radial highways\n - TAX full-value property-tax
rate per $10,000\n - PTRATIO pupil-teacher ratio by town\n - B 1000(Bk - 0.63)^2 where Bk is the
proportion of black people by town\n - LSTAT % lower status of the populat ion\n - MEDV Median value of
owner-occupied homes in $1000's\n\n :Missing Attribute Values: None\n\n :Creator: Harrison, D. and
Rubinfeld, D.L.\n\nThis is a co py of UCI ML housing
dataset.\nhttps://archive.ics.uci.edu/ml/machine-learning-database s/housing/\n\n\nThis dataset was taken
from the StatLib library which is maintained at C arnegie Mellon University.\n\nThe Boston house-price data
of Harrison, D. and Rubinfeld, D.L. 'Hedonic\nprices and the demand for clean air', J. Environ. Economics &
Managemen t,\nvol.5, 81-102, 1978. Used in Belsley, Kuh & Welsch, 'Regression diagnostics\n...', Wiley,
1980. N.B. Various transformations are used in the table on\npages 244-261 of t he latter.\n\nThe Boston
house-price data has been used in many machine learning papers that address regression\nproblems. \n
\n.. topic:: References\n\n - Belsley, Kuh & Welsch, 'Regression diagnostics: Identifying Influential Data and
Sources of Collinear ity', Wiley, 1980. 244-261.\n - Quinlan,R. (1993). Combining Instance-Based and Model
Based Learning. In Proceedings on the Tenth International Conference of Machine Learnin g, 236-243,
University of Massachusetts, Amherst. Morgan Kaufmann.\n", 'filename': 'bost on_house_prices.csv',
'data_module': 'sklearn.datasets.data'}

In [20]:
# Initialize the data frame
data = pd.DataFrame(boston.data)

# Add the feature names to the dataframe

data.columns = boston.feature_names
data.head()
7.07 0.0 0.469 7.185 61.1 4.9671 2.0 242.0 17.8 392.83 4.03 3

Out[20]: 0.03237 0.0 2.18 0.0 0.458 6.998 45.8 6.0622 3.0 222.0 18.7
CRIM ZN INDUS CHAS NOX RM AGE DIS RAD TAX
394.63 2.94 4 0.06905 0.0 2.18 0.0 0.458 7.147 54.2 6.0622 3.0
PTRATIO B LSTAT 0 0.00632 18.0 2.31 0.0 0.538 6.575 65.2
222.0 18.7 396.90 5.33
4.0900 1.0 296.0 15.3 396.90 4.98 1 0.02731 0.0 7.07 0.0 0.469

6.421 78.9 4.9671 2.0 242.0 17.8 396.90 9.14 2 0.02729 0.0

In [21]:
# Adding target variable to dataframe data['PRICE'] =
boston.target
Loading [MathJax]/extensions/Safe.js
In [22]:
# Perform Data Preprocessing( Check for missing values) data.isnull().sum()
TAX 0
Out[22]: PTRATIO 0 B
CRIM 0 ZN 0 0 LSTAT 0
INDUS 0 PRICE 0
CHAS 0 NOX dtype: int64
0 RM 0 AGE 0
DIS 0 RAD 0

In [23]:
# Split dependent variable and independent variables x =
data.drop(['PRICE'], axis = 1)
y = data['PRICE']

In [25]:
# splitting data to training and testing dataset. from sklearn.model_selection
import train_test_split
xtrain, xtest, ytrain, ytest = train_test_split(x, y, test_size =0.2,random_state = 0)
In [27]:
# Use linear regression( Train the Machine ) to Create Model from
sklearn.linear_model import LinearRegression
lm = LinearRegression()
model = lm.fit(xtrain, ytrain)

In [28]:
# Predict the y_pred for all values of train_x and test_x ytrain_pred =
lm.predict(xtrain)
ytest_pred = lm.predict(xtest)

In [29]:
# Evaluate the performance of Model for train_y and test_y df =
pd.DataFrame(ytrain_pred,ytrain)
df = pd.DataFrame(ytest_pred,ytest)

In [30]:
# Calculate Mean Square Paper for train_y and test_y from sklearn.metrics
import mean_squared_error, r2_score mse = mean_squared_error(ytest, ytest_pred)
print(mse)

Loading [MathJax]/extensions/Safe.js
mse = mean_squared_error(ytrain_pred,ytrain)
print(mse)

33.44897999767649
19.326470203585725

In [33]:
# Plotting the linear regression model
plt.scatter(ytrain ,ytrain_pred,c='blue',marker='o',label='Training data')
plt.scatter(ytest,ytest_pred,c='lightgreen',marker='s',label='Test data') plt.xlabel('True values')
plt.ylabel('Predicted')
plt.title("True value vs Predicted value")
plt.legend(loc='upper left')
plt.plot()
plt.show()
In [ ]:

Loading [MathJax]/extensions/Safe.js

Linear Reg
No ratings yet
Linear Reg
25 pages
DNN Tutorial for Data Scientists
No ratings yet
DNN Tutorial for Data Scientists
9 pages
T2 Summary VHA
No ratings yet
T2 Summary VHA
14 pages
Python ML for Engineers: Week 3
No ratings yet
Python ML for Engineers: Week 3
12 pages
Pandas
No ratings yet
Pandas
4 pages
Project 4 - House Price Prediction - Ipynb - Colab
No ratings yet
Project 4 - House Price Prediction - Ipynb - Colab
5 pages
ML Manual
No ratings yet
ML Manual
30 pages
The Boston Housing Dataset
100% (2)
The Boston Housing Dataset
4 pages
Boston Dataset
No ratings yet
Boston Dataset
6 pages
One Hot Encoding
No ratings yet
One Hot Encoding
12 pages
DL 1
No ratings yet
DL 1
4 pages
Week 6 LAB
No ratings yet
Week 6 LAB
13 pages
MLLab Manual
No ratings yet
MLLab Manual
24 pages
California Housing Data Analysis
No ratings yet
California Housing Data Analysis
1 page
Data Analytics I: Link of The Dataset
No ratings yet
Data Analytics I: Link of The Dataset
12 pages
Mlalllabprgs
No ratings yet
Mlalllabprgs
17 pages
Machine Learning Lab Manual
No ratings yet
Machine Learning Lab Manual
9 pages
Continuous Assessment
No ratings yet
Continuous Assessment
4 pages
ML Observation
No ratings yet
ML Observation
29 pages
Ds Pract 5 Data Analytics1 Vedanti
No ratings yet
Ds Pract 5 Data Analytics1 Vedanti
7 pages
Argha's ML LAB - 240927 - 121838
No ratings yet
Argha's ML LAB - 240927 - 121838
13 pages
ML Spy Programs
No ratings yet
ML Spy Programs
16 pages
Linear Regression with Boston Housing Data
No ratings yet
Linear Regression with Boston Housing Data
14 pages
Regression Problem
No ratings yet
Regression Problem
28 pages
Pattern - Recognition - 3 - Code With Output
No ratings yet
Pattern - Recognition - 3 - Code With Output
7 pages
Python - Vectorized - Tute - Jupyter Notebook
No ratings yet
Python - Vectorized - Tute - Jupyter Notebook
16 pages
1 Abril PDF
No ratings yet
1 Abril PDF
10 pages
A09Ass04 - Jupyter Notebook
No ratings yet
A09Ass04 - Jupyter Notebook
10 pages
ML Programs
No ratings yet
ML Programs
14 pages
Data Scientists' Guide to Predicting House Prices
No ratings yet
Data Scientists' Guide to Predicting House Prices
9 pages
ML Manual
No ratings yet
ML Manual
9 pages
Machine Learning Lab Manual
No ratings yet
Machine Learning Lab Manual
18 pages
Data Analytucs 1
No ratings yet
Data Analytucs 1
5 pages
Making Predictions
No ratings yet
Making Predictions
13 pages
ML Lab Manual
No ratings yet
ML Lab Manual
60 pages
Machine Learning Lab Manual
No ratings yet
Machine Learning Lab Manual
33 pages
Prg7a - Jupyter Notebook
No ratings yet
Prg7a - Jupyter Notebook
12 pages
Linear Regression Analysis - Polynomial Regression
No ratings yet
Linear Regression Analysis - Polynomial Regression
25 pages
A4 Dsbda Sana
No ratings yet
A4 Dsbda Sana
16 pages
Regression Analysis - Lasso and Ridge Regularization
No ratings yet
Regression Analysis - Lasso and Ridge Regularization
17 pages
Boston House Prediction - Colab1
No ratings yet
Boston House Prediction - Colab1
10 pages
DSBDA Prac4 2
No ratings yet
DSBDA Prac4 2
1 page
ML Labmanual
No ratings yet
ML Labmanual
33 pages
Dal Programs With Output
No ratings yet
Dal Programs With Output
11 pages
Assignment No 8
No ratings yet
Assignment No 8
17 pages
Data Analysis with Boston Dataset
No ratings yet
Data Analysis with Boston Dataset
4 pages
ML 1-11
No ratings yet
ML 1-11
27 pages
Machine Learning Lab Manual
No ratings yet
Machine Learning Lab Manual
26 pages
5 - One - Hot - Encoding - Ipynb - Colaboratory
No ratings yet
5 - One - Hot - Encoding - Ipynb - Colaboratory
8 pages
Unit 1: Shobana T S Assistant Professor Dept. of ISE, BMSCE
No ratings yet
Unit 1: Shobana T S Assistant Professor Dept. of ISE, BMSCE
127 pages
Document From Jahnavi
No ratings yet
Document From Jahnavi
20 pages
Lab 1. Boston House
No ratings yet
Lab 1. Boston House
7 pages
ML Lab Manual
No ratings yet
ML Lab Manual
25 pages
Lab Extern L
No ratings yet
Lab Extern L
8 pages
Data Science Record - 05
No ratings yet
Data Science Record - 05
20 pages
RegresiÃ N Lineal Con Python - Ipynb
No ratings yet
RegresiÃ N Lineal Con Python - Ipynb
83 pages
ML - Datascience Manual
No ratings yet
ML - Datascience Manual
64 pages
House Price Prediction: Project Description
No ratings yet
House Price Prediction: Project Description
11 pages
ANN Mini ProjectSwami Exam
No ratings yet
ANN Mini ProjectSwami Exam
18 pages
Black and White Greyscale Photo Student Resume
No ratings yet
Black and White Greyscale Photo Student Resume
1 page
Intership Report
No ratings yet
Intership Report
20 pages
Ds Report
No ratings yet
Ds Report
20 pages
Internship Index
No ratings yet
Internship Index
10 pages
SIDD36
No ratings yet
SIDD36
8 pages
Non-Consumer Power Connectors Guide
No ratings yet
Non-Consumer Power Connectors Guide
7 pages
SPSS Data Analysis for Researchers
No ratings yet
SPSS Data Analysis for Researchers
13 pages
Economatrics Postmte 1
No ratings yet
Economatrics Postmte 1
46 pages
Demographic Profile of Region 1: Indicators
No ratings yet
Demographic Profile of Region 1: Indicators
2 pages
Syllabus III
100% (2)
Syllabus III
85 pages
Chapter 2
No ratings yet
Chapter 2
62 pages
Data Science Lab: Linear Regression
No ratings yet
Data Science Lab: Linear Regression
9 pages
Engineering Economy Essentials
No ratings yet
Engineering Economy Essentials
19 pages
Assignment#3 Multiple Regression and Manova 2021
No ratings yet
Assignment#3 Multiple Regression and Manova 2021
9 pages
Linear Regression for Crab Age
No ratings yet
Linear Regression for Crab Age
3 pages
Chapter 11 - 250305 - 102157
No ratings yet
Chapter 11 - 250305 - 102157
7 pages
BBS11 ISM Ch14
No ratings yet
BBS11 ISM Ch14
50 pages
(PDF) Risk Management Current Issues and Challenges PDF
No ratings yet
(PDF) Risk Management Current Issues and Challenges PDF
595 pages
Logit and Probit Models Explained
No ratings yet
Logit and Probit Models Explained
11 pages
Econ 104 Project 1 Ace Team 88
No ratings yet
Econ 104 Project 1 Ace Team 88
16 pages
Econometrics
No ratings yet
Econometrics
9 pages
Joint Life & Survivor Functions Practice
No ratings yet
Joint Life & Survivor Functions Practice
3 pages
Ch13slides Generalized Linear Models
No ratings yet
Ch13slides Generalized Linear Models
24 pages
Regression Analysis Case Study
No ratings yet
Regression Analysis Case Study
9 pages
CM1A - Feb 25 - SOL
No ratings yet
CM1A - Feb 25 - SOL
2 pages
Warnings: PLUM - Ordinal Regression
No ratings yet
Warnings: PLUM - Ordinal Regression
2 pages
HW 1
No ratings yet
HW 1
7 pages
Accenture Data Quality Key Solvency Requirements
No ratings yet
Accenture Data Quality Key Solvency Requirements
12 pages
Linear and Multiple Regression Models
No ratings yet
Linear and Multiple Regression Models
3 pages
Econometrics: Instrumental Variables
No ratings yet
Econometrics: Instrumental Variables
21 pages
Logistic Regression Explained
No ratings yet
Logistic Regression Explained
4 pages
Cost Estimation for Manufacturing
No ratings yet
Cost Estimation for Manufacturing
9 pages
Machine Learning Regression Assignment
No ratings yet
Machine Learning Regression Assignment
2 pages
Time Value of Money Problems Single and Mixed Streams
No ratings yet
Time Value of Money Problems Single and Mixed Streams
2 pages
Deloitte UK Pension Scheme Valuations Challenges Opportunities 2015
No ratings yet
Deloitte UK Pension Scheme Valuations Challenges Opportunities 2015
8 pages
Stat 475 Life Contingencies
No ratings yet
Stat 475 Life Contingencies
42 pages

Assignment 4

Uploaded by

Assignment 4

Uploaded by

In [1]:

Algorithm (Boston Dataset):

# Add the feature names to the dataframe

You might also like