Hemraj Python Ass1

The document outlines assignments for building linear and logistic regression models using various datasets, including sales, real estate, user demographics, fish species, and iris flowers. It provides step-by-step programming instructions using Python libraries such as pandas, numpy, and scikit-learn for data manipulation and model training. Each assignment includes dataset creation, data splitting, model training, prediction, and evaluation of model accuracy.

Uploaded by

hemrajbhongale8


Assignment 1: Linear and Logistic Regression

SET A
1) Create a 'sales' data set having 5 columns, namely ID, TV, Radio, Newspaper
and Sales (500 random entries). Build a linear regression model by identifying
the independent and target variables. Split the data into training and testing
sets in a 7:3 ratio and print them. Build a simple linear regression model.
Program:-
import pandas as pd
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression
import matplotlib.pyplot as plt

# Step 1: Create the sales dataset

np.random.seed(42)
ID = np.arange(1, 501)
TV = np.random.uniform(0, 100, 500)
Radio = np.random.uniform(0, 50, 500)
Newspaper = np.random.uniform(0, 30, 500)
Sales = 3 + 0.05 * TV + 0.1 * Radio + 0.02 * Newspaper + np.random.normal(0, 5, 500)

sales_data = pd.DataFrame({
'ID': ID,
'TV': TV,
'Radio': Radio,
'Newspaper': Newspaper,
'Sales': Sales
})

# Step 2: Split the data into independent (X) and target (y) variables
X = sales_data[['TV', 'Radio', 'Newspaper']]
y = sales_data['Sales']

# Step 3: Split the dataset into training and testing sets (7:3 ratio)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)

# Step 4: Print the split data

print("Training set (X_train):")
print(X_train.head())
print("Testing set (X_test):")
print(X_test.head())

# Step 5: Build the linear regression model
model = LinearRegression()
model.fit(X_train, y_train)

# Step 6: Make predictions

y_pred = model.predict(X_test)

# Print the coefficients

print("Coefficients:", model.coef_)
print("Intercept:", model.intercept_)

# Step 7: Plot the results

plt.scatter(y_test, y_pred)
plt.xlabel("Actual Sales")
plt.ylabel("Predicted Sales")
plt.title("Linear Regression: Actual vs Predicted Sales")
plt.show()
Output:-
Example output for training set:

Training set (X_train):

            TV      Radio  Newspaper
374   4.537760  25.451522   9.047601
28   70.243315  25.989796  22.231161
456  80.651719  44.563722  12.669033
209  60.330544  16.218829  26.485149
431  96.945695  27.497699  18.827547

Example output for testing set:

Testing set (X_test):

            TV      Radio  Newspaper
80    8.139962  43.664348   3.476083
125  45.285008  15.660353  28.916305
225  65.058937  27.791765   3.798982
282  72.334036  48.151510  12.084336
305  55.535741  37.179261   9.443671

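Coefficients and intercept: after training, the program prints the model's coefficients
(one per independent variable, in the order TV, Radio, Newspaper) and the intercept. Each
coefficient estimates the change in the target (Sales) for a one-unit change in that
variable, with the other variables held fixed.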

Example output:

Coefficients: [0.05023864 0.09843639 0.02031991]

Intercept: 3.0009676921841325
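
The 7:3 ratio asked for in the problem can be checked by comparing the sizes of the two sets. The following is a minimal sketch, assuming the same synthetic data recipe and seed as the program above; the printed row counts follow from `test_size=0.3` applied to 500 rows.

```python
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split

# Rebuild the synthetic sales data with the same recipe as the program above
np.random.seed(42)
n = 500
sales_data = pd.DataFrame({
    'ID': np.arange(1, n + 1),
    'TV': np.random.uniform(0, 100, n),
    'Radio': np.random.uniform(0, 50, n),
    'Newspaper': np.random.uniform(0, 30, n),
})
sales_data['Sales'] = (3 + 0.05 * sales_data['TV'] + 0.1 * sales_data['Radio']
                       + 0.02 * sales_data['Newspaper']
                       + np.random.normal(0, 5, n))

X = sales_data[['TV', 'Radio', 'Newspaper']]
y = sales_data['Sales']
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=42)

# 70% of the 500 rows go to training and 30% to testing
print("Training rows:", X_train.shape[0])
print("Testing rows:", X_test.shape[0])
print("Training fraction:", X_train.shape[0] / len(X))
```

Printing `X_train` and `X_test` without `.head()` would show the full sets rather than only the first five rows.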
2) Create a 'realestate' data set having 4 columns, namely ID, flat, houses and
purchases (500 random entries). Build a linear regression model by
identifying the independent and target variables. Split the data into training
and testing sets and print them. Build a simple linear regression model for
predicting purchases.
Program:-
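import pandas as pd
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression
import matplotlib.pyplot as plt

# Step 1: Create the real estate dataset
ID = np.arange(1, 501)
flat = np.random.uniform(50, 200, 500)
houses = np.random.uniform(1, 10, 500)
purchases = 200 + 1.5 * flat + 3 * houses + np.random.normal(0, 50, 500)

realestate_data = pd.DataFrame({
'ID': ID,
'flat': flat,
'houses': houses,
'purchases': purchases
})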

# Step 2: Split the data into independent (X) and target (y) variables
X_realestate = realestate_data[['flat', 'houses']]
y_realestate = realestate_data['purchases']

# Step 3: Split the dataset into training and testing sets

X_train_realestate, X_test_realestate, y_train_realestate, y_test_realestate = train_test_split(
    X_realestate, y_realestate, test_size=0.3, random_state=42)

# Step 4: Print the split data

print("Training set (X_train_realestate):")
print(X_train_realestate.head())
print("Testing set (X_test_realestate):")
print(X_test_realestate.head())

# Step 5: Build the linear regression model

model_realestate = LinearRegression()
model_realestate.fit(X_train_realestate, y_train_realestate)

# Step 6: Make predictions

y_pred_realestate = model_realestate.predict(X_test_realestate)

# Print the coefficients

print("Coefficients:", model_realestate.coef_)
print("Intercept:", model_realestate.intercept_)

# Step 7: Plot the results

plt.scatter(y_test_realestate, y_pred_realestate)
plt.xlabel("Actual Purchases")
plt.ylabel("Predicted Purchases")
plt.title("Linear Regression: Actual vs Predicted Purchases")
plt.show()
Output:-
Example structure of the dataset:

ID   flat   houses  purchases
1    150.5  5.2     853.0
2    130.0  3.1     725.5
3    178.9  8.7     935.8
4    124.3  4.5     688.2
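
The problem asks for a simple linear regression, which in the strict sense uses a single independent variable, whereas the program above fits two (flat and houses). The following is a minimal sketch of the single-variable case, assuming `flat` is chosen as the predictor; the seed and the names `X_simple` and `simple_model` are illustrative and not part of the original program.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split

np.random.seed(0)  # assumed seed, used only to make the run repeatable
n = 500
flat = np.random.uniform(50, 200, n)
houses = np.random.uniform(1, 10, n)
purchases = 200 + 1.5 * flat + 3 * houses + np.random.normal(0, 50, n)
realestate_data = pd.DataFrame({'ID': np.arange(1, n + 1), 'flat': flat,
                                'houses': houses, 'purchases': purchases})

# Simple linear regression: one independent variable (flat)
X_simple = realestate_data[['flat']]   # double brackets keep X two-dimensional
y = realestate_data['purchases']
X_tr, X_te, y_tr, y_te = train_test_split(
    X_simple, y, test_size=0.3, random_state=42)

simple_model = LinearRegression().fit(X_tr, y_tr)
print("Slope for flat:", simple_model.coef_[0])
print("Intercept:", simple_model.intercept_)
print("R^2 on the test set:", simple_model.score(X_te, y_te))
```

Because `houses` is left out, its average contribution is absorbed into the intercept, so the intercept here is expected to differ from the one in the two-variable model.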

3) Create a 'User' data set having 5 columns, namely User ID, Gender, Age,
EstimatedSalary and Purchased. Build a logistic regression model that predicts,
from the given parameters, whether a person will buy a car or not.
Program:-

import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.preprocessing import LabelEncoder
from sklearn.metrics import accuracy_score

# Step 1: Create the User dataset

user_id = np.arange(1, 501)
gender = np.random.choice(['Male', 'Female'], 500)
age = np.random.randint(18, 70, 500)
estimated_salary = np.random.uniform(15000, 120000, 500)
purchased = np.random.choice([0, 1], 500)

user_data = pd.DataFrame({
'User ID': user_id,
'Gender': gender,
'Age': age,
'EstimatedSalary': estimated_salary,
'Purchased': purchased
})

# Step 2: Encode categorical 'Gender' feature

le = LabelEncoder()
user_data['Gender'] = le.fit_transform(user_data['Gender'])

# Step 3: Split the data into independent (X) and target (y) variables
X_user = user_data[['Age', 'EstimatedSalary', 'Gender']]
y_user = user_data['Purchased']

# Step 4: Split the dataset into training and testing sets

X_train_user, X_test_user, y_train_user, y_test_user = train_test_split(X_user, y_user,
test_size=0.3, random_state=42)

# Step 5: Build the logistic regression model

log_reg_model = LogisticRegression()
log_reg_model.fit(X_train_user, y_train_user)

# Step 6: Make predictions

y_pred_user = log_reg_model.predict(X_test_user)

# Step 7: Print accuracy

accuracy = accuracy_score(y_test_user, y_pred_user)
print("Accuracy of the Logistic Regression Model:", accuracy)

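Output:-
Accuracy of the Logistic Regression Model: 0.89
Note: in this program Purchased is drawn at random, independently of Age,
EstimatedSalary and Gender, and no random seed is set, so the printed accuracy
varies between runs and is expected to be close to 0.5 rather than 0.89.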
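
In the program above, Purchased is generated independently of the other columns, so the model has no real pattern to learn. The following is a minimal sketch of one way to make the exercise meaningful, assuming a hypothetical rule in which the probability of purchase rises with age and salary. The rule, its coefficients, the seed and the use of `StandardScaler` are all assumptions made for illustration, not part of the original assignment.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(42)  # assumed seed for repeatability
n = 500
age = rng.integers(18, 70, n)            # ages 18..69
salary = rng.uniform(15000, 120000, n)
gender = rng.integers(0, 2, n)           # already encoded as 0 or 1

# Hypothetical rule: purchase probability rises with age and salary
score = 0.15 * (age - 44) + 0.00008 * (salary - 67500)
prob = 1 / (1 + np.exp(-score))
purchased = (rng.random(n) < prob).astype(int)

X = pd.DataFrame({'Age': age, 'EstimatedSalary': salary, 'Gender': gender})
y = purchased
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.3, random_state=42)

# Scale the features so the large salary values do not dominate the fit
clf = make_pipeline(StandardScaler(), LogisticRegression())
clf.fit(X_tr, y_tr)
acc = accuracy_score(y_te, clf.predict(X_te))
print("Accuracy with feature-dependent labels:", acc)
```

With labels tied to the features in this way, the accuracy should sit well above the 0.5 expected from random labels; the exact value depends on the assumed rule and seed.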

SET B

1) Build a simple linear regression model for Fish Species Weight Prediction.
(Download the dataset from https://www.kaggle.com/aungpyaeap/fish-market?select=Fish.csv)
Program:-

import pandas as pd
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split

# Step 1: Load the fish dataset

fish_data = pd.read_csv('Fish.csv')

# Step 2: Split the data into independent (X) and target (y) variables
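# Fish.csv has Length1, Length2 and Length3 columns (there is no plain 'Length' column)
X_fish = fish_data[['Length1', 'Length2', 'Length3', 'Height', 'Width']]
y_fish = fish_data['Weight']

# Step 3: Split the dataset into training and testing sets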
X_train_fish, X_test_fish, y_train_fish, y_test_fish = train_test_split(X_fish, y_fish,
test_size=0.3, random_state=42)

# Step 4: Build the linear regression model

fish_model = LinearRegression()
fish_model.fit(X_train_fish, y_train_fish)

# Step 5: Make predictions

y_pred_fish = fish_model.predict(X_test_fish)

# Print the coefficients

print("Coefficients:", fish_model.coef_)
print("Intercept:", fish_model.intercept_)
Output:-
   Length1  Length2  Length3  Height  Width  Weight
0     23.2     25.4     30.0    11.5    4.0   242.0
1     24.0     26.3     31.2    12.0    4.8   290.0
2     23.9     26.5     31.1    12.2    4.8   340.0
3     26.3     29.0     33.5    12.4    5.0   363.0
4     26.5     29.0     34.0    12.5    4.9   430.0
RangeIndex: 159 entries, 0 to 158
Data columns (total 6 columns):
 #   Column   Non-Null Count  Dtype
 0   Length1  159 non-null    float64
 1   Length2  159 non-null    float64
 2   Length3  159 non-null    float64
 3   Height   159 non-null    float64
 4   Width    159 non-null    float64
 5   Weight   159 non-null    float64
dtypes: float64(6)
memory usage: 7.6 KB
None
Mean Squared Error: 2746.50
R-squared: 0.885
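
The output above reports a mean squared error and an R-squared value, which the program as listed does not compute. The following is a minimal sketch of how such metrics could be obtained with `mean_squared_error` and `r2_score` from scikit-learn. The helper name `fit_and_score` is hypothetical, and the demo frame is a synthetic stand-in (not the Kaggle data) so that the sketch runs without the download; its numbers will not match the figures above.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error, r2_score
from sklearn.model_selection import train_test_split

FEATURES = ['Length1', 'Length2', 'Length3', 'Height', 'Width']

def fit_and_score(fish_df):
    """Fit a linear model for Weight; return (MSE, R^2) on a 30% test split."""
    X = fish_df[FEATURES]
    y = fish_df['Weight']
    X_tr, X_te, y_tr, y_te = train_test_split(
        X, y, test_size=0.3, random_state=42)
    model = LinearRegression().fit(X_tr, y_tr)
    y_pred = model.predict(X_te)
    return mean_squared_error(y_te, y_pred), r2_score(y_te, y_pred)

# With the real file, one would use: fish_df = pd.read_csv('Fish.csv')
# Synthetic stand-in with the same column names (an assumption for the demo)
rng = np.random.default_rng(0)
length1 = rng.uniform(10, 40, 159)
demo = pd.DataFrame({
    'Length1': length1,
    'Length2': length1 * 1.08 + rng.normal(0, 0.3, 159),
    'Length3': length1 * 1.18 + rng.normal(0, 0.5, 159),
    'Height': rng.uniform(2, 18, 159),
    'Width': rng.uniform(1, 8, 159),
})
demo['Weight'] = (30 * demo['Length1'] + 20 * demo['Height']
                  + rng.normal(0, 50, 159))

mse, r2 = fit_and_score(demo)
print(f"Mean Squared Error: {mse:.2f}")
print(f"R-squared: {r2:.3f}")
```

Calling `fish_data.head()` and `fish_data.info()` before fitting would produce the preview and column summary shown in the output.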

2) Use the iris dataset. Write a Python program to view some basic statistical
details like percentile, mean, std etc. of the species 'Iris-setosa',
'Iris-versicolor' and 'Iris-virginica'. Apply logistic regression on the dataset to
identify the different species (setosa, versicolor, virginica) of Iris flowers given
just 4 features: sepal and petal lengths and widths. Find the accuracy of the
model.
Program:-
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

# Step 1: Load the Iris dataset

iris = load_iris()
X_iris = iris.data
y_iris = iris.target

# Step 2: Split the dataset into training and testing sets

X_train_iris, X_test_iris, y_train_iris, y_test_iris = train_test_split(X_iris, y_iris, test_size=0.3,
random_state=42)

# Step 3: Build the logistic regression model

log_reg_iris = LogisticRegression(max_iter=200)
log_reg_iris.fit(X_train_iris, y_train_iris)

# Step 4: Make predictions

y_pred_iris = log_reg_iris.predict(X_test_iris)

# Step 5: Calculate accuracy

accuracy_iris = accuracy_score(y_test_iris, y_pred_iris)
print("Accuracy of Logistic Regression Model for Iris Dataset:", accuracy_iris)
Output:-

Accuracy of Logistic Regression Model for Iris Dataset: 0.9777777777777777
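
The program above covers only the classification part of the problem. The per-species statistics (percentiles, mean, std and so on) could be obtained along the following lines. This is a minimal sketch, assuming the scikit-learn copy of the dataset, which names the species 'setosa', 'versicolor' and 'virginica' without the 'Iris-' prefix; the names `iris_df` and `species` are illustrative.

```python
import pandas as pd
from sklearn.datasets import load_iris

iris = load_iris()
iris_df = pd.DataFrame(iris.data, columns=iris.feature_names)
# Map the numeric targets (0, 1, 2) to their species names
iris_df['species'] = iris.target_names[iris.target]

# describe() reports count, mean, std, min, the 25%/50%/75% percentiles and max
for name, group in iris_df.groupby('species'):
    print(f"--- {name} ---")
    print(group.drop(columns='species').describe())
```

A single table for all species can also be produced with `iris_df.groupby('species').describe()`, at the cost of a wider, harder-to-read layout.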
