Machine Learning
Types of AI
Machine Learning
Types of ML
Supervised learning: The computer is provided with labeled training data and
learns to map inputs to outputs.
Unsupervised learning: The computer is provided with unlabeled data and
learns to find underlying structures or patterns in the data.
Reinforcement learning: The computer learns to make decisions in an
environment by receiving rewards or punishments for its actions.
Deep learning: A type of machine learning that involves training artificial
neural networks with multiple layers to learn complex patterns in data.
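As a minimal illustration of the supervised paradigm (a sketch assuming scikit-learn is available; the toy data and model choice are illustrative, not from the notes):

```python
# Supervised learning: fit a model on labeled data, then predict unseen inputs
from sklearn.tree import DecisionTreeClassifier

# Toy labeled data: input (hours studied) -> output (fail = 0, pass = 1)
X = [[1], [2], [3], [8], [9], [10]]
y = [0, 0, 0, 1, 1, 1]

model = DecisionTreeClassifier()
model.fit(X, y)              # learn the input -> output mapping
print(model.predict([[7]]))  # predict the label for an unseen input
```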
Supervised ML algorithms:
Linear regression
Logistic regression
Decision tree
Support vector machine (SVM)
Naive Bayes
Linear discriminant analysis
K-nearest neighbors (KNN)
Neural networks
Random forest
Gradient boosting
XGBoost
Stochastic gradient descent
Adaptive boosting (AdaBoost)
Bagging
Classification and regression trees (CART)
Conditional random fields (CRF)
Gaussian processes (GP)
Hidden Markov models (HMM)
Kalman filter
Maximum entropy (MaxEnt)
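In scikit-learn, most of these algorithms share the same fit/predict interface; a quick sketch (the iris dataset and the three model choices here are illustrative):

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

models = {
    "logistic regression": LogisticRegression(max_iter=1000),
    "decision tree": DecisionTreeClassifier(),
    "KNN": KNeighborsClassifier(n_neighbors=5),
}

for name, model in models.items():
    model.fit(X, y)  # every algorithm uses the same interface
    print(name, "train accuracy:", round(model.score(X, y), 3))
```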
Unsupervised Learning:
K-means clustering
Hierarchical clustering
DBSCAN
GMM - Gaussian Mixture Models
PCA - Principal Component Analysis
t-SNE
Association Rule Learning (Apriori)
Autoencoders
Self-organizing maps (SOM)
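A minimal unsupervised sketch with k-means (the toy points and parameters are assumptions): the algorithm discovers the two groups without ever seeing labels.

```python
import numpy as np
from sklearn.cluster import KMeans

# Unlabeled 2D points forming two well-separated groups
X = np.array([[1, 1], [1.5, 2], [1, 0],
              [8, 8], [8.5, 9], [9, 8]])

kmeans = KMeans(n_clusters=2, n_init=10, random_state=42)
labels = kmeans.fit_predict(X)  # no labels supplied - structure is discovered
print(labels)
```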
Reinforcement Learning:
Q-learning
Deep Q-network (DQN)
Policy gradient methods
Actor-critic methods (e.g., A2C)
Proximal Policy Optimization (PPO)
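A tabular Q-learning sketch on a made-up 5-state corridor (the environment, rewards, and hyperparameters are all illustrative assumptions): the agent learns from rewards which action to take in each state.

```python
import numpy as np

# Tiny 1D corridor: states 0..4, goal at state 4; actions 0 = left, 1 = right
n_states, n_actions = 5, 2
Q = np.zeros((n_states, n_actions))
alpha, gamma, epsilon = 0.5, 0.9, 0.5  # learning rate, discount, exploration
rng = np.random.default_rng(0)

for episode in range(500):
    s = 0
    while s != 4:
        # epsilon-greedy: explore sometimes, otherwise act greedily
        a = int(rng.integers(n_actions)) if rng.random() < epsilon else int(Q[s].argmax())
        s_next = max(0, s - 1) if a == 0 else s + 1
        reward = 1.0 if s_next == 4 else 0.0  # reward only at the goal
        # Q-learning update: move Q[s, a] toward reward + discounted best future value
        Q[s, a] += alpha * (reward + gamma * Q[s_next].max() - Q[s, a])
        s = s_next

print(Q.argmax(axis=1)[:4])  # best action per non-goal state (1 = move right)
```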
Transfer Learning:
Pre-trained models, fine-tuning, domain adaptation, multi-task learning, model
ensembles, one-shot learning
Deep Learning:
CNNs, RNNs, GANs, autoencoders, transformers, DBNs
Ensemble Learning:
Bagging, Boosting, Stacking, Voting
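A sketch of two of these ensemble styles in scikit-learn (dataset and model choices are illustrative): voting combines different model families, bagging trains many copies of one model on bootstrap samples.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import BaggingClassifier, VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)

# Voting: different model families vote on each prediction
voting = VotingClassifier(estimators=[
    ('lr', LogisticRegression(max_iter=5000)),
    ('dt', DecisionTreeClassifier(random_state=0)),
    ('knn', KNeighborsClassifier()),
])
voting.fit(X, y)

# Bagging: many trees trained on bootstrap samples of the same data
bagging = BaggingClassifier(DecisionTreeClassifier(), n_estimators=20, random_state=0)
bagging.fit(X, y)

print("voting  accuracy:", round(voting.score(X, y), 3))
print("bagging accuracy:", round(bagging.score(X, y), 3))
```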
Terminologies:
Overfitting: the model memorizes the exact patterns (and noise) of the training data, so it performs poorly on test data.
Underfitting: the model fails to capture the underlying pattern, so it predicts poorly even on training data.
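A quick way to see overfitting (sketch assuming scikit-learn; the dataset choice is illustrative): compare train vs. test accuracy of an unconstrained decision tree against a depth-limited one.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# An unconstrained tree memorizes the training data (overfits)
deep = DecisionTreeClassifier(random_state=42).fit(X_train, y_train)
# A depth-limited tree captures less detail but generalizes better
shallow = DecisionTreeClassifier(max_depth=3, random_state=42).fit(X_train, y_train)

print("deep    train:", deep.score(X_train, y_train),
      "test:", round(deep.score(X_test, y_test), 3))
print("shallow train:", round(shallow.score(X_train, y_train), 3),
      "test:", round(shallow.score(X_test, y_test), 3))
```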
Batch/offline ML: the model is trained on the entire dataset at once (e.g., on a local machine) and then deployed to a server.
Online ML: data is fed into the model incrementally, so the model learns dynamically.
Model-based ML: learns a compact model from the data (e.g., the best-fit line is the model).
Instance-based ML: stores the training data itself as the model and, at prediction time, computes distances between the test data and the stored training data - a lazy learner.
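KNN is the classic instance-based (lazy) learner described above; a minimal sketch with toy data (illustrative, not from the notes):

```python
from sklearn.neighbors import KNeighborsClassifier

# Instance-based / lazy learning: fit() essentially just stores the data
X_train = [[0], [1], [2], [10], [11], [12]]
y_train = [0, 0, 0, 1, 1, 1]

knn = KNeighborsClassifier(n_neighbors=3)
knn.fit(X_train, y_train)  # no explicit model is built - the data is stored

# At prediction time, distances to the stored points decide the label
print(knn.predict([[1.5], [11.5]]))
```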
MLDLC - ML Development Life Cycle
1. Frame the problem
2. Gather the data
3. Data Preprocessing
4. Exploratory Data Analysis (EDA)
5. Feature Engineering and Feature Selection
6. Model Training, Evaluation and selection
7. Model Deployment
8. Testing
9. Optimize
1. Frame the problem:
2. Gather the data:
Loading a CSV file
# Import necessary libraries
import pandas as pd
# Load data from csv file
data = pd.read_csv('filename.csv')
# Print the first 5 rows of the dataframe
print(data.head())
Collecting data from an API
# Import necessary libraries
import requests
import json
# Define the API endpoint
url = 'https://api.example.com/data'
# Send a GET request to the API
response = requests.get(url)
# Convert the response to JSON format
data = response.json()
# Print the data
print(json.dumps(data, indent=4))
https://youtu.be/roTZJaxjnJc?feature=shared
Web Scraping:
# Import necessary libraries
from bs4 import BeautifulSoup
import requests
# Specify url
url = 'https://www.example.com'
# Send a GET request to the website
response = requests.get(url)
# Parse the html content
soup = BeautifulSoup(response.content, 'html.parser')
# Print out the parsed HTML
print(soup.prettify())
https://youtu.be/8NOdgjC1988?feature=shared
From JSON/SQL
https://youtu.be/fFwRC-fapIU?feature=shared
3. Data Preprocessing
Structural Issues
Data from different sources may not be compatible
Remove Duplicates
Handle Missing Values
Outliers
Scale - Standardization or Normalization
A few general operations:
df.shape # attribute, not a method
df.head()
df.tail()
df.sample()
df.isnull().sum()
df.duplicated().sum()
df.describe() # Summary statistics
df.info() # Column details
df.corr()
df.corr()['Age']
Here are some of the operations we perform during data preprocessing, along
with their respective Python codes:
1. Removing Duplicates:
import pandas as pd
# Assuming df is your DataFrame
df = pd.read_csv('filename.csv')
# Removing duplicates
df = df.drop_duplicates()
2. Handling Missing Values:
# Fill missing values with a constant, or the column's mean/median
df = df.fillna(value)
# Or you can drop rows with missing values
df = df.dropna()
3. Handling Outliers:
# Assuming 'column' is a column in df with outliers
Q1 = df['column'].quantile(0.25)
Q3 = df['column'].quantile(0.75)
IQR = Q3 - Q1
# Remove rows outside [Q1 - 1.5*IQR, Q3 + 1.5*IQR]
df = df[~((df['column'] < (Q1 - 1.5 * IQR)) | (df['column'] > (Q3 + 1.5 * IQR)))]
4. Feature Scaling (Standardization):
from sklearn.preprocessing import StandardScaler
scaler = StandardScaler()
# Assuming X is your features DataFrame
X = pd.DataFrame(scaler.fit_transform(X), columns=X.columns)
5. Feature Scaling (Normalization):
from sklearn.preprocessing import MinMaxScaler
scaler = MinMaxScaler()
# Assuming X is your features DataFrame
X = pd.DataFrame(scaler.fit_transform(X), columns=X.columns)
4. Exploratory Data Analysis (EDA):
“Study of the relationships between input and output features”
“Getting an idea about the data”
“Experimenting and extracting relationships”
Visualization
Univariate Analysis/ Bivariate Analysis
Outlier Detection
Data Imbalance
1. Visualization
The first question: is the column numerical or categorical?
Working on categorical columns:
1. Count plot
sns.countplot(x=df['survived'])
df['survived'].value_counts().plot(kind='bar')
2. Pie chart
df['survived'].value_counts().plot(kind='pie', autopct='%.2f')
working on Numerical columns:
1. Histogram
import matplotlib.pyplot as plt
plt.hist(df['Age'], bins=10) # more bins = finer-grained view
2. Distplot - PDF [Probability Density Function]
sns.distplot(df['Age']) # deprecated in newer seaborn; use sns.histplot(df['Age'], kde=True)
3. Box plot
sns.boxplot(df['Age'])
2. Bivariate and Multi variate Analysis
1. Scatter plot [Num vs Num]
bivariate
sns.scatterplot(x=tips['total_bill'], y=tips['tip'])
multivariate
sns.scatterplot(x=tips['total_bill'], y=tips['tip'], hue=tips['sex'])
sns.scatterplot(x=tips['total_bill'], y=tips['tip'], hue=tips['sex'], style=tips['smoker'], size=tips['size'])
# hue - change in color
# style - change in shape
# size - change in size
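For the "Data Imbalance" item listed under EDA, a quick first check is the class ratio (a sketch with a made-up 'survived' column):

```python
import pandas as pd

# Hypothetical target column with far more 0s than 1s
df = pd.DataFrame({'survived': [0] * 90 + [1] * 10})

counts = df['survived'].value_counts()
print(counts)
print("imbalance ratio:", counts.max() / counts.min())  # 9.0 for this toy data
```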
5. Feature Engineering and Selection:
Selecting Features
Merging columns
Minimizing the number of columns - time & cost efficient
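One common way to minimize columns is univariate feature selection; a sketch using scikit-learn's SelectKBest (the dataset choice is illustrative):

```python
from sklearn.datasets import load_iris
from sklearn.feature_selection import SelectKBest, f_classif

X, y = load_iris(return_X_y=True)
print("before:", X.shape)  # (150, 4)

# Keep the 2 features most related to the target (ANOVA F-test)
selector = SelectKBest(score_func=f_classif, k=2)
X_new = selector.fit_transform(X, y)
print("after :", X_new.shape)  # (150, 2)
```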
Gradient Descent
https://www.youtube.com/watch?v=qg4PchTECck&list=PLqwozWPBo-FvuHWx3_aYwG2WVdbb-wC6q&index=2
import numpy as np

# Assumes X (1D NumPy feature array) and y (1D target array) are already defined
n = len(X)
learning_rate = 0.01
num_iterations = 1000
m, theta = 0.0, 0.0  # slope and intercept

# Gradient Descent
for i in range(num_iterations):
    prediction = m * X + theta
    error = prediction - y
    m = m - learning_rate * (1/n) * np.dot(X, error)
    theta = theta - learning_rate * (1/n) * error.sum()

print("Gradient descent finished at m =", m, ", theta =", theta)
Linear Regression:
Use linear regression in machine learning when you have a continuous target
variable and want to model the linear relationship between input features and the
target, making it suitable for predicting numerical outcomes.
https://www.youtube.com/watch?v=CtsRRUddV2s
# Import necessary libraries
import numpy as np
from sklearn.linear_model import LinearRegression
# Load dataset
dataset = np.loadtxt("[dataset_file_name]", delimiter=",")
X = dataset[:, 0:n_features] # X is a 2D array of feature data (n_samples, n_features)
y = dataset[:, n_features] # y is a 1D array of target data
# Train the model
regr = LinearRegression()
regr.fit(X, y)
# Make predictions
predictions = regr.predict(X)
# Evaluate model performance (R^2 score)
score = regr.score(X, y)
Logistic regression:
https://www.youtube.com/watch?v=L_xBe7MbPwk
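A logistic regression sketch to go with the video above (the dataset and parameters are illustrative): logistic regression predicts class probabilities for a categorical target.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

clf = LogisticRegression(max_iter=5000)
clf.fit(X_train, y_train)

print("test accuracy:", round(clf.score(X_test, y_test), 3))
print("class probabilities for first test row:", clf.predict_proba(X_test[:1]))
```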
Unsupervised Learning
PCA - Principal Component Analysis
https://www.youtube.com/watch?v=FD4DeN81ODY
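A minimal PCA sketch (the dataset choice is illustrative): project 4-D data down to 2 principal components and inspect how much variance they retain.

```python
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA

X, y = load_iris(return_X_y=True)

pca = PCA(n_components=2)
X_2d = pca.fit_transform(X)

print("reduced shape:", X_2d.shape)  # (150, 2)
print("explained variance ratio:", pca.explained_variance_ratio_)
```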