Machine Learning With Python
Machine Learning
One of the fields of study gaining the most popularity within computer science is machine learning. Many of the services we use in our daily lives, like Google, Gmail, Netflix, Spotify, or Amazon, rely on the tools that Machine Learning provides to deliver an increasingly personalized service and thus gain competitive advantages over their rivals.
But what exactly is Machine Learning? Machine Learning is the design and study of computer tools that use past experience to make future decisions; it is the study of programs that can learn from data. The fundamental objective of Machine Learning is to generalize, that is, to induce an unknown rule from examples where that rule is applied. The most typical example of Machine Learning in action is the filtering of junk email, or spam. By observing thousands of emails that have previously been marked as junk, spam filters learn to classify new messages.
Machine Learning combines concepts and techniques from different areas of knowledge, such as mathematics, statistics, and computer science; for this reason, there are many ways to learn the discipline. The main types of learning are:
Supervised learning
Unsupervised learning
Reinforcement learning
The first technique, the simple train/test split, divides our dataset into one or more training subsets and a separate evaluation set. That is, we do not give all of our data to the algorithm during training; instead, we hold back part of the data to evaluate the effectiveness of the model. What we seek with this is to prevent the same data we use for training from being the data we use for evaluation. In this way we can analyze more precisely how the model behaves as we give it more training data, and detect the critical point at which the model stops generalizing and begins to overfit the training data.
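As a quick illustration, here is a minimal sketch of this kind of split using scikit-learn's train_test_split helper; the iris dataset simply stands in for our own data:

```python
# A minimal sketch of a train/test split with scikit-learn.
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split

X, y = load_iris(return_X_y=True)

# Hold back 30% of the data for evaluation; the model never sees it
# during training, so a score on X_test estimates generalization.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0
)
print(X_train.shape, X_test.shape)  # (105, 4) (45, 4)
```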
Cross-validation is a more sophisticated procedure than the previous one. Instead of just obtaining a single estimate of generalization effectiveness, the idea is to conduct a statistical analysis and obtain other measures of estimated performance, such as the mean and variance, and thus understand how performance is expected to vary across different datasets. This variation is fundamental for assessing our confidence in the performance estimate. Cross-validation also makes better use of a limited dataset; unlike a simple split of the data into a training set and an evaluation set, cross-validation calculates its estimates over the entire dataset by performing multiple splits and systematically exchanging training data and evaluation data.
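A minimal sketch of this idea with scikit-learn's cross_val_score; the logistic regression model and the iris data are placeholders for any model and dataset:

```python
# 5-fold cross-validation: every sample is used for both training and
# evaluation across the five systematic splits.
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

X, y = load_iris(return_X_y=True)
scores = cross_val_score(LogisticRegression(max_iter=1000), X, y, cv=5)

# The mean estimates performance; the spread across folds tells us how
# much that estimate is expected to vary with different datasets.
print(scores.mean(), scores.std())
```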
Collect the data. We can collect data from many sources: for example, by extracting it from a website, obtaining it through an API, or reading it from a database. We can also use devices that collect the data for us, or use data that is publicly available. The number of options we have for collecting data is endless! This step seems obvious, but it is one of the steps that brings the most complications and takes the most time.
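A small sketch of two common collection paths; the CSV URL points at the public seaborn-data copy of the iris dataset, while the database file and table names below are hypothetical placeholders:

```python
import sqlite3

import pandas as pd

# From a publicly available dataset (CSV over HTTP):
url = "https://raw.githubusercontent.com/mwaskom/seaborn-data/master/iris.csv"
df = pd.read_csv(url)
print(df.head())

# Or from a local database via a SQL query (hypothetical file and table):
# conn = sqlite3.connect("mydata.db")
# df = pd.read_sql_query("SELECT * FROM measurements", conn)
```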
Preprocess the data. Once we have the data, we need to make sure it is in the correct format to feed our learning algorithm. Performing several preprocessing tasks before we can use the data is practically inevitable. Even so, this step is usually much simpler than the previous one.
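For example, a minimal preprocessing sketch with pandas and scikit-learn; the column names here are made up for illustration:

```python
import pandas as pd
from sklearn.preprocessing import StandardScaler

# A tiny example table with a missing value (hypothetical columns).
df = pd.DataFrame({"age": [25, None, 40],
                   "income": [30000.0, 52000.0, 61000.0]})

# Fill missing values and scale the features so the learning algorithm
# receives data in the format it expects.
df["age"] = df["age"].fillna(df["age"].mean())
X = StandardScaler().fit_transform(df[["age", "income"]])
print(X)
```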
Explore the data. Once we have the data in the correct format, we can perform a preliminary analysis to correct cases of missing values, or try to find at first glance some pattern that facilitates the construction of the model. At this stage, statistical measures and 2- and 3-dimensional plots are very useful for getting a visual idea of how our data behave. Here we can detect outliers that we should discard, or find the features that have the most influence when making a prediction.
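A sketch of such a preliminary exploration with pandas and matplotlib, again using the iris dataset as a stand-in for our own data:

```python
import matplotlib.pyplot as plt
from sklearn.datasets import load_iris

df = load_iris(as_frame=True).frame

print(df.describe())    # summary statistics per feature
print(df.isna().sum())  # count of missing values per column

# A 2-D scatter plot can reveal patterns, influential features, and outliers.
df.plot.scatter(x="sepal length (cm)", y="petal length (cm)",
                c="target", colormap="viridis")
plt.show()
```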
Use the model. In this last stage, we put our model to work on the real problem. Here we can also measure its performance, which may force us to revisit all the previous steps.
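One common way to do this is to persist the fitted model and reload it later to predict on new samples; a minimal sketch with joblib (the file name is arbitrary):

```python
import joblib
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression

# Fit a model on the full dataset and save it to disk.
X, y = load_iris(return_X_y=True)
model = LogisticRegression(max_iter=1000).fit(X, y)
joblib.dump(model, "model.joblib")

# Later (e.g., in production), reload it and predict on a new sample.
model = joblib.load("model.joblib")
print(model.predict([[5.1, 3.5, 1.4, 0.2]]))
```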
As I always like to mention, one of the great advantages that Python offers over other programming languages is how large and prolific its developer community is; a community that has contributed a great variety of first-class libraries that extend the functionality of the language. For Machine Learning, the main libraries we can use are:
Scikit-Learn
Scikit-learn is the main library available for working with Machine Learning. It includes implementations of a large number of learning algorithms. We can use it for classification, feature extraction, regression, clustering, dimensionality reduction, model selection, and preprocessing. It has an API that is consistent across all models, and it integrates very well with the rest of the scientific packages that Python offers. The library also facilitates evaluation, diagnostics, and cross-validation tasks, since it provides several helper functions for carrying them out very simply.
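The consistent API is easy to see in practice: very different algorithms all expose the same fit / predict / score methods. A small sketch, with the iris data as a placeholder:

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

X_train, X_test, y_train, y_test = train_test_split(
    *load_iris(return_X_y=True), random_state=0
)

# Two unrelated algorithms, one identical interface.
for model in (DecisionTreeClassifier(), SVC()):
    model.fit(X_train, y_train)
    print(type(model).__name__, model.score(X_test, y_test))
```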
Statsmodels
Statsmodels is another great library, focused on statistical models, which is used mainly for predictive and exploratory analysis. Like Scikit-learn, it also integrates very well with the other scientific packages in Python. If we want to fit linear models, conduct statistical analyses, or perhaps do a bit of predictive modeling, then Statsmodels is the ideal library. The statistical tests it offers are quite broad and cover validation tasks for most cases.
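A minimal sketch of fitting an ordinary least squares linear model with Statsmodels, using synthetic data generated just for the example:

```python
import numpy as np
import statsmodels.api as sm

# Synthetic data: y depends linearly on x, plus noise.
rng = np.random.default_rng(0)
x = rng.normal(size=100)
y = 2.0 * x + 1.0 + rng.normal(scale=0.5, size=100)

X = sm.add_constant(x)       # add the intercept term
results = sm.OLS(y, X).fit()
print(results.summary())     # coefficients, p-values, confidence intervals
```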
PyMC
PyMC is a library for probabilistic programming and Bayesian statistical modeling, built around Markov chain Monte Carlo (MCMC) sampling methods.
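A minimal Bayesian sketch, assuming PyMC v4 or later (where the package is imported as pymc): estimate the mean of some observed data by sampling from the posterior.

```python
import numpy as np
import pymc as pm

# Synthetic observations with a true mean of 3.0.
data = np.random.default_rng(0).normal(loc=3.0, size=50)

with pm.Model():
    mu = pm.Normal("mu", mu=0.0, sigma=10.0)          # prior over the mean
    pm.Normal("obs", mu=mu, sigma=1.0, observed=data)  # likelihood
    idata = pm.sample(1000)                            # MCMC posterior samples

print(idata.posterior["mu"].mean())  # posterior estimate of the mean
```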
NLTK
NLTK is the leading library for natural language processing (NLP). It provides easy-to-use interfaces to over 50 corpora and lexical resources, such as WordNet, along with a suite of text processing libraries for classification, tokenization, tagging, parsing, and semantic reasoning.
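A short sketch of tokenization, part-of-speech tagging, and a WordNet lookup; note that the exact resource names to download can vary with the NLTK version:

```python
import nltk

# One-time downloads of the required data (names may differ by version).
nltk.download("punkt")
nltk.download("averaged_perceptron_tagger")
nltk.download("wordnet")
from nltk.corpus import wordnet

tokens = nltk.word_tokenize("Machine learning lets programs learn from data.")
print(nltk.pos_tag(tokens))                      # part-of-speech tagging
print(wordnet.synsets("learn")[0].definition())  # lexical resource lookup
```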
Obviously, here I am only listing a few of the many libraries that exist in Python for working on Machine Learning problems; I invite you to do your own research on the topic.