KEMBAR78
Pybcn machine learning for dummies with python | PDF
Machine Learning
for dummies
with Python
PYBCN
Javier Arias
@javier_arilos
One day in your life
June 2016
One day in your life
Time to leave the office
One day in your life
Tesla autopilot
One day in your life
One day in your life
Playing music
One day in your life
Your photos organized
One day in your life
Machine Learning is here,
it is everywhere and
it is going to stay
About this presentation
Why Machine Learning (ML) matters
A journey on Machine Learning
Some ML technologies and resources
Some basic ML concepts, with code sample
Machine Learning is the next big thing
Are machines already intelligent?
Image-net challenge
2015: machines outperform people
Chess
1997: Deepblue defeats Kasparov
Game of Go
2016: AlphaGo wins world champion Lee Sedol
The journey
Learning about ML
MOOC - Massive Open Online Courses
Contents by the best universities and companies
Some superstar teachers
Udacity, Coursera, EdX
Udacity - Intro to Machine Learning
Pattern Recognition for Fun and Profit
- Very well organized contents
- Python + sklearn
- Sebastian Thrun
- Free
- At your own pace
Udacity - Intro to Machine Learning
Pattern Recognition for Fun and Profit
Udacity - Intro to Machine Learning
Pattern Recognition for Fun and Profit
Udacity - Deep Learning
Take machine Learning to the next level
ML branch based on algorithms that use multiple processing layers
● By Google
● Python and Tensorflow
● No wine for the moment :-(
What is Machine Learning?
Solving a complex problem
somethingfeatures
(data)
prediction
First approach, programming
tell the computer what to do in very tiny steps
… represented primarily
as if-then rules ...
Artificial Intelligence - expert systems
Machine Learning
show the computer some real world data and let it
learn from it
Machine Learning, implications
We can train computers to do things we do not
know how to do
ML example: character recognition
Not-MNIST dataset
Thousands of 28x28 grayscale images with labels
features x 1000s labels x 1000s
F G F J
ML step 1: get samples (training data)
ML step 2: choose an algorithm
Linear regression
Support Vector Mach.
k-Means
Decision Trees
Random Forests
Neural networks
Convolutional NN
Naive Bayes
ML step 3: train your algorithm
features x 1000s labels x 1000sML algorithm
F G F J
ML, last step: getting predictions
ML algorithmfeatures
(data)
prediction
D
Tricky Question
How good are our
predictions?
The Tools
The Tools
Python sklearn tensorflow
The Tools: Python
● Opensource
● Expressive
● Interpreted, dynamically typed
● Widely used many different problems
● Batteries included: Notebook, Libraries
The Tools: sklearn
● Opensource, Python
● Wonderful documentation
● Support to full ML lifecycle:
○ Feauture engineering
○ Algorithms
○ Validation
○ Datasets
The Tools: TensorFlow
● Opensource, Python
● Deep Learning
● Data flow graphs.
○ Nodes: mathematical operations
○ Edges: Tensors, multidimensional arrays
A summary of ML process
● Get features (with labels)
● Choose and configure an algorithm
● Train your algorithm
● Do predictions
● Validate your results
train your model
tr_ds, _, tr_lbl, _ = train_test_split(dataset, labels,
train_size=size,
random_state=17)
clf = LogisticRegression()
clf.fit(tr_ds, tr_lbl) # fit with train dataset and train labels
train_ds test_ds
dataset
make predictions
pred = clf.predict(test_dataset)
How good are our predictions?
accuracy
test_predicions = clf.predict(test_dataset)
acc = accuracy_score(test_labels, test_predictions)
89% accuracy
Improving prediction results
Training data
Algorithm + config
Is Machine Learning black magic?
Machine Learning is mathematics
Linear regression - predict house prices
Linear regression
Linear regression
price = m·size + n
Linear regression - starting training
Linear regression - training iteration
Linear regression - error minimized
SVM classification example
ML finds boundaries in your data
89% accuracy… It can never be too much.
First trained algorithm - 30% accuracy
75% accuracy - should we aim for 100%?
100% accuracy, now we are talking!
overfitting
Let’s recap
Questions?
Pybcn machine learning for dummies with python

Pybcn machine learning for dummies with python