KEMBAR78
Scikit Learn intro | PPTX
1617 – Akshay Gaonkar
scikit-learn
Machine Learning in Python
Simple and efficient tools for data mining and data analysis
Accessible to everybody, and reusable in various contexts
Built on NumPy, SciPy, and matplotlib
Open source, commercially usable - BSD license
Scikit Provides
Classification
Identifying to which category an object belongs to.
Applications: Spam detection, Image recognition.
Algorithms: SVM, nearest neighbors, random forest
Scikit Provides
Regression
Predicting a continuous-valued attribute associated with an object.
Applications: Drug response, Stock prices.
Algorithms: SVR, ridge regression, Lasso
Scikit Provides
Clustering
Automatic grouping of similar objects into sets.
Applications: Customer segmentation, Grouping experiment outcomes.
Algorithms: k-Means, spectral clustering, mean-shift
Scikit Provides
Dimensionality reduction
Reducing the number of random variables to consider.
Applications: Visualization, Increased efficiency
Algorithms: PCA, feature selection, non-negative matrix
factorization
Scikit Provides
Model selection
Comparing, validating and choosing parameters and models.
Goal: Improved accuracy via parameter tuning.
Modules: grid search, cross validation, metrics.
Scikit Provides
Preprocessing
Feature extraction and normalization.
Goal: Transforming input data such as text for use with machine
learning algorithms.
Modules: preprocessing, feature extraction.
Most Importantly
Good Documentaion
Good Community Support
Installation
Scikit-learn requires:
Python (>= 2.7 or >= 3.3) with pip
NumPy (>= 1.8.2)
SciPy (>= 0.13.3)
Scikit-learn install:
python -m pip install -U pip
pip install scikit-learn
pip install numpy
pip install scipy
pip install matplotlib
Intentionally kept blank
Who is using scikit-learn?
and many more ..
Thank You

Scikit Learn intro