KEMBAR78
Python Libraries For Data Science | PDF | Python (Programming Language) | Computer Programming
0% found this document useful (0 votes)
22 views10 pages

Python Libraries For Data Science

The document discusses several popular Python libraries for data science including NumPy, SciPy, Pandas, Scikit-Learn, matplotlib, and Seaborn. It provides brief descriptions and links for each library.

Uploaded by

Meetanshi Gor
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
22 views10 pages

Python Libraries For Data Science

The document discusses several popular Python libraries for data science including NumPy, SciPy, Pandas, Scikit-Learn, matplotlib, and Seaborn. It provides brief descriptions and links for each library.

Uploaded by

Meetanshi Gor
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 10

Python Libraries for

Data Science

MD Arshad Ahmad
15 Years+ Experience in Data Science
Mentored 100+ people
Many popular Python
toolboxes/libraries:

• NumPy

• SciPy

• Pandas
• SciKit-Learn

Visualization libraries

• matplotlib• Seaborn

3
Python Libraries for Data
Science

NumPy:
▪ introduces objects for
multidimensional arrays and matrices,
as well as functions that allow to
easily perform advanced mathematical
and statistical operations on those
objects

▪ provides vectorization of
mathematical operations on arrays
and matrices which significantly
improves the performance

▪ many other python libraries are built on


NumPy

Link: http://www.numpy.org/

Python Libraries for Data


Science
SciPy:
▪ collection of algorithms for linear
algebra, differential equations,
numerical integration, optimization,
statistics and more

▪ part of SciPy Stack

▪ built on NumPy
Link: https://www.scipy.org/scipylib/

Python Libraries for Data


Science
Pandas:
▪ adds data structures and tools
designed to work with table-like data
(similar to Series and Data Frames in
R)

▪ provides tools for data


manipulation: reshaping, merging,
sorting, slicing, aggregation etc.

▪ allows handling missing data


Link: http://pandas.pydata.org/

Python Libraries for Data


Science
SciKit-Learn:
▪ provides machine learning
algorithms: classification, regression,
clustering, model validation etc.

▪ built on NumPy, SciPy and matplotlib

Link: http://scikit-learn.org/
7

Python Libraries for Data


Science
matplotlib:
▪ python 2D plotting library which
produces publication quality figures
in a variety of hardcopy formats

▪ a set of functionalities similar to those of


MATLAB

▪ line plots, scatter plots, barcharts, histograms,


pie charts etc.

▪ relatively low-level; some effort


needed to create advanced visualization

Link: https://matplotlib.org/

Python Libraries for Data


Science
Seaborn:
▪ based on matplotlib

▪ provides high level interface for

drawing attractive statistical

graphics ▪ Similar (in style) to the


popular ggplot2 library in R

Link: https://seaborn.pydata.org/

You might also like