Python pandas Library
Data Analysis in Python using Pandas
Pandas
o Python data analysis library
o Built on top of NumPy
o Name derived from "panel data"
o Used in production at many companies
The ideal tool for data scientists
o Managing data
o Cleaning data
o Analyzing data
o Modeling data
o Organizing data in a form suitable for plotting or tabular display
Adding index to the series
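The slide's example did not survive extraction, so here is a minimal sketch of the idea. It assumes the slide showed the `index` argument of `pd.Series`; the names and values are hypothetical sample data.

```python
import pandas as pd

# Without an explicit index, pandas assigns integer positions 0, 1, 2, ...
default = pd.Series([90, 85, 77])

# Passing index= attaches a label to each value (hypothetical labels).
scores = pd.Series([90, 85, 77], index=["alice", "bob", "carol"])

print(list(default.index))
print(list(scores.index))
```

The labels must match the data in length, otherwise pandas raises a ValueError.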
Extracting data by index
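A short sketch of common ways to pull values out of a Series by index. The Series below is hypothetical sample data, not taken from the original slide.

```python
import pandas as pd

# Hypothetical sample data with string labels.
s = pd.Series([90, 85, 77], index=["alice", "bob", "carol"])

one = s["bob"]                    # single label returns a scalar
several = s[["alice", "carol"]]   # list of labels returns a Series
by_label = s.loc["carol"]         # explicit label-based lookup
by_position = s.iloc[0]           # explicit position-based lookup

print(one, by_label, by_position)
print(several)
```

Using `.loc` and `.iloc` makes it explicit whether a lookup is by label or by position, which helps when the labels are themselves integers.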
Convert Python Dict to Pandas Series
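As an illustration, a dict can be passed straight to `pd.Series`: the keys become the index and the values become the data. The dict contents here are made up for the example.

```python
import pandas as pd

# Hypothetical dict: keys become index labels, values become data.
data = {"a": 1, "b": 2, "c": 3}
s = pd.Series(data)

# An explicit index selects and orders keys; labels missing from the
# dict (here "d") are filled with NaN.
subset = pd.Series(data, index=["a", "b", "d"])

print(s)
print(subset)
```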
Changing Series’ Index
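The original slide content is missing, so the following is only a sketch of three ways an index can be changed: replacing all labels, renaming some labels, and reordering with `reindex`. The data is hypothetical.

```python
import pandas as pd

s = pd.Series([10, 20, 30], index=["a", "b", "c"])

# 1. Replace every label by assigning to .index (done on a copy here).
relabeled = s.copy()
relabeled.index = ["x", "y", "z"]

# 2. Rename selected labels with a mapping; returns a new Series.
renamed = s.rename({"a": "A"})

# 3. Reorder (or select) existing labels; values follow their labels.
reordered = s.reindex(["c", "a", "b"])

print(relabeled)
print(renamed)
print(reordered)
```

Note the difference: assigning to `.index` relabels values in place by position, while `reindex` keeps each value attached to its original label.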
DataFrame
o A pandas DataFrame is a data structure containing an ordered collection of columns.
o Each column may hold values of a different type: numeric, string, boolean, etc.
o A DataFrame has both a row index and a column index.
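To make the points above concrete, here is a small sketch of a DataFrame with mixed column types and explicit row labels. The column names, row labels, and values are hypothetical.

```python
import pandas as pd

# Each column holds a different type: string, integer, boolean.
df = pd.DataFrame(
    {"name": ["Ann", "Ben"], "age": [31, 45], "member": [True, False]},
    index=["r1", "r2"],  # row index (hypothetical labels)
)

print(df.index.tolist())    # row labels
print(df.columns.tolist())  # column labels
print(df.dtypes)            # one dtype per column
print(df.loc["r2", "age"])  # a cell is addressed by row and column label
```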
Creating a DataFrame
o A pandas DataFrame can be created from various inputs, such as:
-- Lists
-- Dicts
-- Series
-- NumPy ndarrays
-- Another DataFrame
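One possible sketch of each input type listed above, using the `pd.DataFrame` constructor. All names and values are illustrative.

```python
import numpy as np
import pandas as pd

# From a list of rows (column names supplied separately).
from_lists = pd.DataFrame([[1, "a"], [2, "b"]], columns=["num", "letter"])

# From a dict mapping column name to column values.
from_dict = pd.DataFrame({"num": [1, 2], "letter": ["a", "b"]})

# From a dict of Series; rows are aligned on the Series' indexes.
from_series = pd.DataFrame(
    {"num": pd.Series([1, 2]), "letter": pd.Series(["a", "b"])}
)

# From a 2-D NumPy array.
from_array = pd.DataFrame(np.arange(6).reshape(3, 2), columns=["x", "y"])

# From another DataFrame; copy=True avoids sharing the underlying data.
from_frame = pd.DataFrame(from_dict, copy=True)

print(from_lists)
print(from_array)
```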
Thanks!
Any questions?
Python Pandas Input/Output Tools
o The pandas I/O API is a set of top-level reader functions, such as pd.read_csv(), that generally return a pandas object.
o The two main functions for reading text files are read_csv() and read_table(). Both parse tabular data into a DataFrame object, inferring column types along the way.
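A minimal sketch of the two readers. To stay self-contained it reads from in-memory strings via `io.StringIO` rather than files on disk; the sample text is hypothetical. `read_csv` defaults to a comma separator and `read_table` to a tab.

```python
import io

import pandas as pd

# Hypothetical sample data: the same table, comma- and tab-separated.
csv_text = "name,age\nAnn,31\nBen,45\n"
tsv_text = "name\tage\nAnn\t31\nBen\t45\n"

df_csv = pd.read_csv(io.StringIO(csv_text))    # default sep=","
df_tsv = pd.read_table(io.StringIO(tsv_text))  # default sep="\t"

print(df_csv)
print(df_csv.dtypes)  # "age" is inferred as an integer column
```

With a real file, the same calls take a path instead, for example `pd.read_csv("data.csv")`, where the file name is a placeholder.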
Python Pandas
Python Pandas Concatenation
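The slide body is missing from the extraction, so this is a sketch that assumes it covered `pd.concat`. The frames below are hypothetical.

```python
import pandas as pd

a = pd.DataFrame({"id": [1, 2], "val": ["x", "y"]})
b = pd.DataFrame({"id": [3, 4], "val": ["z", "w"]})
c = pd.DataFrame({"score": [0.5, 0.7]})

# Stack rows; ignore_index=True renumbers the result 0..n-1.
stacked = pd.concat([a, b], ignore_index=True)

# Place frames side by side; rows are aligned on the index.
side_by_side = pd.concat([a, c], axis=1)

print(stacked)
print(side_by_side)
```

Without `ignore_index=True`, the stacked result keeps the original row labels, so the index would contain 0 and 1 twice.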
