PANDAS Cheatsheet

Uploaded by

Masuddar Rahaman

🐼

Pandas Cheatsheet

KEY IMPORTS
Import these to start:
import pandas as pd
import numpy as np

We'll use shorthand in this cheat sheet:
df - A pandas DataFrame object
s - A pandas Series object

IMPORTING DATA
If the file you are importing is in a different directory, write the path to
the file in place of filename.

CODE WORKING

pd.read_csv(filename) From a CSV file

pd.read_table(filename) From a delimited text file (like TSV)

pd.read_excel(filename) From an Excel file

pd.read_sql(query, connection_object) Reads from a SQL table/database

pd.read_json(json_string) Reads from a JSON formatted string, URL or file.


pd.DataFrame(dict) From a dict: keys for column names, values for data as lists
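
A minimal sketch of two of the readers above. The CSV text and the column names are made up for illustration, and io.StringIO stands in for a file on disk so the example runs without any external file.

```python
import io
import pandas as pd

# Hypothetical CSV content; io.StringIO stands in for a real file path.
csv_text = "name,score\nasha,81\nbilal,67\n"
df_csv = pd.read_csv(io.StringIO(csv_text))

# The same data built from a dict: keys become column names,
# and each list becomes that column's values.
df_dict = pd.DataFrame({"name": ["asha", "bilal"], "score": [81, 67]})

print(df_csv)
print(df_dict)
```

In real use, the io.StringIO(...) argument would simply be the filename or path.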

EXPORTING DATA
CODE WORKING

df.to_csv(filename) Writes to a CSV file

df.to_excel(filename) Writes to an Excel file

df.to_sql(table_name, connection_object) Writes to a SQL table

df.to_json(filename) Writes to a file in JSON format
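
A short sketch of the writers above, under two assumptions: the sample data and the table name "weather" are invented, and an in-memory SQLite database from Python's standard library plays the role of connection_object.

```python
import sqlite3
import pandas as pd

df = pd.DataFrame({"city": ["Pune", "Oslo"], "temp_c": [31.5, 4.0]})

# With no filename, to_csv returns the CSV text instead of writing a file.
# index=False leaves the row labels out of the output.
csv_text = df.to_csv(index=False)

# Round trip through an in-memory SQLite database.
conn = sqlite3.connect(":memory:")
df.to_sql("weather", conn, index=False)
df_back = pd.read_sql("SELECT * FROM weather", conn)
conn.close()

print(csv_text)
print(df_back)
```

Passing index=False is a common choice when the index is just the default 0, 1, 2, ... numbering and carries no information.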

VIEWING/INSPECTING DATA
CODE WORKING

df.head(n) First n rows of the DataFrame

df.tail(n) Last n rows of the DataFrame

df.shape Number of rows and columns

df.info( ) Index, Datatype and Memory information

df.describe( ) Summary statistics for numerical columns

s.value_counts(dropna=False) Views unique values and counts

df.apply(pd.Series.value_counts) Unique values and counts for all columns
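
The inspection calls above can be tried on a small frame. The data here is invented, with one missing value per column so the effect of dropna=False is visible.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({
    "grade": ["a", "b", "a", None],
    "marks": [90.0, 72.5, 88.0, np.nan],
})

print(df.shape)    # (number of rows, number of columns)
print(df.head(2))  # first two rows
df.info()          # prints index, dtypes and memory usage

# describe() summarises numeric columns; its "count" row ignores missing values.
summary = df.describe()

# dropna=False keeps missing values as their own entry in the counts.
counts = df["grade"].value_counts(dropna=False)
print(counts)
```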

SELECTION
CODE WORKING

df[col] Returns the column with label col as a Series

df[[col1, col2]] Returns columns as a new DataFrame

s.iloc[0] Selection by position

s.loc[0] Selection by index label

df.iloc[0, :] First row

df.iloc[0, 0] First element of first column
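
The difference between iloc and loc is easiest to see when index labels and positions disagree. In this sketch the Series is given a deliberately shuffled integer index; all values are illustrative.

```python
import pandas as pd

# Integer labels deliberately out of order, so label and position differ.
s = pd.Series(["x", "y", "z"], index=[2, 0, 1])

first_by_position = s.iloc[0]  # element at position 0
value_at_label_0 = s.loc[0]    # element whose index label is 0

df = pd.DataFrame({"a": [1, 2], "b": [3, 4]})
first_row = df.iloc[0, :]      # Series holding the first row
top_left = df.iloc[0, 0]       # scalar in the first row, first column
two_cols = df[["a", "b"]]      # a list of labels returns a DataFrame

print(first_by_position, value_at_label_0)
```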

DATA CLEANING
CODE WORKING

df.columns = ['a', 'b', 'c'] Renames columns

df.isnull() Checks for null values, returns a Boolean DataFrame

df.notnull() Opposite of df.isnull()

df.dropna() Drops all rows that contain null values

df.dropna(axis=1) Drops all columns that contain null values

df.dropna(thresh=n) Drops all rows that have fewer than n non-null values

df.fillna(x) Replaces all null values with x

s.fillna(s.mean()) Replaces all null values with the mean (mean can be replaced with almost any function from the statistics section)

s.astype(float) Converts the datatype of the series to float

s.replace(1, 'one') Replaces all values equal to 1 with ‘one’

s.replace([1, 3], ['one', 'three']) Replaces all 1 with ‘one’ and 3 with ‘three’

df.rename(columns=lambda x: x + 1) Mass renaming of columns (here, adds 1 to each numeric column label)

df.rename(columns={'old_name': 'new_name'}) Selective renaming

df.set_index('column_one') Changes the index

df.rename(index = lambda x: x + 1) Mass renaming of index (here, adds 1 to each numeric index label)
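
A small worked example of the null-handling calls above. The frame and its column names are invented so that each call has a visible effect.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({
    "a": [1.0, np.nan, 3.0],
    "b": [np.nan, np.nan, 6.0],
    "c": [7.0, 8.0, 9.0],
})

null_mask = df.isnull()            # Boolean DataFrame, True where a value is missing
rows_complete = df.dropna()        # keeps only rows with no missing values
cols_complete = df.dropna(axis=1)  # keeps only columns with no missing values
rows_thresh = df.dropna(thresh=2)  # keeps rows with at least 2 non-null values

# Fill the gaps in one column with that column's mean.
a_filled = df["a"].fillna(df["a"].mean())

# Rename a single column, leaving the others untouched.
renamed = df.rename(columns={"a": "alpha"})

print(rows_thresh)
```

None of these calls modify df in place; each returns a new object, so assign the result if you want to keep it.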

FILTER, SORT, & GROUPBY

CODE WORKING

df[df[col] > 5] Rows where the col column is greater than 5

df[(df[col] > 5) & (df[col] < 7)] Rows where 7 > col > 5

df.sort_values(col1) Sorts values by col1 in ascending order

df.sort_values(col2, ascending = False) Sorts values by col2 in descending order

df.sort_values([col1, col2], ascending = [True, False]) Sorts values by col1 in ascending order, then col2 in descending order

df.groupby(col) Returns a groupby object for values from one column


df.groupby([col1, col2]) Returns a groupby object for values from multiple columns

df.groupby(col1)[col2].mean( ) Returns the mean of the values in col2, grouped by the values in col1 (mean can be replaced with almost any function from the statistics section)

df.groupby(col1).agg(np.mean) Finds the average across all columns for every unique col1 group

df.apply(np.mean) Applies a function across each column

df.apply(np.max, axis = 1) Applies a function across each row.
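
The filter, sort and groupby rows above, sketched on an invented frame. One assumption to flag: the aggregation is written as agg("mean") rather than agg(np.mean); the string form gives the same averages here and, as far as I know, avoids a deprecation warning in recent pandas releases.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({
    "team": ["x", "y", "x", "y"],
    "points": [4, 6, 8, 9],
    "fouls": [2, 1, 3, 0],
})

# Each condition needs its own parentheses; & combines them element-wise.
mid = df[(df["points"] > 5) & (df["points"] < 9)]

# team ascending, then points descending within each team.
ordered = df.sort_values(["team", "points"], ascending=[True, False])

# Mean of one column per group, then mean of every numeric column per group.
mean_points = df.groupby("team")["points"].mean()
team_means = df.groupby("team").agg("mean")

# Row-wise maximum over the numeric columns only.
row_max = df[["points", "fouls"]].apply(np.max, axis=1)

print(ordered)
print(team_means)
```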

JOIN/COMBINE
CODE WORKING

pd.concat([df1, df2]) Adds the rows in df2 to the end of df1 (columns should be identical); replaces df1.append(df2), which was removed in pandas 2.0

pd.concat([df1, df2], axis=1) Adds the columns in df2 to the end of df1 (rows should be identical)
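
A sketch of both ways of combining frames with pd.concat. The two frames are invented and share the same column names and the same default index.

```python
import pandas as pd

df1 = pd.DataFrame({"id": [1, 2], "val": ["a", "b"]})
df2 = pd.DataFrame({"id": [3, 4], "val": ["c", "d"]})

# Stack rows: df2's rows follow df1's; ignore_index renumbers the result from 0.
stacked = pd.concat([df1, df2], ignore_index=True)

# Place side by side: rows are aligned on the index, so both frames should share it.
side_by_side = pd.concat([df1, df2], axis=1)

print(stacked)
print(side_by_side)
```

Without ignore_index=True the stacked result keeps the original labels (0, 1, 0, 1), which can make later label-based selection ambiguous.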

STATISTICS
CODE WORKING

df.describe( ) Summary statistics for numerical columns

df.mean( ) Returns the mean of all columns

df.corr( ) Returns the correlation between columns in a DataFrame

df.count( ) Returns the number of non-null values in each DataFrame column

df.max( ) Returns the highest value in each column

df.min( ) Returns the lowest value in each column

df.median( ) Returns the median of each column

df.std( ) Returns the standard deviation of each column
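
The statistics calls above, applied to an invented two-column frame in which y is exactly twice x, so the results are easy to check by hand.

```python
import pandas as pd

df = pd.DataFrame({"x": [1.0, 2.0, 3.0, 4.0], "y": [2.0, 4.0, 6.0, 8.0]})

means = df.mean()      # per-column mean
medians = df.median()  # per-column median
stds = df.std()        # sample standard deviation (ddof=1 by default)
counts = df.count()    # non-null values per column
corr = df.corr()       # pairwise correlation; y = 2 * x, so x and y are perfectly correlated

print(means)
print(corr)
```

Each call returns a Series indexed by column name, except corr(), which returns a square DataFrame with one row and one column per numeric column.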
