0% found this document useful (0 votes)

69 views2 pages

Data Analysis CheatSheet

This cheat sheet provides essential commands and techniques for data analysis and visualization in Python using libraries like NumPy, Pandas, Matplotlib, and Seaborn. It covers topics such as importing libraries, creating and manipulating arrays and DataFrames, handling missing data, plotting, merging data, and file handling. Additionally, it includes examples of statistical operations, data binning, and removing duplicates.

Uploaded by

Aditya singh Rajput

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

69 views2 pages

Data Analysis CheatSheet

Uploaded by

Aditya singh Rajput

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 2

📘 Data Analysis & Visualization in Python - Exam Cheat Sheet

1. Importing Required Libraries

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns

2. NumPy Arrays
a1 = np.zeros((2, 3)) # 2x3 zero array
a2 = [[3, 4, 5], [7, 8, 9]]
np.add(a1, a2) # Matrix addition
np.append(a1, a2, axis=0) # Append rows
np.shape(a1) # Get shape

3. Pandas DataFrames
df = pd.DataFrame({'Name': ['Amit', 'Neha', 'Amit'], 'Age': [23, 30,
25]})
df.head() # First 5 rows
df.describe() # Stats summary
df.info() # Structure
df.columns # Column names
df['Age'].mean() # Average age
df['Name'].nunique() # Unique names
df.groupby('Name')['Age'].mean() # Group avg

4. Merging and Joining

df1 = pd.DataFrame({'RollNo': [1, 2], 'Name': ['Ravi', 'Megha']})
df2 = pd.DataFrame({'RollNo': [2, 3], 'Name': ['Megha', 'Karan']})
pd.merge(df1, df2, on='Name', how='inner') # Merge on Name
pd.merge(df1, df2, on=['RollNo', 'Name'], how='inner') # Exact match

5. Handling Missing Data

df.dropna(thresh=2) # Keep rows with >=2 non-NA
df.fillna(method='ffill', limit=2) # Forward fill

6. Plotting and Visualization

plt.plot(days, rainfall, 'ro-') # Line Plot
plt.scatter(df['Salary'], df['Age']) # Scatter Plot
df['column'].value_counts().plot(kind='bar') # Bar Plot
sns.boxplot(data=df, y='sales') # Box Plot
sns.heatmap(df.corr(), annot=True) # Heatmap
7. MultiIndex and Swap Level
df.index.names = ['key1', 'key2']
df = df.swaplevel('key1', 'key2')
df = df.sort_index(level=0)

8. Binning Data
ages = [21, 25, 33, 45, 62]
pd.cut(ages, bins=[18, 25, 35, 60, 100], labels=['Youth', 'YoungAdult',
'MiddleAged', 'Senior'])
pd.qcut(ages, 4) # Equal-sized bins

9. File Handling
pd.read_csv("data.csv") # Read CSV
pd.read_excel("data.xlsx") # Read Excel
df.to_csv("out.csv", index=False) # Save CSV
df.to_excel("out.xlsx", index=False) # Save Excel
pd.read_excel("data.xlsx", index_col="Employee ID")

10. Subplots and Save Plot

fig, axs = plt.subplots(1, 2)
axs[0].scatter(df['Salary'], df['Age'])
axs[1].bar(df['Role'].value_counts().index, df['Role'].value_counts())
plt.savefig("plot.png")

11. Correlation and Covariance

df[['Hours', 'Marks']].corr() # Correlation
df[['Hours', 'Marks']].cov() # Covariance

12. Remove Duplicate Rows

df.drop_duplicates(['col1', 'col2'], keep='last')

13. Series Rank & Comparison

s1 = pd.Series([5, 0, -4, 8])
s1.rank() # Rank values
df2 > df1['B'].min() # Element-wise comparison

14. Experience: Practice Scenario

df = pd.read_csv("employee.csv")
df.groupby('Role')['Salary'].sum() # Role-wise total
salary
df[df['Gender'] == 'Female'].shape[0] # Female count
df[df['Salary'] >= df['Salary'].mean()] # Filter by average
salary

Pandas
No ratings yet
Pandas
13 pages
Cheat Sheet - Pandas
No ratings yet
Cheat Sheet - Pandas
6 pages
Data Visualization & Preprocessing Guide
No ratings yet
Data Visualization & Preprocessing Guide
18 pages
IP Employee Project
No ratings yet
IP Employee Project
31 pages
EDA Cheat Sheet
No ratings yet
EDA Cheat Sheet
7 pages
Pandas Trampas
No ratings yet
Pandas Trampas
9 pages
Pandas Cheat Sheet
No ratings yet
Pandas Cheat Sheet
17 pages
EDA With Pandas
No ratings yet
EDA With Pandas
8 pages
Pandas Dataframe All Operations 1735471870
No ratings yet
Pandas Dataframe All Operations 1735471870
4 pages
Interactive Data Analysis With Jupyter Cheatsheet 1731972443
No ratings yet
Interactive Data Analysis With Jupyter Cheatsheet 1731972443
10 pages
12 IP Practial Programs 2025-26
No ratings yet
12 IP Practial Programs 2025-26
10 pages
EDA Step by Step
No ratings yet
EDA Step by Step
2 pages
Exploratory Data Analysis (Eda) With Pandas: (Cheatsheet)
No ratings yet
Exploratory Data Analysis (Eda) With Pandas: (Cheatsheet)
7 pages
Pandas Dataframe Cheat Sheet
No ratings yet
Pandas Dataframe Cheat Sheet
3 pages
Pandas Cheat Sheet
No ratings yet
Pandas Cheat Sheet
2 pages
Kunj Project 1
No ratings yet
Kunj Project 1
34 pages
NumPy and Pandas Step
No ratings yet
NumPy and Pandas Step
9 pages
Programs of Python Pandas
No ratings yet
Programs of Python Pandas
15 pages
Python Pandas: 12 Data Manipulation Techniques
100% (2)
Python Pandas: 12 Data Manipulation Techniques
19 pages
Learn Pandas
No ratings yet
Learn Pandas
37 pages
Pandas Fuction Notes
No ratings yet
Pandas Fuction Notes
3 pages
Python and MySQL Real Estate Project
No ratings yet
Python and MySQL Real Estate Project
46 pages
Solution
No ratings yet
Solution
8 pages
Da Pra Week-8 (Karthik S) - 074713
No ratings yet
Da Pra Week-8 (Karthik S) - 074713
9 pages
Eda Code Snippets
No ratings yet
Eda Code Snippets
17 pages
Python Assignment-2
No ratings yet
Python Assignment-2
3 pages
Geo Python Doc (1) 7,8 Bavesh
No ratings yet
Geo Python Doc (1) 7,8 Bavesh
9 pages
Universal Data Analytics Algorithm
No ratings yet
Universal Data Analytics Algorithm
51 pages
Cheat Sheet
No ratings yet
Cheat Sheet
15 pages
Sakina Assign1 Batch3
No ratings yet
Sakina Assign1 Batch3
8 pages
Data Preprocess Steps
No ratings yet
Data Preprocess Steps
2 pages
Data Preprocessing & Visualization1
No ratings yet
Data Preprocessing & Visualization1
2 pages
Exp 8 - LM
No ratings yet
Exp 8 - LM
10 pages
Project Report Certificate: Hardware Requirement
No ratings yet
Project Report Certificate: Hardware Requirement
11 pages
Observation: Import As Import As Import As Import As
No ratings yet
Observation: Import As Import As Import As Import As
31 pages
Grade 12 - IP Practicals (1 To 9)
No ratings yet
Grade 12 - IP Practicals (1 To 9)
12 pages
Assignment Ds Midterm
No ratings yet
Assignment Ds Midterm
2 pages
Viksit Ip Project File
No ratings yet
Viksit Ip Project File
33 pages
Employee Info
No ratings yet
Employee Info
2 pages
Python Cheat Sheet: Pandas - Numpy - Sklearn Matplotlib - Seaborn BS4 - Selenium - Scrapy
100% (3)
Python Cheat Sheet: Pandas - Numpy - Sklearn Matplotlib - Seaborn BS4 - Selenium - Scrapy
9 pages
Parth IP Employee Management Project
No ratings yet
Parth IP Employee Management Project
32 pages
Python For Machine Learning
No ratings yet
Python For Machine Learning
66 pages
DataFrame 1
No ratings yet
DataFrame 1
3 pages
Python CheatSheet
No ratings yet
Python CheatSheet
2 pages
Lab Record IP
No ratings yet
Lab Record IP
13 pages
Pandas Cheat Sheet
No ratings yet
Pandas Cheat Sheet
5 pages
Ali Bhai's IP Project
No ratings yet
Ali Bhai's IP Project
31 pages
GR12 Record Programs 6TH Onwards
No ratings yet
GR12 Record Programs 6TH Onwards
18 pages
Kunj Project 1
No ratings yet
Kunj Project 1
34 pages
Step-by-Step Explanation of Python Data Preprocessing Script
No ratings yet
Step-by-Step Explanation of Python Data Preprocessing Script
9 pages
Data Analysis Exam for CS Majors
No ratings yet
Data Analysis Exam for CS Majors
12 pages
Dav 2024 Pyq
No ratings yet
Dav 2024 Pyq
7 pages
Dataframe in Pandas - Cheatsheet
No ratings yet
Dataframe in Pandas - Cheatsheet
8 pages
Python Pandas-DataFrames Complete - Jupyter Notebook
No ratings yet
Python Pandas-DataFrames Complete - Jupyter Notebook
34 pages
Informatics Practices Practical File
No ratings yet
Informatics Practices Practical File
8 pages
EDS - Python Cheat Sheet
0% (1)
EDS - Python Cheat Sheet
3 pages
Vantika Kamra's Practical File 12 Diamond (26600872)
No ratings yet
Vantika Kamra's Practical File 12 Diamond (26600872)
46 pages
Pandas For Python Pro Level Cheat Sheet
No ratings yet
Pandas For Python Pro Level Cheat Sheet
14 pages
Name: Booking ID:: Sumit Kumar +14 D/AU/220225/1153567
No ratings yet
Name: Booking ID:: Sumit Kumar +14 D/AU/220225/1153567
1 page
Successive Differentiation Guide
No ratings yet
Successive Differentiation Guide
10 pages
The Evolving Landscape of Cyber Security and Cyberspace
No ratings yet
The Evolving Landscape of Cyber Security and Cyberspace
8 pages
Threats in The Digital World Data Breaches and Cyber Attacks
No ratings yet
Threats in The Digital World Data Breaches and Cyber Attacks
9 pages
Cyber Security 14SL
No ratings yet
Cyber Security 14SL
14 pages
Number System Sheet-3 - 423278 - Crwill
No ratings yet
Number System Sheet-3 - 423278 - Crwill
4 pages
Wind Pressure Calculation ASCE 7-05
100% (1)
Wind Pressure Calculation ASCE 7-05
8 pages
Fundamentals of Python:: Chapter 2: Software Development, Data Types, and Expressions
No ratings yet
Fundamentals of Python:: Chapter 2: Software Development, Data Types, and Expressions
30 pages
The Democratization of Hedge Funds - (J.P. Morgan Asset Management)
No ratings yet
The Democratization of Hedge Funds - (J.P. Morgan Asset Management)
4 pages
Lab - Rotational Inertia Lab
No ratings yet
Lab - Rotational Inertia Lab
4 pages
18IS62 - Software Testing - Question Bank
No ratings yet
18IS62 - Software Testing - Question Bank
8 pages
Syllabus
No ratings yet
Syllabus
107 pages
Farouki Presentation
No ratings yet
Farouki Presentation
45 pages
ACTIVITY 5 Techniques of Integration Part 2 PDF
No ratings yet
ACTIVITY 5 Techniques of Integration Part 2 PDF
2 pages
GR 3 Math Chapter 7
No ratings yet
GR 3 Math Chapter 7
15 pages
Case Study On AlphaGo Zero
100% (1)
Case Study On AlphaGo Zero
21 pages
Grade 9 4. Herons Formula Worksheet - 2025-26
No ratings yet
Grade 9 4. Herons Formula Worksheet - 2025-26
1 page
It (r22) 3-1 Artificial Intelligence Digital Notes
No ratings yet
It (r22) 3-1 Artificial Intelligence Digital Notes
123 pages
chapter3 答案
No ratings yet
chapter3 答案
11 pages
Rubik's Cube Solver by Ben Botto
No ratings yet
Rubik's Cube Solver by Ben Botto
17 pages
Dickens Hard Times 1854
No ratings yet
Dickens Hard Times 1854
280 pages
Health Statistics Study Guide
No ratings yet
Health Statistics Study Guide
13 pages
Grade 10 Math: Composite Functions
No ratings yet
Grade 10 Math: Composite Functions
2 pages
Nonhomogeneous Differential Equations
No ratings yet
Nonhomogeneous Differential Equations
13 pages
Student Exploration: Golf Range
No ratings yet
Student Exploration: Golf Range
6 pages
The Determinacy of Long Games Reprint 2015 Itay Neeman Download
100% (4)
The Determinacy of Long Games Reprint 2015 Itay Neeman Download
87 pages
ACE AP Physics 1 by RitvikRustagi
No ratings yet
ACE AP Physics 1 by RitvikRustagi
173 pages
Part 1
No ratings yet
Part 1
412 pages
X Project Topics
No ratings yet
X Project Topics
1 page
Skills Builder 8 Workbook Answers: Integers, Powers and Roots
100% (2)
Skills Builder 8 Workbook Answers: Integers, Powers and Roots
26 pages
DAM Class 21-24 Regression Analysis
No ratings yet
DAM Class 21-24 Regression Analysis
93 pages
First Summative Test in Math 7 For Quarter 2 Written Works (Part 1)
No ratings yet
First Summative Test in Math 7 For Quarter 2 Written Works (Part 1)
2 pages
PySpark RDD Basics PDF
No ratings yet
PySpark RDD Basics PDF
1 page
Eee 543 1 Reliability - Concepts 2024
No ratings yet
Eee 543 1 Reliability - Concepts 2024
15 pages
CH - 12 LINEAR PROGRAMMING
No ratings yet
CH - 12 LINEAR PROGRAMMING
27 pages

Data Analysis CheatSheet

Uploaded by

Data Analysis CheatSheet

Uploaded by

📘 Data Analysis & Visualization in Python - Exam Cheat Sheet

1. Importing Required Libraries

4. Merging and Joining

5. Handling Missing Data

6. Plotting and Visualization

10. Subplots and Save Plot

11. Correlation and Covariance

12. Remove Duplicate Rows

13. Series Rank & Comparison

14. Experience: Practice Scenario

You might also like