0% found this document useful (0 votes)

86 views6 pages

Pandas - Basics - Practice: Consider The Following Python Dictionary Data and Python List Labels

The document describes various operations performed on a pandas DataFrame containing bird observation data. The DataFrame is created from a dictionary of data and list of index labels. Summary statistics are displayed and various data selections, filters, and transformations are applied, such as calculating group means, sorting, and replacing values.

Uploaded by

Abhishek Kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

86 views6 pages

Pandas - Basics - Practice: Consider The Following Python Dictionary Data and Python List Labels

Uploaded by

Abhishek Kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

pandas_basics_practice

April 7, 2021

Consider the following Python dictionary data and Python list labels:
data = {‘birds’: [‘Cranes’, ‘Cranes’, ‘plovers’, ‘spoonbills’, ‘spoonbills’, ‘Cranes’, ‘plovers’, ‘Cranes’,
‘spoonbills’, ‘spoonbills’], ‘age’: [3.5, 4, 1.5, np.nan, 6, 3, 5.5, np.nan, 8, 4], ‘visits’: [2, 4, 3, 4, 3, 4,
2, 2, 3, 2], ‘priority’: [‘yes’, ‘yes’, ‘no’, ‘yes’, ‘no’, ‘no’, ‘no’, ‘yes’, ‘no’, ‘no’]}
labels = [‘a’, ‘b’, ‘c’, ‘d’, ‘e’, ‘f’, ‘g’, ‘h’, ‘i’, ‘j’]
1. Create a DataFrame birds from this dictionary data which has the index labels.

[1]: import pandas as pd

import numpy as np

data = {'birds': ['Cranes', 'Cranes', 'plovers', 'spoonbills', 'spoonbills',␣

,→'Cranes', 'plovers', 'Cranes', 'spoonbills', 'spoonbills'],

'age': [3.5, 4, 1.5, np.nan, 6, 3, 5.5, np.nan, 8, 4],

'visits': [2, 4, 3, 4, 3, 4, 2, 2, 3, 2],
'priority': ['yes', 'yes', 'no', 'yes', 'no', 'no', 'no', 'yes', 'no',␣
,→'no']}

labels = ['a', 'b', 'c', 'd', 'e', 'f', 'g', 'h', 'i', 'j']

df=pd.DataFrame(data, index=labels)
df

[1]: birds age visits priority

a Cranes 3.5 2 yes
b Cranes 4.0 4 yes
c plovers 1.5 3 no
d spoonbills NaN 4 yes
e spoonbills 6.0 3 no
f Cranes 3.0 4 no
g plovers 5.5 2 no
h Cranes NaN 2 yes
i spoonbills 8.0 3 no
j spoonbills 4.0 2 no

2. Display a summary of the basic information about birds DataFrame and its data.

[114]: df.columns

1
[114]: Index(['birds', 'age', 'visits', 'priority'], dtype='object')

[115]: df.describe()

[115]: age visits

count 8.000000 10.000000
mean 4.437500 2.900000
std 2.007797 0.875595
min 1.500000 2.000000
25% 3.375000 2.000000
50% 4.000000 3.000000
75% 5.625000 3.750000
max 8.000000 4.000000

3. Print the first 2 rows of the birds dataframe

[43]: df.iloc[[0,1]]

[43]: birds age visits priority

labels
a Cranes 3.5 2 yes
b Cranes 4.0 4 yes

4. Print all the rows with only ‘birds’ and ‘age’ columns from the dataframe

[52]: df[['birds','age']]

[52]: birds age

labels
a Cranes 3.5
b Cranes 4.0
c plovers 1.5
d spoonbills NaN
e spoonbills 6.0
f Cranes 3.0
g plovers 5.5
h Cranes NaN
i spoonbills 8.0
j spoonbills 4.0

5. select [2, 3, 7] rows and in columns [‘birds’, ‘age’, ‘visits’]

[60]: df.loc[['b','c','g'],['birds','age','visits']]

[60]: birds age visits

labels
b Cranes 4.0 4
c plovers 1.5 3
g plovers 5.5 2

2
6. select the rows where the number of visits is less than 4

[59]: filt=df['visits']<4
df[filt]

[59]: birds age visits priority

labels
a Cranes 3.5 2 yes
c plovers 1.5 3 no
e spoonbills 6.0 3 no
g plovers 5.5 2 no
h Cranes NaN 2 yes
i spoonbills 8.0 3 no
j spoonbills 4.0 2 no

7. select the rows with columns [‘birds’, ‘visits’] where the age is missing i.e NaN

[2]: filt= df['age'].isnull()

df[filt][['birds','visits']]

[2]: birds visits

d spoonbills 4
h Cranes 2

8. Select the rows where the birds is a Cranes and the age is less than 4

[68]: filt=(df['birds']=='Cranes') & (df['age']<4)

df[filt]

[68]: birds age visits priority

labels
a Cranes 3.5 2 yes
f Cranes 3.0 4 no

9. Select the rows the age is between 2 and 4(inclusive)

[70]: filt=(df['age']>=2) & (df['age']<=4)

df[filt]

[70]: birds age visits priority

labels
a Cranes 3.5 2 yes
b Cranes 4.0 4 yes
f Cranes 3.0 4 no
j spoonbills 4.0 2 no

10. Find the total number of visits of the bird Cranes

[71]: df[df['birds']=='Cranes']['visits'].sum()

3
[71]: 12

11. Calculate the mean age for each different birds in dataframe.

[76]: birds_grp=df.groupby('birds')
birds_grp['age'].mean()

[76]: birds
Cranes 3.5
plovers 3.5
spoonbills 6.0
Name: age, dtype: float64

12. Append a new row ‘k’ to dataframe with your choice of values for each column.
Then delete that row to return the original DataFrame.

[106]: df.loc['k']=['Sparrow',3,4,'yes']
df

[106]: birds age visits priority

labels
a Cranes 3.5 2 yes
b Cranes 4.0 4 yes
c plovers 1.5 3 no
d spoonbills NaN 4 yes
e spoonbills 6.0 3 no
f Cranes 3.0 4 no
g plovers 5.5 2 no
h Cranes NaN 2 yes
i spoonbills 8.0 3 no
j spoonbills 4.0 2 no
k Sparrow 3.0 4 yes

[111]: df=df.drop('k')
df

[111]: birds age visits priority

4
13. Find the number of each type of birds in dataframe (Counts)

[73]: df['birds'].value_counts()

[73]: spoonbills 4
Cranes 4
plovers 2
Name: birds, dtype: int64

14. Sort dataframe (birds) first by the values in the ‘age’ in decending order, then by
the value in the ‘visits’ column in ascending order.

[116]: df.sort_values(by=['age','visits'],ascending=[False,True])

[116]: birds age visits priority

i spoonbills 8.0 3 no
e spoonbills 6.0 3 no
g plovers 5.5 2 no
j spoonbills 4.0 2 no
b Cranes 4.0 4 yes
a Cranes 3.5 2 yes
f Cranes 3.0 4 no
c plovers 1.5 3 no
h Cranes NaN 2 yes
d spoonbills NaN 4 yes

15. Replace the priority column values with’yes’ should be 1 and ‘no’ should be 0

[101]: def replace_priority(x):

if x=='yes':
return 1
else:
return 0
df['priority'].apply(replace_priority)
df

[101]: birds age visits priority

labels
a trumpeters 3.5 2 1
b trumpeters 4.0 4 1
c plovers 1.5 3 0
d spoonbills NaN 4 1
e spoonbills 6.0 3 0
f trumpeters 3.0 4 0
g plovers 5.5 2 0
h trumpeters NaN 2 1
i spoonbills 8.0 3 0
j spoonbills 4.0 2 0

16. In the ‘birds’ column, change the ‘Cranes’ entries to ‘trumpeters’.

5
[91]: df['birds']=df['birds'].replace({'Cranes':'trumpeters'})
df

[91]: birds age visits priority

labels
a trumpeters 3.5 2 yes
b trumpeters 4.0 4 yes
c plovers 1.5 3 no
d spoonbills NaN 4 yes
e spoonbills 6.0 3 no
f trumpeters 3.0 4 no
g plovers 5.5 2 no
h trumpeters NaN 2 yes
i spoonbills 8.0 3 no
j spoonbills 4.0 2 no

[ ]:

Solutions To Pandas Basic Questions
No ratings yet
Solutions To Pandas Basic Questions
1 page
Pandas vs PySpark: Data Operations
No ratings yet
Pandas vs PySpark: Data Operations
3 pages
Cleaning Dirty Data With Pandas & Python - DevelopIntelligence Blog PDF
No ratings yet
Cleaning Dirty Data With Pandas & Python - DevelopIntelligence Blog PDF
8 pages
Pandas Series and DataFrame Guide
No ratings yet
Pandas Series and DataFrame Guide
87 pages
Elite SQL Query Practice Guide
0% (1)
Elite SQL Query Practice Guide
20 pages
Pandas Cheatsheet 1743309413
No ratings yet
Pandas Cheatsheet 1743309413
11 pages
Orange 27-1-2025
No ratings yet
Orange 27-1-2025
20 pages
Snowflake Setup - MD
No ratings yet
Snowflake Setup - MD
2 pages
Django MVC vs MVT Explained
No ratings yet
Django MVC vs MVT Explained
3 pages
Mining Data Streams (Part 2)
No ratings yet
Mining Data Streams (Part 2)
56 pages
Appendix B DAX Reference
100% (1)
Appendix B DAX Reference
174 pages
Pandas in Python 16sept2022
No ratings yet
Pandas in Python 16sept2022
8 pages
BigQuery CheatSheet
100% (1)
BigQuery CheatSheet
100 pages
Pandas Handbook
No ratings yet
Pandas Handbook
33 pages
Pandas DataFrame Basics Guide
No ratings yet
Pandas DataFrame Basics Guide
41 pages
Data Stream Processing Insights
No ratings yet
Data Stream Processing Insights
67 pages
Weka Data Mining Lab Guide
No ratings yet
Weka Data Mining Lab Guide
20 pages
Oracle Analytic Functions Guide
100% (1)
Oracle Analytic Functions Guide
3 pages
Ch-2 Panda: #Import The Pandas Library and Aliasing As PD
No ratings yet
Ch-2 Panda: #Import The Pandas Library and Aliasing As PD
5 pages
Pandas Interview Prep Guide
No ratings yet
Pandas Interview Prep Guide
5 pages
Data Analysis Exercises for Beginners
No ratings yet
Data Analysis Exercises for Beginners
43 pages
Data Analysis With Pandas - Aggregates in Pandas Cheatsheet - Codecademy
100% (1)
Data Analysis With Pandas - Aggregates in Pandas Cheatsheet - Codecademy
2 pages
Data Analyst Masters with PowerBI
No ratings yet
Data Analyst Masters with PowerBI
27 pages
Numpy Final - Removed
No ratings yet
Numpy Final - Removed
46 pages
Pandas Data Wrangling Cheat Sheet
100% (2)
Pandas Data Wrangling Cheat Sheet
6 pages
Financial Analytics With Python
100% (1)
Financial Analytics With Python
40 pages
100 SQL Formulas Each Student Should Know
No ratings yet
100 SQL Formulas Each Student Should Know
10 pages
100 DSA Python
No ratings yet
100 DSA Python
45 pages
PySpark SQL Cheat Sheet Python
No ratings yet
PySpark SQL Cheat Sheet Python
1 page
Power BI Interview Questions at Deloitte
0% (1)
Power BI Interview Questions at Deloitte
6 pages
Python Date Time
No ratings yet
Python Date Time
6 pages
Snowflake Fundamentals Anand Jha
No ratings yet
Snowflake Fundamentals Anand Jha
50 pages
Python Pandas Cheatsheety
No ratings yet
Python Pandas Cheatsheety
7 pages
Spark Streaming Twitter Example
No ratings yet
Spark Streaming Twitter Example
4 pages
EDA With Pandas CheatSheet
No ratings yet
EDA With Pandas CheatSheet
3 pages
Database Setup for E-commerce
No ratings yet
Database Setup for E-commerce
4 pages
Power BI Math Functions Guide
No ratings yet
Power BI Math Functions Guide
9 pages
Data Engineer Path - Hands On SQL, Data Pipelines - Dataquest
No ratings yet
Data Engineer Path - Hands On SQL, Data Pipelines - Dataquest
1 page
Tableau Notes
No ratings yet
Tableau Notes
21 pages
Extract Transform Load
No ratings yet
Extract Transform Load
80 pages
Hive Cheat Sheet - Quick Reference
No ratings yet
Hive Cheat Sheet - Quick Reference
19 pages
KSR DATA VISION Fullstack - Powerbi - With - Fabric - Tools
No ratings yet
KSR DATA VISION Fullstack - Powerbi - With - Fabric - Tools
21 pages
Pandas
No ratings yet
Pandas
13 pages
DataStage Faq S
No ratings yet
DataStage Faq S
57 pages
SQL Vs PySpark 1678871778
No ratings yet
SQL Vs PySpark 1678871778
8 pages
DAX Cheat Sheet for Power BI
No ratings yet
DAX Cheat Sheet for Power BI
10 pages
SQL Functions
100% (1)
SQL Functions
16 pages
Introduction To Data Mining
100% (1)
Introduction To Data Mining
18 pages
Excel and Dax
No ratings yet
Excel and Dax
13 pages
EDA With Pandas
No ratings yet
EDA With Pandas
8 pages
Introduction To Data Visualization With Seaborn Chapter1
No ratings yet
Introduction To Data Visualization With Seaborn Chapter1
26 pages
Azure Synapse Lab Guide
No ratings yet
Azure Synapse Lab Guide
21 pages
Python Data Visualization Guide
No ratings yet
Python Data Visualization Guide
16 pages
Introduction To Data Visualization With Python
No ratings yet
Introduction To Data Visualization With Python
47 pages
Hadoop Overview
100% (1)
Hadoop Overview
16 pages
Dinesh DM
No ratings yet
Dinesh DM
34 pages
UN CO2 Data Analysis with Pandas
No ratings yet
UN CO2 Data Analysis with Pandas
28 pages
Pandas - Basics - Practice: Consider The Following Python Dictionary Data and Python List Labels
No ratings yet
Pandas - Basics - Practice: Consider The Following Python Dictionary Data and Python List Labels
7 pages
2 Pandas Basics Practice 2 PDF
No ratings yet
2 Pandas Basics Practice 2 PDF
1 page
Pandas Library Problems For Parctice
No ratings yet
Pandas Library Problems For Parctice
13 pages
Assignment 2 - 1 TR2103
No ratings yet
Assignment 2 - 1 TR2103
6 pages
Variable Selection in SAS Enterprise Guide and SAS Enterprise Miner - Ask The Expert - May 11 2017
No ratings yet
Variable Selection in SAS Enterprise Guide and SAS Enterprise Miner - Ask The Expert - May 11 2017
66 pages
Solidworks Exercise Book PDF
74% (19)
Solidworks Exercise Book PDF
38 pages
Software Deadline Time Estimation
No ratings yet
Software Deadline Time Estimation
15 pages
Student-Adviser Agreement Form
No ratings yet
Student-Adviser Agreement Form
1 page
Assignment 2
No ratings yet
Assignment 2
2 pages
ITMN e
No ratings yet
ITMN e
272 pages
Training For TATA - 1646SM
100% (1)
Training For TATA - 1646SM
184 pages
小五組 Grade 5: 時限：分鐘 Time allowed: minutes
100% (1)
小五組 Grade 5: 時限：分鐘 Time allowed: minutes
5 pages
Catalog Perennial Plants Desene
No ratings yet
Catalog Perennial Plants Desene
15 pages
TAT 3.9 UserGuide 20170721
No ratings yet
TAT 3.9 UserGuide 20170721
35 pages
User ID Pass Word For Block Data Entry HWC
No ratings yet
User ID Pass Word For Block Data Entry HWC
4 pages
Synon 2E Database & Access Path Guide
No ratings yet
Synon 2E Database & Access Path Guide
6 pages
Debug in CRM
No ratings yet
Debug in CRM
5 pages
FNP Dedi 840097 Presentasi
No ratings yet
FNP Dedi 840097 Presentasi
7 pages
Ipv6 Application Services Dhcpv6: Huawei Technologies Co., LTD
No ratings yet
Ipv6 Application Services Dhcpv6: Huawei Technologies Co., LTD
15 pages
An Introduction On OMR Sheets: Instructions On How To Fill Registration Number and Question Paper Code On OMR Sheets
No ratings yet
An Introduction On OMR Sheets: Instructions On How To Fill Registration Number and Question Paper Code On OMR Sheets
2 pages
51 Log Siemens PDF
No ratings yet
51 Log Siemens PDF
2 pages
Audio Recipes For iOS
No ratings yet
Audio Recipes For iOS
79 pages
Integrating Revised Bloom Taxonomy in Multimedia and HCI With A Case Study of Food Dishes
No ratings yet
Integrating Revised Bloom Taxonomy in Multimedia and HCI With A Case Study of Food Dishes
11 pages
A Concise Grammar of The Arabic Language
No ratings yet
A Concise Grammar of The Arabic Language
199 pages
Computer Programming for Electrical Engineers
No ratings yet
Computer Programming for Electrical Engineers
31 pages
Python Functions and Error Fixing Worksheet
No ratings yet
Python Functions and Error Fixing Worksheet
4 pages
AMDAHL's LAW
No ratings yet
AMDAHL's LAW
3 pages
Bom Report Using Stko, Stpo, Stas, Mast Table
100% (1)
Bom Report Using Stko, Stpo, Stas, Mast Table
7 pages
Deteministic and Non Diterministic Finite Automata in Acd
No ratings yet
Deteministic and Non Diterministic Finite Automata in Acd
11 pages
FIR FILTER Implementation in ARM Instruction
No ratings yet
FIR FILTER Implementation in ARM Instruction
19 pages
Lesson 3
No ratings yet
Lesson 3
4 pages
Data Mining Term Project Machine Learning With WEKA: Weka Explorer Tutorial For Version 3.4.3
No ratings yet
Data Mining Term Project Machine Learning With WEKA: Weka Explorer Tutorial For Version 3.4.3
42 pages
Summer Training Report: Submitted in Partial Fulfillment For The Second Year Summer Internship of
No ratings yet
Summer Training Report: Submitted in Partial Fulfillment For The Second Year Summer Internship of
27 pages

Pandas - Basics - Practice: Consider The Following Python Dictionary Data and Python List Labels

Uploaded by

Pandas - Basics - Practice: Consider The Following Python Dictionary Data and Python List Labels

Uploaded by

pandas_basics_practice

[1]: import pandas as pd

data = {'birds': ['Cranes', 'Cranes', 'plovers', 'spoonbills', 'spoonbills',␣

'age': [3.5, 4, 1.5, np.nan, 6, 3, 5.5, np.nan, 8, 4],

[1]: birds age visits priority

[115]: age visits

3. Print the first 2 rows of the birds dataframe

[43]: birds age visits priority

[52]: birds age

5. select [2, 3, 7] rows and in columns [‘birds’, ‘age’, ‘visits’]

[60]: birds age visits

[59]: birds age visits priority

[2]: filt= df['age'].isnull()

[2]: birds visits

[68]: filt=(df['birds']=='Cranes') & (df['age']<4)

[68]: birds age visits priority

9. Select the rows the age is between 2 and 4(inclusive)

[70]: filt=(df['age']>=2) & (df['age']<=4)

[70]: birds age visits priority

10. Find the total number of visits of the bird Cranes

[106]: birds age visits priority

[111]: birds age visits priority

[116]: birds age visits priority

[101]: def replace_priority(x):

[101]: birds age visits priority

16. In the ‘birds’ column, change the ‘Cranes’ entries to ‘trumpeters’.

[91]: birds age visits priority

You might also like