Pandas Methods

The document discusses various pandas methods for analyzing and manipulating DataFrames. It defines methods like dropna() for dropping missing values, fillna() for replacing missing values, describe() for generating descriptive statistics, and sort_values() for sorting the DataFrame. It also discusses attributes like shape for getting the DataFrame dimensions and dtypes for getting the data types.

Uploaded by

Mido Khaled

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

38 views6 pages

Pandas Methods

Uploaded by

Mido Khaled

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

dropna() Drop missing observations

dropna(how='all') Drop observations where all cells is NA

dropna(axis=1, how='all') Drop column if all the values are missing
dropna(thresh = 5) Drop rows that contain less than 5 non-missing values
fillna(0) Replace missing values with zeros
isnull() returns True if the value is missing
notnull() Returns True for non-missing values
describe() Basic statistics (count, mean, std, min, quantiles, max)
min, max()Minimum and maximum values
mean, median, mode() Arithmetic average, median and mode
var(), std() Variance and standard deviation
sem() Standard error of mean
skew() Sample skewness
kurt() kurtosis
distplot() histogram
barplot() estimate of central tendency for a numeric variable
violinplot() similar to boxplot, also shows the probability density of the
Data
df.dtypes() is an attribute of a DataFrame that returns a Series with the data type of each column.

df.axes() is an attribute of a DataFrame that returns a list of the row axis labels and column axis labels.

df.ndim() is an attribute of a DataFrame that returns the number of dimensions of the DataFrame. For
a DataFrame, it always returns 2.

df.shape() is an attribute of a DataFrame that returns a tuple representing the dimensions of the
DataFrame (number of rows, number of columns).

df.values() is an attribute of a DataFrame that returns a Numpy representation of the DataFrame's

data. It returns a 2D Numpy array.

df.dropna() is a method used to remove rows or columns with missing values (NaN). By default, it
removes rows with any NaN values, but you can specify additional parameters like axis and how to
control the behavior.

df.sample([n]) is a method used to randomly select rows from the DataFrame. It returns a sample of
the DataFrame. You can specify the number of rows (n) to sample.
df.loc:

df.loc() is a method used to access a group of rows and columns by labels or a boolean array. It is
primarily label-based, but may also be used with a boolean array.
df.iloc:

df.iloc() is a method used for integer-location based indexing. It is primarily integer position-based,
but may also be used with a boolean array.
df.sort_values:

df.sort_values() is a method used to sort the DataFrame by the values along either axis.
DataFrame():

DataFrame() is a class constructor used to create a new DataFrame object, which is a two-
dimensional, size-mutable, and heterogeneous tabular data structure with labeled axes (rows and
columns).

read_csv() is a function used to read data from a CSV file into a DataFrame. It automatically converts
the data into a DataFrame format.
head():
head() is a method used to display the first n rows of a DataFrame, where n is the number of rows
specified as an argument (default is 5).

tail() is a method used to display the last n rows of a DataFrame, where n is the number of rows
specified as an argument (default is 5).
info():

info() is a method used to print a concise summary of a DataFrame, including the index dtype and
column dtypes, non-null values, and memory usage.
describe():

describe() is a method used to generate descriptive statistics of the DataFrame's numerical columns,
including count, mean, standard deviation, minimum, maximum, and percentiles.

shape is an attribute of a DataFrame that returns a tuple representing the dimensions of the
DataFrame (number of rows, number of columns).

columns is an attribute of a DataFrame that returns an Index object containing the column labels.

index is an attribute of a DataFrame that returns an Index object containing the row labels.

set_index() is a method used to set the DataFrame's index (row labels) using one or more existing
columns.

reset_index() is a method used to reset the DataFrame's index (row labels) to the default integer
index and move the old index into a new column.

drop() is a method used to remove rows or columns from a DataFrame based on labels or indexes.

fillna() is a method used to fill missing (NaN) values in a DataFrame with a specified value or method.

groupby() is a method used to group DataFrame rows by values in one or more columns, allowing for
aggregation and analysis.

jointplot() Scatterplot
regplot() Regression plot
pairplot() Pairplot
boxplot() boxplot
swarmplot() categorical scatterplot
factorplot() General categorical plot
Pandas Methods Quiz

.
What does the df.head() method do in Pandas?
.
 A) Returns the first 10 rows of the DataFrame
 B) Returns the last 5 rows of the DataFrame
 C) Returns the first 5 rows of the DataFrame
 D) Returns the last 10 rows of the DataFrame
Answer: C) Returns the first 5 rows of the DataFrame
.
What does the df.tail(3) method do in Pandas?
.
 A) Returns the first 3 rows of the DataFrame
 B) Returns the last 3 rows of the DataFrame
 C) Returns the last 5 rows of the DataFrame
 D) Returns the first 5 rows of the DataFrame
Answer: B) Returns the last 3 rows of the DataFrame
.
What does the df.info() method do in Pandas?
.
 A) Provides basic information about the DataFrame, such as the
number of rows and columns
 B) Returns descriptive statistics of the DataFrame
 C) Removes rows with missing values
 D) Sorts the DataFrame based on specified columns
Answer: A) Provides basic information about the DataFrame, such as the number
of rows and columns
.
What does the df.describe() method do in Pandas?
.
 A) Provides basic information about the DataFrame, such as the
number of rows and columns
 B) Returns descriptive statistics of the DataFrame
 C) Removes rows with missing values
 D) Sorts the DataFrame based on specified columns
Answer: B) Returns descriptive statistics of the DataFrame
.
What does the df.shape attribute return in Pandas?
.
 A) Number of rows in the DataFrame
 B) Number of columns in the DataFrame
 C) Dimensions of the DataFrame (number of rows, number of
columns)
 D) Data types of each column in the DataFrame
Answer: C) Dimensions of the DataFrame (number of rows, number of columns)
.
What does the df.dropna() method do in Pandas?
.
 A) Removes duplicate rows from the DataFrame
 B) Removes rows with missing values
 C) Removes specified columns from the DataFrame
 D) Sorts the DataFrame based on specified columns
Answer: B) Removes rows with missing values
.
What does the df.groupby() method do in Pandas?
.
 A) Groups the DataFrame by specified columns and applies an
aggregate function
 B) Sorts the DataFrame based on specified columns
 C) Returns the unique values in a specified column
 D) Merges two DataFrames based on common columns
Answer: A) Groups the DataFrame by specified columns and applies an aggregate
function
.
What does the df.sort_values() method do in Pandas?
.
 A) Groups the DataFrame by specified columns and applies an
aggregate function
 B) Sorts the DataFrame based on specified columns
 C) Returns the unique values in a specified column
 D) Merges two DataFrames based on common columns
Answer: B) Sorts the DataFrame based on specified columns
.
What does the df.rename() method do in Pandas?
.
 A) Removes specified columns from the DataFrame
 B) Renames specified columns in the DataFrame
 C) Returns the unique values in a specified column
 D) Merges two DataFrames based on common columns
Answer: B) Renames specified columns in the DataFrame
.
What does the df.drop_duplicates() method do in Pandas?
.
 A) Removes duplicate rows from the DataFrame
 B) Removes rows with missing values
 C) Groups the DataFrame by specified columns and applies an
aggregate function
 D) Sorts the DataFrame based on specified columns
Answer: A) Removes duplicate rows from the DataFrame

You

more mcq

ChatGPT

Sure, here are some more multiple-choice questions (MCQs) about Pandas
methods:
.
What does the df.mean() method do in Pandas?
.
 A) Computes the mean value of each column in the DataFrame
 B) Computes the median value of each column in the DataFrame
 C) Computes the mode value of each column in the DataFrame
 D) Computes the standard deviation of each column in the
DataFrame
Answer: A) Computes the mean value of each column in the DataFrame
.
What does the df.sum() method do in Pandas?
.
 A) Computes the sum of each row in the DataFrame
 B) Computes the sum of each column in the DataFrame
 C) Computes the cumulative sum of each column in the DataFrame
 D) Computes the product of each column in the DataFrame
Answer: B) Computes the sum of each column in the DataFrame
.
What does the df.min() method do in Pandas?
.
 A) Returns the minimum value of each column in the DataFrame
 B) Returns the minimum value of each row in the DataFrame
 C) Returns the index of the minimum value in each column
 D) Returns the index of the minimum value in each row
Answer: A) Returns the minimum value of each column in the DataFrame
.
What does the df.max() method do in Pandas?
.
 A) Returns the maximum value of each column in the DataFrame
 B) Returns the maximum value of each row in the DataFrame
 C) Returns the index of the maximum value in each column
 D) Returns the index of the maximum value in each row
Answer: A) Returns the maximum value of each column in the DataFrame
.
What does the df.count() method do in Pandas?
.
 A) Counts the number of non-null values in each column of the
DataFrame
 B) Counts the number of null values in each column of the
DataFrame
 C) Counts the number of non-null values in each row of the
DataFrame
 D) Counts the number of null values in each row of the DataFrame
Answer: A) Counts the number of non-null values in each column of the
DataFrame
.
What does the df.median() method do in Pandas?
.
 A) Computes the median value of each column in the DataFrame
 B) Computes the mean value of each column in the DataFrame
 C) Computes the mode value of each column in the DataFrame
 D) Computes the standard deviation of each column in the
DataFrame
Answer: A) Computes the median value of each column in the DataFrame
.
What does the df.std() method do in Pandas?
.
 A) Computes the standard deviation of each column in the
DataFrame
 B) Computes the mean value of each column in the DataFrame
 C) Computes the mode value of each column in the DataFrame
 D) Computes the median value of each column in the DataFrame
Answer: A) Computes the standard deviation of each column in the DataFrame
.
What does the df.unique() method do in Pandas?
.
 A) Returns the unique values in a specified column
 B) Removes duplicate rows from the DataFrame
 C) Computes the mean value of each column in the DataFrame
 D) Computes the sum of each column in the DataFrame
Answer: A) Returns the unique values in a specified column
.
What does the df.nunique() method do in Pandas?
.
 A) Counts the number of unique values in each column of the DataFrame
 B) Returns the unique values in a specified column
 C) Removes duplicate rows from the DataFrame
 D) Computes the sum of each column in the DataFrame
Answer: A) Counts the number of unique values in each column of the DataFrame
.
What does the df.value_counts() method do in Pandas?
.
 A) Counts the number of unique values in each column of the DataFrame
 B) Returns the unique values in a specified column
 C) Computes the sum of each column in the DataFrame
 D) Counts the occurrences of each unique value in a specified column
Answer: D) Counts the occurrences of each unique value in a specified column

ML Lab1 Python Panda
No ratings yet
ML Lab1 Python Panda
9 pages
Pandas
No ratings yet
Pandas
13 pages
Pandas
No ratings yet
Pandas
8 pages
Pandas For Machine Learning: Acadview
No ratings yet
Pandas For Machine Learning: Acadview
18 pages
Data Analysis with Pandas
No ratings yet
Data Analysis with Pandas
31 pages
Pandas Notes Design
No ratings yet
Pandas Notes Design
5 pages
DevOps Session 3 Pandas
No ratings yet
DevOps Session 3 Pandas
33 pages
Pandas Guide for Data Analysts
No ratings yet
Pandas Guide for Data Analysts
9 pages
Pandas
No ratings yet
Pandas
30 pages
Pandas Notes
No ratings yet
Pandas Notes
6 pages
PANDAS Cheatsheet
No ratings yet
PANDAS Cheatsheet
4 pages
18 Pandas
No ratings yet
18 Pandas
33 pages
Module1-Cheat-Sheet-LINE PLOT
No ratings yet
Module1-Cheat-Sheet-LINE PLOT
3 pages
Pandas 6 1716219621
No ratings yet
Pandas 6 1716219621
17 pages
Pandas
No ratings yet
Pandas
86 pages
EDA With Pandas CheatSheet
No ratings yet
EDA With Pandas CheatSheet
3 pages
HTML-Notes 1
No ratings yet
HTML-Notes 1
27 pages
The Racers Life
No ratings yet
The Racers Life
74 pages
HTML Note Imp HTML
No ratings yet
HTML Note Imp HTML
165 pages
Pandas
No ratings yet
Pandas
41 pages
HTML
No ratings yet
HTML
12 pages
Pandas in Python 16sept2022
No ratings yet
Pandas in Python 16sept2022
8 pages
Pandas
No ratings yet
Pandas
27 pages
Pandas Guide for Beginners
No ratings yet
Pandas Guide for Beginners
18 pages
Pandas Cheat Sheet for Data Science
No ratings yet
Pandas Cheat Sheet for Data Science
5 pages
HTML Tutorial
No ratings yet
HTML Tutorial
42 pages
UNIT - 3 Pandas
No ratings yet
UNIT - 3 Pandas
21 pages
Block 1-Data Handling Using Pandas DataFrame
No ratings yet
Block 1-Data Handling Using Pandas DataFrame
17 pages
1 - Interactive Data Visualization With Bokeh
No ratings yet
1 - Interactive Data Visualization With Bokeh
31 pages
Pandas DataFrame Basics Guide
No ratings yet
Pandas DataFrame Basics Guide
4 pages
Data Analysis With Pandas - Aggregates in Pandas Cheatsheet - Codecademy
100% (1)
Data Analysis With Pandas - Aggregates in Pandas Cheatsheet - Codecademy
2 pages
Python Data Analysis Basics
No ratings yet
Python Data Analysis Basics
246 pages
Pandas
No ratings yet
Pandas
14 pages
Data Manipulation With Pandas
No ratings yet
Data Manipulation With Pandas
19 pages
40 NumPy and Pandas Interview Questions With Answers 1740141557
No ratings yet
40 NumPy and Pandas Interview Questions With Answers 1740141557
6 pages
P Unit-4 NP
No ratings yet
P Unit-4 NP
30 pages
Pandas DataFrame Basics
No ratings yet
Pandas DataFrame Basics
10 pages
1 Pandas Basics
No ratings yet
1 Pandas Basics
13 pages
HTML Notes
No ratings yet
HTML Notes
22 pages
Chapter - 6 Dictionary
100% (2)
Chapter - 6 Dictionary
25 pages
Data Science Python Cheat Sheet
No ratings yet
Data Science Python Cheat Sheet
25 pages
Unit-1 Python Pandas
No ratings yet
Unit-1 Python Pandas
56 pages
HTML Basics and Tags Guide
No ratings yet
HTML Basics and Tags Guide
44 pages
Data Manipulation With Pandas
No ratings yet
Data Manipulation With Pandas
39 pages
Python Pandas New Sylabus
No ratings yet
Python Pandas New Sylabus
53 pages
Pandas Series and DataFrame Guide
No ratings yet
Pandas Series and DataFrame Guide
87 pages
Pandas Guide for Data Science
No ratings yet
Pandas Guide for Data Science
42 pages
Class 6 Pandas
No ratings yet
Class 6 Pandas
13 pages
Unit - 1 - Python Pandas
No ratings yet
Unit - 1 - Python Pandas
176 pages
Pandas Notes Basic To Advance
No ratings yet
Pandas Notes Basic To Advance
21 pages
XII-IP - Data Visualisation
No ratings yet
XII-IP - Data Visualisation
65 pages
HTML Cheat Sheet - Copie
No ratings yet
HTML Cheat Sheet - Copie
9 pages
Top 50 Pandas Interview Questions and Answers (2024)
No ratings yet
Top 50 Pandas Interview Questions and Answers (2024)
34 pages
Ipl Data Anlysis
No ratings yet
Ipl Data Anlysis
20 pages
Python Data Analysis for Beginners
No ratings yet
Python Data Analysis for Beginners
100 pages
Research Paper Presentation Pandas Moshiul Arefin
No ratings yet
Research Paper Presentation Pandas Moshiul Arefin
30 pages
EDA With Pandas
No ratings yet
EDA With Pandas
8 pages
Analyzing Data Using Python Filtering Data in Pandas
No ratings yet
Analyzing Data Using Python Filtering Data in Pandas
52 pages
Unit IV
No ratings yet
Unit IV
49 pages
Chapter Python Pandas-II
No ratings yet
Chapter Python Pandas-II
15 pages
Sampling Theory
No ratings yet
Sampling Theory
24 pages
Lecture Notes 7 1 1
No ratings yet
Lecture Notes 7 1 1
12 pages
Central Limit Theorem
No ratings yet
Central Limit Theorem
3 pages
USACE Risk Tools for Hydropower
No ratings yet
USACE Risk Tools for Hydropower
26 pages
Bnad Lab 7
No ratings yet
Bnad Lab 7
5 pages
Credit Risk Scorecard Stability Review
No ratings yet
Credit Risk Scorecard Stability Review
20 pages
Anderson-Darling Test Guide
No ratings yet
Anderson-Darling Test Guide
37 pages
Forecasting Techniques for Students
No ratings yet
Forecasting Techniques for Students
53 pages
Linear Regression Quiz
100% (2)
Linear Regression Quiz
6 pages
"II PUC Statistics Model Paper 2023-24"
No ratings yet
"II PUC Statistics Model Paper 2023-24"
9 pages
ECO242
No ratings yet
ECO242
1 page
LESSON 2 Test Concerning Means Part 2
No ratings yet
LESSON 2 Test Concerning Means Part 2
7 pages
Normality Test
No ratings yet
Normality Test
21 pages
PR2 Unit 3 Lesson 2 - Formulating A Hypothesis For Quantitative Research
No ratings yet
PR2 Unit 3 Lesson 2 - Formulating A Hypothesis For Quantitative Research
12 pages
Excel Tutorial
No ratings yet
Excel Tutorial
3 pages
Full Test Paper-1
No ratings yet
Full Test Paper-1
13 pages
Department of Agricultural Statistics: JNR MSC Agri, Students, Uasd
No ratings yet
Department of Agricultural Statistics: JNR MSC Agri, Students, Uasd
2 pages
Advanced Business Statistics For Decision Making: Facilitator-Dr. Shilpa Bhaskar Mujumdar
100% (1)
Advanced Business Statistics For Decision Making: Facilitator-Dr. Shilpa Bhaskar Mujumdar
31 pages
Unit 3 - Probability and Probability Distributions Vs2-Merged
No ratings yet
Unit 3 - Probability and Probability Distributions Vs2-Merged
28 pages
Practical Attachment Report12
No ratings yet
Practical Attachment Report12
22 pages
Final Mast221 18F
No ratings yet
Final Mast221 18F
4 pages
2011 02 08 Data Analysis
No ratings yet
2011 02 08 Data Analysis
47 pages
Stock Watson 4E Exercisesolutions Chapter12 Students
No ratings yet
Stock Watson 4E Exercisesolutions Chapter12 Students
6 pages
Statistics and Probability For VERSION 3
70% (20)
Statistics and Probability For VERSION 3
71 pages
Chapter 13, Numbers 13.6, 13.8, 13.9, and 13.10 2. Chapter 14, Numbers 14.11, 14.12, and 14.14 3. Chapter 15, Numbers 15.7, 15.8, 15.10 and 15.14
No ratings yet
Chapter 13, Numbers 13.6, 13.8, 13.9, and 13.10 2. Chapter 14, Numbers 14.11, 14.12, and 14.14 3. Chapter 15, Numbers 15.7, 15.8, 15.10 and 15.14
5 pages
Computing The Variance of A Discrete Probability Distribution
No ratings yet
Computing The Variance of A Discrete Probability Distribution
14 pages
Advanced Statistics Project Report
100% (1)
Advanced Statistics Project Report
42 pages
Moments J Skewness and Kurtosis - Final
No ratings yet
Moments J Skewness and Kurtosis - Final
70 pages
Stat
No ratings yet
Stat
8 pages
Main Assessment: Subject: Quantitative Techniques Subject Code: Kwn10Ab / Qth115E
No ratings yet
Main Assessment: Subject: Quantitative Techniques Subject Code: Kwn10Ab / Qth115E
12 pages