0% found this document useful (0 votes)

16 views24 pages

Pandas and Python

This document describes how to use the Pandas library in Python to manipulate and analyze data. Pandas allows importing data from various sources such as Excel files, CSV, and SQL databases, and representing them in DataFrames. DataFrames allow selecting rows and columns, sorting and filtering data, applying functions to columns, and removing duplicate rows and columns.

Uploaded by

ScribdTranslations

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views24 pages

Pandas and Python

Uploaded by

ScribdTranslations

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 24

Pandas and Python

Pandas is an open-source Python library that provides analysis and

data manipulation inprogramming in Python.
It is a very promising library for data representation, filtering, and programming.
statistics. The most important piece in pandas is the DataFrame where it stores and plays.
with the data.

In this tutorial, you will learn what a DataFrame is, how to create it from different sources,
how to export it to different results and how to manipulate its data.

Install pandas
Can you install pandas in Python?using pipRun the following command in cmd:
pip install pandas

Also, you can install pandas using conda like this:

condainstallpandas

Read an Excel file

You can read from an Excel file using the read_excel() method from pandas. For this,
you need to import one more module called xlrd.

Install xlrd using pip:

pip install xlrd

The following example shows how to read from an Excel sheet:

1. We create an Excel sheet with the following contents:

2. Import the pandas module.

import pandas
3. We will provide the name of the Excel file and the sheet number that we need.
read the data using the read_excel() method.
pandas.read_excel('pandasExcel.xlsx', 'Sheet1')
The previous fragment will generate the following result:
If you check the output type using the keyword type, it will give you the following result:

<class 'pandas.core.frame.DataFrame'>
This result is called DataFrame! That is the basic unit of pandas with which we work.
to be addressed until the end of the tutorial.
The DataFrame is a labeled two-dimensional structure where we can store
data of different types. DataFrame is similar to a SQL table or a spreadsheet.
Excel.

Import CSV file

To read a CSV file, you can use the read_csv() method from pandas.

Import the pandas module:

import pandas
Now call the read_csv() method as follows:

pandas.read_csv('Book1.csv')
Book1.csv has the following content:
The code will generate the following DataFrame:

Read a text file

We can also use the read_csv method from pandas to read from a text file;
Consider the following example:

import pandas

pandas.read_csv('myFile.txt')
The myFile.txt has the following format:
The output of the previous code will be:

This text file is treated like a CSV file because we have separated elements.
by commas. The file can also use another delimiter, such as a semicolon, a
tabulator, etc.

Suppose we have a tab delimiter and the file looks like this:
When the delimiter is a tab, we will have the following result:

Since pandas has no idea of the delimiter, translate the tab to \ t.

To define the tab character as a delimiter, pass the delimiter argument from
this way:

pandas.read_csv('myFile.txt', delimiter='\t')
Now the output will be:

It seems correct now.

Read SQL
You can use the read_sql() method of pandas to read from a SQL database. This is
demonstrate in the following example:

import sqlite3

import pandas

con = sqlite3.connect('mydatabase.db')

pandas.read_sql('select * from Employee', con)

In this example, we connect to aSQLite3 databasethat has a table called
"Employee". Using the read_sql() method from pandas, we pass a query and an object of
Connection to the read_sql() method. The query retrieves all the data from the table.
Our employee table looks like the following:

When you run the above code, the output will be as follows:
Select columns
Let's assume we have three columns in the Employee table like this:

To select columns from the table, we will run the following query:

select Name, Job from Employee

The statement of the pandas code will be as follows:
pandas.read_sql('select Name, Job from Employee', con)

We can also select a column from a table by accessing the DataFrame.

Consider the following example:

x = pandas.read_sql('select * from Employee', con)

x
The result will be the following:

Select rows by value

First, we will create a DataFrame from which we will select rows.

To create a DataFrame, consider the following code:

import pandas

frame_data = {'name': ['James','Jason','Rogers'],'age': [18,20,22],'job': ['Assistant','Manager',

Clerk

df = pandas.DataFrame(frame_data)
In this code, we create a DataFrame with three columns and three rows using the method
Pandas DataFrame(). The result will be as follows:

To select a row according to its value, execute the following statement

df.loc[df['name'] == 'Jason']
df.loc [] or DataFrame.loc [] is a boolean array that can be used to access rows or
columns by values or labels. In the previous code, it will search for the row where the
My name is Jason.

The output will be:

Select row by index

To select a row by its index, we can use the slicing operator (:) or the
fix df.loc [].

Consider the following code:

>>> frame_data = {'name': ['James','Jason','Rogers'],'age': [18,20,22],'job': ['Assistant','Manager',

Clerk

>>> df = pandas.DataFrame(frame_data)
We create a DataFrame. Now we are going to access a row using df.loc[]:

>>> df.loc[1]
As you can see, we retrieved a row. We can do the same using the operator of
segmentation in the following way:

>>> df[1:2]

Change column type

The data type of a column can be changed using the astype() attribute of
DataFrame. To check the data type of the columns, we use the dtypes attribute of
DataFrame.

df.dtypes
The exit will be:

Now to convert the data type from one to another:

>>> df.name = df.name.astype(str)

We look for the 'name' column of our DataFrame and change its data type from object.
a string of characters.

Apply a function to columns / rows

To apply a function to a column or row, you can use the apply() method of
DataFrame.

Consider the following example:

>>> frame_data = {'A': [1,2,3],'B': [18,20,22],'C': [54,12,13]}

>>> df = pandas.DataFrame(frame_data)
We create a DataFrame and add integer values in the rows. To apply a
function, for example, the square root in the values, we will import the modulenumpyfor
use the sqrt function like this:
>>>import numpy as np

>>>df.apply(np.sqrt)
The output will be as follows:

To apply a sum function, the code will be:

>>> df.apply(np.sum)

To apply the function to a specific column, you can specify the column of the
next way:

>>>df['A'].apply(np.sqrt)

Sort values / sort by column

To sort the values in a DataFrame, use the DataFrame's sort_values() method.

Create a DataFrame with integer values:

>>> frame_data = {'A': [23,12,30],'B': [18,20,22],'C': [54,112,13]}

>>> df = pandas.DataFrame(frame_data)
Now to sort the values:

>>> df.sort_values(by=['A'])
The output will be:
The sort_values() method has a required 'by' attribute. In the previous code, the
values are sorted by column A. To sort by multiple columns, the code is
next:

Sort df by columns 'A' and 'B'.

If you want to sort in descending order, set the ascending attribute of set_values to
False in the following way:

>>>df.sort_values(by=['A'], ascending=False)
The output will be:

Remove / Delete duplicates

To remove duplicate rows from a DataFrame, use the drop_duplicates() method of the
DataFrame.

Consider the following example:

>>> frame_data = {'name': ['James','Jason','Rogers','Jason'],'age': [18,20,22,20],'job': ['Assistant',

'Manager','Clerk','Manager']}

>>> df = pandas.DataFrame(frame_data)
Here we create a DataFrame with a duplicate row. To check for duplicate rows in
the DataFrame, use the DataFrame's duplicated() method.

>>> df.duplicated()
The result will be:

It can be seen that the last row is a duplicate. To delete this row, execute the following
line of code:

>>> df.drop_duplicates()
Now the result will be:

Remove duplicates by column

Sometimes, we have data where the column values are the same and we want to
we can delete a row by column by passing the name of the column that
we must eliminate.

For example, we have the following DataFrame:

The provided text does not contain translatable content.

'Manager','Clerk','Employee']}

>>> df = pandas.DataFrame(frame_data)
Here you can see that Jason appears twice. If you want to remove duplicates by column,
just pass the column name as follows:

>>> df.drop_duplicates(['name'])
The result will be as follows:

Delete a column
To remove a whole column or row, we can use the drop() method of the DataFrame
specifying the name of the column or row.

Consider the following example:

>>> df.drop(['job'], axis=1)

In this line of code, we are removing the column called 'job'. The argument of
axis is necessary here. If the axis value is 1, it means we want to remove columns, if the
The axis value of 0 means that the row will be deleted. In axis values, 0 is for index and 1
for columns.

The result will be:

Remove rows
We can use the drop() method to remove a row by passing the index of the row.

Let's assume we have the following DataFrame:

>>> frame_data = {'name': ['James','Jason','Rogers'],'age': [18,20,22],'job': ['Assistant','Manager',

Clerk

>>> df = pandas.DataFrame(frame_data)
To delete a row with index 0 where the name is James, the age is 18 and the job
as an assistant, use the following code:

>>> df.drop([0])

We are going to create a DataFrame where the indices are the names:

>>> frame_data = {'name': ['James','Jason','Rogers'],'age': [18,20,22],'job': ['Assistant','Manager',

Clerk

>>> df = pandas.DataFrame(frame_data, index = ['James','Jason','Rogers'])

Now we can delete a row with a certain value. For example, if we want to delete a
row where the name is Rogers, then the code will be:

>>> df.drop(['Rogers'])
The output will be:

You can also delete a range of rows as follows:

>>>df.drop(df.index[[0, 1]])
This will delete the rows from index 0 to 1 and only one row will remain since our DataFrame is
composed of 3 rows:

If you want to delete the last row of the DataFrame and do not know the total number of rows,
You can use negative indexing as shown below:

>>>df.drop(df.index[-1])
-1 deletes the last row. Similarly, -2 will delete the last 2 rows and so on.

Sum a column
You can use the sum() method of the DataFrame to sum the elements of the column.

Let's suppose we have the following DataFrame:

>>> frame_data = {'A': [23,12,12],'B': [18,18,22],'C': [13,112,13]}

>>> df = pandas.DataFrame(frame_data)
Now to sum the elements of column A, use the following line of code:

>>> df['A'].sum()

You can also use the apply() method of the DataFrame and pass the sum method.
numpy to sum the values.

Count unique values

To count unique values in a column, you can use the nunique() method of
DataFrame.

Let's suppose we have a DataFrame as follows:

>>> frame_data = {'A': [23,12,12],'B': [18,18,22],'C': [13,112,13]}

>>> df = pandas.DataFrame(frame_data)
To count the unique values in column A:

>>> df['A'].nunique()

As you can see, column A has only 2 unique values 23 and 12 and the other 12 is a
duplicate, that's why we have 2 in the output.

If you want to count all the values in a column, you can use the count() method of the
next way:

>>> df['A'].count()

Rows of subsets
To select a subset of a DataFrame, you can use brackets.

For example, we have a DataFrame that contains some integers. We can select or
find the subset of a row like this:

df.[start:count]
The starting point will be included in the subset, but the stopping point is not included.
For example, to select 3 rows starting from the first row, you will write:

>>> df[0:3]
The output will be:

That code means to start from the first row which is 0 and select 3 rows.

Similarly, to select the first 2 rows, you will write:

>>> df[0:2]

To select or retrieve a subset with the last row, use negative indexing:

>>> df[-1:]

Write to an Excel
To write a DataFrame to an Excel sheet, we can use the to_excel() method.

To write on an Excel sheet, you need to open the sheet, and to open an Excel sheet,
we will have to import the openpyxl module.

Install openpyxl using pip:

pip install openpyxl

Consider the following example:

>>> import openpyxl

>>> frame_data = {'name': ['James','Jason','Rogers'],'age': [18,20,22],'job': ['Assistant','Manager',
Clerk

>>> df = pandas.DataFrame(frame_data)

>>> df.to_excel("pandasExcel.xlsx", "Sheet1")

The Excel file will look like the following:

Write to a CSV file

Similarly, to write a DataFrame to CSV, you can use the to_csv() method.
as shown in the following line of code.

Save the DataFrame to a CSV file named 'pandasCSV.csv'

The output file will be like the following:
Write to SQL
To write data in SQL, we can use the to_sql() method.

Consider the following example:

import sqlite3

import pandas

con = sqlite3.connect('mydatabase.db')

frame_data = {'name': ['James','Jason','Rogers'],'age': [18,20,22],'job': ['Assistant','Manager',

Clerk

df = pandas.DataFrame(frame_data)

df.to_sql('users', con)
In this code, we create a connection to a sqlite3 database. Then we create a
DataFrame with three rows and three columns.

Finally, we use the to_sql method of our DataFrame (df) and pass the name of
the table where the data will be stored along with the connection object.
The SQL database will look like this:

Write to JSON
You can use the DataFrame's to_json() method to write to a JSON file.

This is demonstrated in the following example:

df.to_json("myJson.json")
In this line of code, the name of the JSON file is passed as an argument. The
The DataFrame will be stored in the JSON file. The file will contain the following content:
Write in an HTML file
You can use the DataFrame's to_html() method to create an HTML file with the
content of the DataFrame.

Consider the following example:

>>> df.to_html("myhtml.html")
The results file will have the following content:

When you open the HTML file in the browser, it will look like this:

For Assignment-3 (Final - Pandas - Lab)
No ratings yet
For Assignment-3 (Final - Pandas - Lab)
40 pages
DataFrame Ac Win Final
No ratings yet
DataFrame Ac Win Final
30 pages
Pandas DataFrame
No ratings yet
Pandas DataFrame
70 pages
Dataframes-I (Create - Selection)
No ratings yet
Dataframes-I (Create - Selection)
12 pages
Ainotes Dataframe
No ratings yet
Ainotes Dataframe
5 pages
Ilovepdf Merged
No ratings yet
Ilovepdf Merged
16 pages
Pandas Dataframe
No ratings yet
Pandas Dataframe
8 pages
Pandas Notes
No ratings yet
Pandas Notes
20 pages
Data Frames
No ratings yet
Data Frames
60 pages
Pandas (Assignment 3)
No ratings yet
Pandas (Assignment 3)
24 pages
IP 12th Chapter 3
No ratings yet
IP 12th Chapter 3
9 pages
Unit-4Introduction To Pandas
No ratings yet
Unit-4Introduction To Pandas
44 pages
Pandas DataFrame Guide for Informatics
No ratings yet
Pandas DataFrame Guide for Informatics
11 pages
Ainotes
No ratings yet
Ainotes
5 pages
Pandas Handbook
No ratings yet
Pandas Handbook
33 pages
Pandas 1705297450
No ratings yet
Pandas 1705297450
21 pages
Starting Out With Pandas - Ext
No ratings yet
Starting Out With Pandas - Ext
18 pages
Introduction To Pandas
No ratings yet
Introduction To Pandas
27 pages
Pandas
No ratings yet
Pandas
13 pages
Pandas
No ratings yet
Pandas
26 pages
05 Pandas Data Frames
No ratings yet
05 Pandas Data Frames
33 pages
Pandas - Digitalocean
No ratings yet
Pandas - Digitalocean
15 pages
Python 3rd Unit Question and Answer
No ratings yet
Python 3rd Unit Question and Answer
25 pages
Pandas DataFrame Basics Guide
No ratings yet
Pandas DataFrame Basics Guide
32 pages
Unit6 - Working With Data
No ratings yet
Unit6 - Working With Data
29 pages
Pandas
No ratings yet
Pandas
5 pages
Data Handing Using Pandas-I
100% (2)
Data Handing Using Pandas-I
46 pages
Data Frames
No ratings yet
Data Frames
10 pages
Pandas Tutorial
No ratings yet
Pandas Tutorial
33 pages
Data Handling Using Pandas-1
No ratings yet
Data Handling Using Pandas-1
60 pages
Python Pandas Demo PDF
100% (2)
Python Pandas Demo PDF
23 pages
Unit III - Notes
No ratings yet
Unit III - Notes
12 pages
Lab 9
No ratings yet
Lab 9
9 pages
Pandas DataFrame Basics Guide
No ratings yet
Pandas DataFrame Basics Guide
4 pages
DevOps Session 3 Pandas
No ratings yet
DevOps Session 3 Pandas
33 pages
Python Pandas-Data Frames
No ratings yet
Python Pandas-Data Frames
41 pages
Pandas DataFrame Basics
No ratings yet
Pandas DataFrame Basics
48 pages
Data Frame
No ratings yet
Data Frame
17 pages
3Y3Z2Xzqn7 U Y%K : 2. How To Create A Data Frame Using A Dictionary of Pre-Existing Columns or Numpy 2D Arrays?
No ratings yet
3Y3Z2Xzqn7 U Y%K : 2. How To Create A Data Frame Using A Dictionary of Pre-Existing Columns or Numpy 2D Arrays?
8 pages
Pandas Basics
No ratings yet
Pandas Basics
84 pages
Dataframes-I (Create & Selection)
No ratings yet
Dataframes-I (Create & Selection)
10 pages
Dataframe Ip
No ratings yet
Dataframe Ip
75 pages
Pandas Questions
No ratings yet
Pandas Questions
11 pages
Data Frames
No ratings yet
Data Frames
42 pages
Unit 2 notes-II
No ratings yet
Unit 2 notes-II
47 pages
Unit 4.2
No ratings yet
Unit 4.2
24 pages
Pandas Data Structures: Sections
No ratings yet
Pandas Data Structures: Sections
13 pages
Pandas for Data Science Beginners
No ratings yet
Pandas for Data Science Beginners
2 pages
Pandas DataFrame Notes
No ratings yet
Pandas DataFrame Notes
13 pages
Unit 4 Pandas
No ratings yet
Unit 4 Pandas
8 pages
Pandas for Data Analysis Beginners
No ratings yet
Pandas for Data Analysis Beginners
89 pages
Pandas (PPT 5)
No ratings yet
Pandas (PPT 5)
16 pages
Python Pandas for Data Science
No ratings yet
Python Pandas for Data Science
59 pages
Pandas Notes
No ratings yet
Pandas Notes
44 pages
SBLC 1
No ratings yet
SBLC 1
23 pages
Python Data Science 101
100% (1)
Python Data Science 101
41 pages
AI Student HandbookXII 2025-26!8!20
No ratings yet
AI Student HandbookXII 2025-26!8!20
13 pages
Shoe upper model maker
No ratings yet
Shoe upper model maker
4 pages
9. TOOLBOX
No ratings yet
9. TOOLBOX
5 pages
evaluation module 1 sales management
No ratings yet
evaluation module 1 sales management
5 pages
Diagnostic Evaluation of the Area of Education for Work
No ratings yet
Diagnostic Evaluation of the Area of Education for Work
2 pages
Problems of Ford Motor Company in the Year 2000
No ratings yet
Problems of Ford Motor Company in the Year 2000
3 pages
sisu202_regular_call_homologation_stage1
No ratings yet
sisu202_regular_call_homologation_stage1
22 pages
Writing letters in French
No ratings yet
Writing letters in French
3 pages
Financial indicators questionnaire
No ratings yet
Financial indicators questionnaire
21 pages
GIINN_U3_A3_RORF
No ratings yet
GIINN_U3_A3_RORF
6 pages
Essay on Good Luck
No ratings yet
Essay on Good Luck
2 pages
The secret of bathroom measurements
No ratings yet
The secret of bathroom measurements
3 pages
Pablo's strategies for church growth
No ratings yet
Pablo's strategies for church growth
3 pages
Height Work Test
No ratings yet
Height Work Test
2 pages
Rule of Rose Walkthrough
No ratings yet
Rule of Rose Walkthrough
5 pages
OIL PAINTING
No ratings yet
OIL PAINTING
9 pages
TOPIC 51
No ratings yet
TOPIC 51
9 pages
COPLAS ON THE DISCOVERY OF AMERICA
No ratings yet
COPLAS ON THE DISCOVERY OF AMERICA
2 pages
Copy of Solution Manual_Unit_02
No ratings yet
Copy of Solution Manual_Unit_02
16 pages
PARENTS' MEETING AGENDA 2024
No ratings yet
PARENTS' MEETING AGENDA 2024
3 pages
Wood Work Guide
No ratings yet
Wood Work Guide
20 pages
Preterite and Imperfect or Indefinite
No ratings yet
Preterite and Imperfect or Indefinite
4 pages
Unit 3 and 4 RESEARCH.docx
No ratings yet
Unit 3 and 4 RESEARCH.docx
29 pages
HUMAN RESOURCES IN SODIMAC.docx
No ratings yet
HUMAN RESOURCES IN SODIMAC.docx
8 pages
Guide 06_REMOTE STORAGE_2020-II
No ratings yet
Guide 06_REMOTE STORAGE_2020-II
11 pages
THE POWER OF HARMONIC RESONANCE
No ratings yet
THE POWER OF HARMONIC RESONANCE
21 pages
BACKGROUND of Community Center Project
No ratings yet
BACKGROUND of Community Center Project
8 pages
WES form
No ratings yet
WES form
2 pages
Virtual Assistant (Monograph)
No ratings yet
Virtual Assistant (Monograph)
10 pages
azangaro.docx
No ratings yet
azangaro.docx
9 pages
physics exam pre.docx
No ratings yet
physics exam pre.docx
3 pages
Skid Steer Loader L225 Parts Catalog
83% (6)
Skid Steer Loader L225 Parts Catalog
853 pages
F 2 PDF
No ratings yet
F 2 PDF
9 pages
IC Engines
No ratings yet
IC Engines
37 pages
IndividualTaskReport - ESPINOZA, JOAN
No ratings yet
IndividualTaskReport - ESPINOZA, JOAN
2 pages
Aquatic Plant Presentation
No ratings yet
Aquatic Plant Presentation
17 pages
General Engineering PDF
No ratings yet
General Engineering PDF
12 pages
Emerging Land Policy Issues in India
No ratings yet
Emerging Land Policy Issues in India
20 pages
Price of AIO Solar Street Light
No ratings yet
Price of AIO Solar Street Light
3 pages
Stuudy Case
No ratings yet
Stuudy Case
8 pages
Classical and Marginal Economics Overview
100% (1)
Classical and Marginal Economics Overview
5 pages
Presentation of ENISA Study - Recommendations - Christina Skouloudi
No ratings yet
Presentation of ENISA Study - Recommendations - Christina Skouloudi
31 pages
Semantic Structure & Translation Theory
0% (1)
Semantic Structure & Translation Theory
13 pages
JAVA Chapter 4
No ratings yet
JAVA Chapter 4
1 page
The Shiphandlers Guide
No ratings yet
The Shiphandlers Guide
143 pages
PM - I CIA
No ratings yet
PM - I CIA
5 pages
25x26 House Plan From House Construction Telegu YouTube Channel
No ratings yet
25x26 House Plan From House Construction Telegu YouTube Channel
1 page
From Pseudo Code To Program Code
No ratings yet
From Pseudo Code To Program Code
24 pages
DAA Experiment - 3
No ratings yet
DAA Experiment - 3
40 pages
Introduction To C++
No ratings yet
Introduction To C++
12 pages
CTO-20AC Data Sheet
No ratings yet
CTO-20AC Data Sheet
3 pages
Preschool Daily Schedule
No ratings yet
Preschool Daily Schedule
1 page
Theory 2 - Code of Ethics For Professional Teacher & Historical Development of Teaching
No ratings yet
Theory 2 - Code of Ethics For Professional Teacher & Historical Development of Teaching
5 pages
Dre8 Progress Test 2 A
No ratings yet
Dre8 Progress Test 2 A
3 pages
M01 MCS Machine Installation and Commissioning TM
No ratings yet
M01 MCS Machine Installation and Commissioning TM
43 pages
53302337203
No ratings yet
53302337203
3 pages
Focus-On Opta en
No ratings yet
Focus-On Opta en
3 pages
JCB ENGLISH Fault Finding COMPLETE PDF
97% (29)
JCB ENGLISH Fault Finding COMPLETE PDF
129 pages
Numerical Investigations of Gas-Liquid Two-Phase Flow in A Pump Inducer
No ratings yet
Numerical Investigations of Gas-Liquid Two-Phase Flow in A Pump Inducer
46 pages
Mechanical Engineering Review 2 Fundamentals Thermodynamics
No ratings yet
Mechanical Engineering Review 2 Fundamentals Thermodynamics
5 pages
VLSI Design MCQs & Answers
0% (1)
VLSI Design MCQs & Answers
20 pages

Pandas and Python

Uploaded by

Pandas and Python

Uploaded by

Pandas and Python

Pandas is an open-source Python library that provides analysis and

Also, you can install pandas using conda like this:

Read an Excel file

Install xlrd using pip:

The following example shows how to read from an Excel sheet:

1. We create an Excel sheet with the following contents:

2. Import the pandas module.

Import CSV file

Import the pandas module:

Read a text file

Since pandas has no idea of the delimiter, translate the tab to \ t.

It seems correct now.

pandas.read_sql('select * from Employee', con)

select Name, Job from Employee

We can also select a column from a table by accessing the DataFrame.

x = pandas.read_sql('select * from Employee', con)

Select rows by value

To create a DataFrame, consider the following code:

frame_data = {'name': ['James','Jason','Rogers'],'age': [18,20,22],'job': ['Assistant','Manager',

To select a row according to its value, execute the following statement

The output will be:

Select row by index

Consider the following code:

>>> frame_data = {'name': ['James','Jason','Rogers'],'age': [18,20,22],'job': ['Assistant','Manager',

Change column type

Now to convert the data type from one to another:

>>> df.name = df.name.astype(str)

Apply a function to columns / rows

Consider the following example:

>>> frame_data = {'A': [1,2,3],'B': [18,20,22],'C': [54,12,13]}

To apply a sum function, the code will be:

Sort values / sort by column

Create a DataFrame with integer values:

>>> frame_data = {'A': [23,12,30],'B': [18,20,22],'C': [54,112,13]}

Sort df by columns 'A' and 'B'.

Remove / Delete duplicates

Consider the following example:

>>> frame_data = {'name': ['James','Jason','Rogers','Jason'],'age': [18,20,22,20],'job': ['Assistant',

Remove duplicates by column

For example, we have the following DataFrame:

The provided text does not contain translatable content.

Consider the following example:

>>> df.drop(['job'], axis=1)

The result will be:

Let's assume we have the following DataFrame:

>>> frame_data = {'name': ['James','Jason','Rogers'],'age': [18,20,22],'job': ['Assistant','Manager',

>>> frame_data = {'name': ['James','Jason','Rogers'],'age': [18,20,22],'job': ['Assistant','Manager',

>>> df = pandas.DataFrame(frame_data, index = ['James','Jason','Rogers'])

You can also delete a range of rows as follows:

Let's suppose we have the following DataFrame:

>>> frame_data = {'A': [23,12,12],'B': [18,18,22],'C': [13,112,13]}

Count unique values

Let's suppose we have a DataFrame as follows:

Similarly, to select the first 2 rows, you will write:

Install openpyxl using pip:

pip install openpyxl

Consider the following example:

>>> import openpyxl

>>> df.to_excel("pandasExcel.xlsx", "Sheet1")

Write to a CSV file

Save the DataFrame to a CSV file named 'pandasCSV.csv'

Consider the following example:

frame_data = {'name': ['James','Jason','Rogers'],'age': [18,20,22],'job': ['Assistant','Manager',

This is demonstrated in the following example:

Consider the following example:

You might also like