Data Frames Pandas, Handout 1

Uploaded by

ayaqassas21


Data Frames
Pandas library
1-Pandas library
2-Notepad .csv file
3-Data frames
4-Reading from an Excel file and a .csv file
5-The writer function

Reading data from an Excel file

You have to create this Excel file, then read it and display
it in the output using Python.

Book4.xlsx Excel file

   a  b  c
0  1  5  3
1  1  6  8
2  2  3  7
3  3  1  8

Code to read the file from Excel

You need to install and import the libraries that are
shown below. To install these libraries, use pip3.
# Use pip3 in a command window to install
# 1- pandas 2- openpyxl
# pip3 install pandas
# pip3 install openpyxl

Example: write the script below to read the Excel file

import pandas as pd

# The file name must match the file you created (Book4.xlsx)
df = pd.read_excel('Book4.xlsx')
print(df)
print('done')

###################################
For a Notepad file with the .csv extension
Create comma-separated data in Notepad.
When you save it, choose "All files" as the file
type, then add the extension .csv to the file name.

Example: the code to read the Notepad .csv file

import pandas as pd
df = pd.read_csv('data2.csv')
print(df)
print('done')
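
As an alternative to typing the data in Notepad, the same kind of file can be produced from Python. The sketch below is only an illustration: the three rows are taken from the data2.csv output printed later in this handout, and the file is written to a temporary folder (an assumption of this sketch) so that nothing on disk is overwritten.

```python
import os
import tempfile

import pandas as pd

# Comma-separated text: the first line is the header, then one line per row.
csv_text = "a,b,c\n1,2,3\n2,5,7\n3,8,3\n"

# Write the text to a temporary folder (any writable folder would do).
folder = tempfile.mkdtemp()
path = os.path.join(folder, "data2.csv")
with open(path, "w") as f:
    f.write(csv_text)

# read_csv uses the first line as the column names by default.
df = pd.read_csv(path)
print(df)
```

Opening the generated file in Notepad shows exactly the comma-separated lines held in csv_text.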


Data Frames
A pandas DataFrame is a data structure made of rows and columns,
similar to a database table or an Excel spreadsheet. It can be thought of as a
dictionary of lists in which each list has its own identifier or key, such as
“last name” or “food group.”
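
The "dictionary of lists" idea can be sketched directly. In this minimal example the names and food groups are made-up sample values, not data from the handout; each key becomes a column and each list supplies that column's rows.

```python
import pandas as pd

# Each key becomes a column name; each list holds that column's values.
# The names and foods below are made-up sample data.
data = {
    "last name": ["Baba", "Smith", "Lee"],
    "food group": ["fruit", "dairy", "grain"],
}

df = pd.DataFrame(data)
print(df)

# Selecting by key returns the whole column, much like a dictionary lookup.
print(df["food group"].tolist())
```

All lists must have the same length; otherwise pandas raises an error when building the DataFrame.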

Example
import json
import pandas as pd

# Nested dictionary: each column name maps to {row label: value}
d = {
    "Duration": {
        "0": 60,
        "1": 60,
        "2": 60,
        "3": 45,
    },
    "Maxpulse": {
        "0": 130,
        "1": 145,
        "2": 135,
        "3": 175,
    },
    "Calories": {
        "0": 409.1,
        "1": 479.0,
        "2": 340.0,
        "3": 282.4,
    }
}

# Write the dictionary to a JSON file
with open("dd.json", 'w') as fout:
    json_dumps_str = json.dumps(d, indent=4)
    print(json_dumps_str, file=fout)

# Read the JSON file back into a DataFrame
df = pd.read_json('dd.json')
print(df.to_string())

Output
   Duration  Maxpulse  Calories
0        60       130     409.1
1        60       145     479.0
2        60       135     340.0
3        45       175     282.4

A common way to create a dataframe is to first create a dictionary.

These are the dictionary methods we use

Method        Usage
values()      Returns a view of all the values in the dictionary
update()      Updates the dictionary with the specified key-value pairs
setdefault()  Returns the value of the specified key. If the key does not exist, inserts the key with the specified value
clear()       Removes all the elements from the dictionary
keys()        Returns a view of the keys of the dictionary
pop()         Removes the element with the specified key and returns its value
popitem()     Removes the last inserted key-value pair
get()         Returns the value of the specified key
items()       Returns a view containing a tuple for each key-value pair
copy()        Returns a copy of the dictionary
fromkeys()    Returns a dictionary with the specified keys and value
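
The methods in the table can be tried on a small dictionary. This is a minimal sketch; the dictionary contents (including the "Weight" key) are made-up sample values, and the comments show what each call returns for this particular data.

```python
# Sample dictionary (values are made up for illustration).
record = {"Duration": 60, "Maxpulse": 130}

keys = list(record.keys())        # ['Duration', 'Maxpulse']
values = list(record.values())    # [60, 130]
items = list(record.items())      # [('Duration', 60), ('Maxpulse', 130)]

record.update({"Calories": 409.1})         # add or overwrite pairs
missing = record.get("Weight")             # None, the key does not exist yet
default = record.setdefault("Weight", 70)  # inserts 'Weight': 70 and returns 70

backup = record.copy()                     # independent shallow copy
removed = record.pop("Maxpulse")           # removes the key and returns 130
last_key, last_value = record.popitem()    # removes and returns ('Weight', 70)

blank = dict.fromkeys(["a", "b", "c"], 0)  # {'a': 0, 'b': 0, 'c': 0}

record.clear()                             # record is now {}
print(backup, blank, record)
```

keys(), values() and items() are wrapped in list() here only to get plain lists that are easy to print and compare.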

This is how we create a dictionary and print its headers (keys)
import pandas as pd
import numpy as np

# Every list has two entries, one per person
df = {
    'First name': ['Ali', 'Fred'],
    'Last name': ['Baba', 'Json'],
    'Visit': ['March,19', 'March,25'],
    'Leave': ['April,20', 'April,26']
}

# Get the list of all column names from headers
for column_headers in df:
    print(column_headers, end='\t')

Example: generate a data frame column by column
import pandas as pd
import numpy as np

technologies = {
    'Courses': ["Spark", "PySpark", "Hadoop", "Python", "Pandas"],
    'Fee': [22000, 25000, 23000, 24000, 26000],
    'Duration': ['30days', '50days', '30days', None, np.nan],
    'Discount': [1000, 2300, 1000, 1200, 2500]
}
df = pd.DataFrame(technologies)
print(df)

# Notice the index column generated in the output

# To get the list of all column names from headers
column_headers = list(df.columns.values)
print("The Column Header :", column_headers)

Output:
The Column Header : ['Courses', 'Fee', 'Duration', 'Discount']

# Example: get the list of all column names from headers

column_headers = df.columns.values.tolist()
print("The Column Header :", column_headers)

Output:
The Column Header : ['Courses', 'Fee', 'Duration', 'Discount']

To delete rows and reset the index
import pandas as pd

# Create DataFrame from dict
df = pd.DataFrame({
    'Courses': ['Spark', 'PySpark', 'Java', 'PHP'],
    'Fee': [20000, 20000, 15000, 10000],
    'Duration': ['35days', '35days', '40days', '30days']})
print(df)

# Delete the row with index label 2
df = df.drop([2])
print(df)

# Renumber the rows; the old labels are kept in a new 'index' column
df2 = df.reset_index()
print(df2)

Output
   Courses    Fee Duration
0    Spark  20000   35days
1  PySpark  20000   35days
2     Java  15000   40days
3      PHP  10000   30days

   Courses    Fee Duration
0    Spark  20000   35days
1  PySpark  20000   35days
3      PHP  10000   30days

   index  Courses    Fee Duration
0      0    Spark  20000   35days
1      1  PySpark  20000   35days
2      3      PHP  10000   30days
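
In the output above, reset_index() keeps the old row labels in an extra column named index. If that column is not wanted, pandas accepts drop=True. The sketch below reuses the handout's data; the name df3 is only illustrative.

```python
import pandas as pd

df = pd.DataFrame({
    'Courses': ['Spark', 'PySpark', 'Java', 'PHP'],
    'Fee': [20000, 20000, 15000, 10000],
    'Duration': ['35days', '35days', '40days', '30days'],
})

# Drop the row labelled 2; the remaining labels are 0, 1, 3.
df = df.drop([2])

# drop=True discards the old labels instead of keeping them
# in an extra 'index' column.
df3 = df.reset_index(drop=True)
print(df3)
```

The rows are now labelled 0, 1, 2 and the frame has only the three original columns.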

# Use a .csv file to drop a row

import pandas as pd
df = pd.read_csv('data2.csv')
print(df)

# drop([2]) removes the row with index label 2, not a column
df2 = df.drop([2])
print(df2)
print('done')

Output
   a  b  c
0  1  2  3
1  2  5  7
2  3  8  3

   a  b  c
0  1  2  3
1  2  5  7
done

With the four-row data from Book4.xlsx:
   a  b  c
0  1  5  3
1  1  6  8
2  2  3  7
3  3  1  8

   a  b  c
0  1  5  3
1  1  6  8
3  3  1  8
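
The code above removes a row. To remove a column instead, drop() can be given column names. This is a minimal sketch: the DataFrame is built in memory with the same values as the data2.csv output, so it does not depend on the file being present, and the choice of column 'b' is arbitrary.

```python
import pandas as pd

# Same values as the data2.csv output, built in memory.
df = pd.DataFrame({'a': [1, 2, 3], 'b': [2, 5, 8], 'c': [3, 7, 3]})

# columns=... removes columns by name; the original df is left unchanged.
df2 = df.drop(columns=['b'])
print(df2)

# Equivalent spelling using axis=1 (axis=0 would mean rows).
df3 = df.drop('b', axis=1)
```

Both calls return a new DataFrame with only columns a and c.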

Example: write Excel with Python pandas. An Excel file will be created.
You can write any data (lists, strings, numbers, etc.) to Excel by first converting
it into a pandas DataFrame and then writing the DataFrame to Excel.

To export a pandas DataFrame as an Excel file (extension: .xlsx, .xls), use
the to_excel() method.

Install the following libraries

$ pip install xlwt
$ pip install openpyxl
openpyxl must be installed to write .xlsx files. It is also required if you want
to append to an existing Excel file.

import pandas as pd
import openpyxl

df = pd.DataFrame([[11, 21, 31], [12, 22, 32], [31, 32, 33]],
                  index=['one', 'two', 'three'], columns=['a', 'b', 'c'])

print(df)
#         a   b   c
# one    11  21  31
# two    12  22  32
# three  31  32  33

You can specify a path as the first argument of the to_excel() method.

Note: if the file already exists, its original data is deleted when it is overwritten.

The argument sheet_name is the name of the sheet. If omitted, the sheet will be
named Sheet1.

The output is the Excel file that is created.

import pandas as pd
import openpyxl

df = pd.DataFrame([[11, 21, 31], [12, 22, 32], [31, 32, 33]],
                  index=['one', 'two', 'three'],
                  columns=['a', 'b', 'c'])

print(df)

# Writing to Excel
df.to_excel('pandas_to_excel.xlsx',
            sheet_name='new_sheet_name')

# If you do not need to write the index (row names) or the header
# (column names), set the arguments index and header to False
# df.to_excel('xxx_no_index_header.xlsx', index=False, header=False)

# To write several sheets to one file, use the ExcelWriter() function like this:
with pd.ExcelWriter('pandas_to_excel.xlsx') as writer:
    df.to_excel(writer, sheet_name='sheet1')
    df.to_excel(writer, sheet_name='sheet2')
    # You don't need to call writer.save() or writer.close()
    # within the block.
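
Appending a sheet to an existing Excel file can be sketched with ExcelWriter in append mode. This is a hedged example rather than the handout's own method: it assumes openpyxl is installed, writes into a temporary folder (an assumption of this sketch) so no real file is touched, and reads the file back only to show that both sheets are present.

```python
import os
import tempfile

import pandas as pd

df = pd.DataFrame([[11, 21, 31], [12, 22, 32], [31, 32, 33]],
                  index=['one', 'two', 'three'], columns=['a', 'b', 'c'])

# Temporary folder so that no existing file is overwritten.
path = os.path.join(tempfile.mkdtemp(), 'pandas_to_excel.xlsx')

# The first write creates the file with one sheet.
df.to_excel(path, sheet_name='sheet1')

# mode='a' opens the existing file and adds a sheet instead of overwriting it.
with pd.ExcelWriter(path, mode='a', engine='openpyxl') as writer:
    df.to_excel(writer, sheet_name='sheet2')

# sheet_name=None reads every sheet into a dict keyed by sheet name.
sheets = pd.read_excel(path, sheet_name=None, index_col=0)
print(list(sheets))
```

Without mode='a', opening the same path with ExcelWriter would replace the file and the first sheet would be lost.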
