KEMBAR78
Python File Handling PDF | PDF | Comma Separated Values | Text File
100% found this document useful (1 vote)
3K views20 pages

Python File Handling PDF

The program takes a file name as input, checks if the file exists, opens

Uploaded by

Rajendra Buchade
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
100% found this document useful (1 vote)
3K views20 pages

Python File Handling PDF

The program takes a file name as input, checks if the file exists, opens

Uploaded by

Rajendra Buchade
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 20

Learn

Complete
Python
In
Simple Way

1 https://www.youtube.com/durgasoftware
FILE
HANDLING
STUDY MATERIAL

2 https://www.youtube.com/durgasoftware
 As the part of programming requirement, we have to store our data permanently for
future purpose. For this requirement we should go for files.
 Files are very common permanent storage areas to store our data.

Types of Files:
There are 2 types of files

1) Text Files:
Usually we can use text files to store character data
Eg: abc.txt

2) Binary Files:
Usually we can use binary files to store binary data like images,video files, audio files
etc...

Opening a File:
 Before performing any operation (like read or write) on the file,first we have to open
that file.For this we should use Python's inbuilt function open()
 But at the time of open, we have to specify mode,which represents the purpose of
opening file.
f = open(filename, mode)

The allowed modes in Python are

1) r  open an existing file for read operation. The file pointer is positioned at the
beginning of the file.If the specified file does not exist then we will get
FileNotFoundError.This is default mode.

2) w  open an existing file for write operation. If the file already contains some data
then it will be overridden. If the specified file is not already avaialble then this mode will
create that file.

3) a  open an existing file for append operation. It won't override existing data.If the
specified file is not already avaialble then this mode will create a new file.

4) r+  To read and write data into the file. The previous data in the file will not be
deleted.The file pointer is placed at the beginning of the file.
5) w+  To write and read data. It will override existing data.

6) a+  To append and read data from the file.It wont override existing data.

3 https://www.youtube.com/durgasoftware
7) x  To open a file in exclusive creation mode for write operation. If the file already
exists then we will get FileExistsError.

Note: All the above modes are applicable for text files. If the above modes suffixed with
'b' then these represents for binary files.

Eg: rb,wb,ab,r+b,w+b,a+b,xb

f = open("abc.txt","w")
We are opening abc.txt file for writing data.

Closing a File:
After completing our operations on the file, it is highly recommended to close the file.
For this we have to use close() function.
f.close()

Various Properties of File Object:


Once we opend a file and we got file object, we can get various details related to that file
by using its properties.
 name  Name of opened file
 mode  Mode in which the file is opened
 closed  Returns boolean value indicates that file is closed or not
 readable() Retruns boolean value indicates that whether file is readable or not
 writable() Returns boolean value indicates that whether file is writable or not.

1) f=open("abc.txt",'w')
2) print("File Name: ",f.name)
3) print("File Mode: ",f.mode)
4) print("Is File Readable: ",f.readable())
5) print("Is File Writable: ",f.writable())
6) print("Is File Closed : ",f.closed)
7) f.close()
8) print("Is File Closed : ",f.closed)

Output
D:\Python_classes>py test.py
File Name: abc.txt
File Mode: w
Is File Readable: False
Is File Writable: True
Is File Closed: False
Is File Closed: True
4 https://www.youtube.com/durgasoftware
Writing Data to Text Files:
We can write character data to the text files by using the following 2 methods.
1) write(str)
2) writelines(list of lines)

1) f=open("abcd.txt",'w')
2) f.write("Durga\n")
3) f.write("Software\n")
4) f.write("Solutions\n")
5) print("Data written to the file successfully")
6) f.close()

abcd.txt:
Durga
Software
Solutions

Note: In the above program, data present in the file will be overridden everytime if we
run the program. Instead of overriding if we want append operation then we should open
the file as follows.

f = open("abcd.txt","a")
Eg 2:

1) f=open("abcd.txt",'w')
2) list=["sunny\n","bunny\n","vinny\n","chinny"]
3) f.writelines(list)
4) print("List of lines written to the file successfully")
5) f.close()

abcd.txt:
sunny
bunny
vinny
chinny

Note: While writing data by using write() methods, compulsory we have to provide line
seperator(\n), otherwise total data should be written to a single line.

5 https://www.youtube.com/durgasoftware
Reading Character Data from Text Files:
We can read character data from text file by using the following read methods.

 read() To read total data from the file


 read(n)  To read 'n' characters from the file
 readline() To read only one line
 readlines() To read all lines into a list

Eg 1: To read total data from the file

1) f=open("abc.txt",'r')
2) data=f.read()
3) print(data)
4) f.close()

Output
sunny
bunny
chinny
vinny

Eg 2: To read only first 10 characters:

1) f=open("abc.txt",'r')
2) data=f.read(10)
3) print(data)
4) f.close()

Output
sunny
bunn

Eg 3: To read data line by line:

1) f=open("abc.txt",'r')
2) line1=f.readline()
3) print(line1,end='')
4) line2=f.readline()
5) print(line2,end='')
6) line3=f.readline()
7) print(line3,end='')
8) f.close()

6 https://www.youtube.com/durgasoftware
Output
sunny
bunny
chinny

Eg 4: To read all lines into list:

1) f=open("abc.txt",'r')
2) lines=f.readlines()
3) for line in lines:
4) print(line,end='')
5) f.close()

Output
sunny
bunny
chinny
vinny

Eg 5:

1) f=open("abc.txt","r")
2) print(f.read(3))
3) print(f.readline())
4) print(f.read(4))
5) print("Remaining data")
6) print(f.read())

Output
sun
ny

bunn
Remaining data
y
chinny
vinny

7 https://www.youtube.com/durgasoftware
The with Statement:
 The with statement can be used while opening a file.We can use this to group file
operation statements within a block.
 The advantage of with statement is it will take care closing of file,after completing all
operations automatically even in the case of exceptions also, and we are not required
to close explicitly.

1) with open("abc.txt","w") as f:
2) f.write("Durga\n")
3) f.write("Software\n")
4) f.write("Solutions\n")
5) print("Is File Closed: ",f.closed)
6) print("Is File Closed: ",f.closed)

Output
Is File Closed: False
Is File Closed: True

The seek() and tell() Methods:


tell():
 We can use tell() method to return current position of the cursor(file pointer) from
beginning of the file. [ can you plese telll current cursor position]
 The position(index) of first character in files is zero just like string index.

1) f=open("abc.txt","r")
2) print(f.tell())
3) print(f.read(2))
4) print(f.tell())
5) print(f.read(3))
6) print(f.tell())

abc.txt:
sunny
bunny
chinny
vinny

8 https://www.youtube.com/durgasoftware
Output:
0
su
2
nny
5

seek():
We can use seek() method to move cursor (file pointer) to specified location.
[Can you please seek the cursor to a particular location]
f.seek(offset, fromwhere)  offset represents the number of positions
The allowed Values for 2nd Attribute (from where) are
0  From beginning of File (Default Value)
1  From Current Position
2  From end of the File

Note: Python 2 supports all 3 values but Python 3 supports only zero.

1) data="All Students are STUPIDS"


2) f=open("abc.txt","w")
3) f.write(data)
4) with open("abc.txt","r+") as f:
5) text=f.read()
6) print(text)
7) print("The Current Cursor Position: ",f.tell())
8) f.seek(17)
9) print("The Current Cursor Position: ",f.tell())
10) f.write("GEMS!!!")
11) f.seek(0)
12) text=f.read()
13) print("Data After Modification:")
14) print(text)

Output
All Students are STUPIDS
The Current Cursor Position: 24
The Current Cursor Position: 17
Data After Modification:
All Students are GEMS!!!

9 https://www.youtube.com/durgasoftware
How to check a particular File exists OR not?
We can use os library to get information about files in our computer.
os module has path sub module,which contains isFile() function to check whether a
particular file exists or not?
os.path.isfile(fname)

Q) Write a Program to check whether the given File exists OR not. If it is


available then print its content?

1) import os,sys
2) fname=input("Enter File Name: ")
3) if os.path.isfile(fname):
4) print("File exists:",fname)
5) f=open(fname,"r")
6) else:
7) print("File does not exist:",fname)
8) sys.exit(0)
9) print("The content of file is:")
10) data=f.read()
11) print(data)

Output
D:\Python_classes>py test.py
Enter File Name: durga.txt
File does not exist: durga.txt
D:\Python_classes>py test.py
Enter File Name: abc.txt
File exists: abc.txt
The content of file is:
All Students are GEMS!!!
All Students are GEMS!!!
All Students are GEMS!!!
All Students are GEMS!!!
All Students are GEMS!!!
All Students are GEMS!!!

Note:
sys.exit(0)  To exit system without executing rest of the program.
argument represents status code. 0 means normal termination and it is the default value.

10 https://www.youtube.com/durgasoftware
Q) Program to print the Number of Lines, Words and Characters
present in the given File?
1) import os,sys
2) fname=input("Enter File Name: ")
3) if os.path.isfile(fname):
4) print("File exists:",fname)
5) f=open(fname,"r")
6) else:
7) print("File does not exist:",fname)
8) sys.exit(0)
9) lcount=wcount=ccount=0
10) for line in f:
11) lcount=lcount+1
12) ccount=ccount+len(line)
13) words=line.split()
14) wcount=wcount+len(words)
15) print("The number of Lines:",lcount)
16) print("The number of Words:",wcount)
17) print("The number of Characters:",ccount)

Output
D:\Python_classes>py test.py
Enter File Name: durga.txt
File does not exist: durga.txt

D:\Python_classes>py test.py
Enter File Name: abc.txt
File exists: abc.txt
The number of Lines: 6
The number of Words: 24
The number of Characters: 149

abc.txt
All Students are GEMS!!!
All Students are GEMS!!!
All Students are GEMS!!!
All Students are GEMS!!!
All Students are GEMS!!!
All Students are GEMS!!!

11 https://www.youtube.com/durgasoftware
Handling Binary Data:
It is very common requirement to read or write binary data like images,video files,audio
files etc.

Q) Program to read Image File and write to a New Image File?

1) f1=open("rossum.jpg","rb")
2) f2=open("newpic.jpg","wb")
3) bytes=f1.read()
4) f2.write(bytes)
5) print("New Image is available with the name: newpic.jpg")

Handling CSV Files:


⚽ CSV  Comma seperated values
⚽ As the part of programming, it is very common requirement to write and read data wrt
csv files. Python provides csv module to handle csv files.

Writing Data to CSV File:


1) import csv
2) with open("emp.csv","w",newline='') as f:
3) w=csv.writer(f) # returns csv writer object
4) w.writerow(["ENO","ENAME","ESAL","EADDR"])
5) n=int(input("Enter Number of Employees:"))
6) for i in range(n):
7) eno=input("Enter Employee No:")
8) ename=input("Enter Employee Name:")
9) esal=input("Enter Employee Salary:")
10) eaddr=input("Enter Employee Address:")
11) w.writerow([eno,ename,esal,eaddr])
12) print("Total Employees data written to csv file successfully")

Note: Observe the difference with newline attribute and without


with open("emp.csv","w",newline='') as f:
with open("emp.csv","w") as f:

Note: If we are not using newline attribute then in the csv file blank lines will be included
between data. To prevent these blank lines, newline attribute is required in Python-3,but
in Python-2 just we can specify mode as 'wb' and we are not required to use newline
attribute.

12 https://www.youtube.com/durgasoftware
Reading Data from CSV File:
1) import csv
2) f=open("emp.csv",'r')
3) r=csv.reader(f) #returns csv reader object
4) data=list(r)
5) #print(data)
6) for line in data:
7) for word in line:
8) print(word,"\t",end='')
9) print()

Output
D:\Python_classes>py test.py

ENO ENAME ESAL EADDR


100 Durga 1000 Hyd
200 Sachin 2000 Mumbai
300 Dhoni 3000 Ranchi

Zipping and Unzipping Files:


It is very common requirement to zip and unzip files.

The main advantages are:


1) To improve memory utilization
2) We can reduce transport time
3) We can improve performance.

To perform zip and unzip operations, Python contains one in-bulit module zip file.
This module contains a class: ZipFile

To Create Zip File:


We have to create ZipFile class object with name of the zip file, mode and constant
ZIP_DEFLATED. This constant represents we are creating zip file.
f = ZipFile("files.zip","w","ZIP_DEFLATED")

Once we create ZipFile object,we can add files by using write() method.
f.write(filename)

13 https://www.youtube.com/durgasoftware
1) from zipfile import *
2) f=ZipFile("files.zip",'w',ZIP_DEFLATED)
3) f.write("file1.txt")
4) f.write("file2.txt")
5) f.write("file3.txt")
6) f.close()
7) print("files.zip file created successfully")

To perform unzip Operation:


We have to create ZipFile object as follows
f = ZipFile("files.zip","r",ZIP_STORED)

ZIP_STORED represents unzip operation. This is default value and hence we are not
required to specify.
Once we created ZipFile object for unzip operation, we can get all file names present in
that zip file by using namelist() method.
names = f.namelist()

1) from zipfile import *


2) f=ZipFile("files.zip",'r',ZIP_STORED)
3) names=f.namelist()
4) for name in names:
5) print( "File Name: ",name)
6) print("The Content of this file is:")
7) f1=open(name,'r')
8) print(f1.read())
9) print()

Working with Directories:


It is very common requirement to perform operations for directories like
1) To know current working directory
2) To create a new directory
3) To remove an existing directory
4) To rename a directory
5) To list contents of the directory
etc...
To perform these operations, Python provides inbuilt module os,which contains several
functions to perform directory related operations.

14 https://www.youtube.com/durgasoftware
Q1) To Know Current Working Directory
import os
cwd = os.getcwd()
print("Current Working Directory:",cwd)

Q2) To Create a Sub Directory in the Current Working Directory


import os
os.mkdir("mysub")
print("mysub directory created in cwd")

Q3) To Create a Sub Directory in mysub Directory


cwd
|-mysub
|-mysub2

import os
os.mkdir("mysub/mysub2")
print("mysub2 created inside mysub")

Note: Assume mysub already present in cwd.

Q4) To Create Multiple Directories like sub1 in that sub2 in that


sub3
import os
os.makedirs("sub1/sub2/sub3")
print("sub1 and in that sub2 and in that sub3 directories created")
Q5) To Remove a Directory
import os
os.rmdir("mysub/mysub2")
print("mysub2 directory deleted")

Q6) To Remove Multiple Directories in the Path


import os
os.removedirs("sub1/sub2/sub3")
print("All 3 directories sub1,sub2 and sub3 removed")

Q7) To Rename a Directory


import os
os.rename("mysub","newdir")
print("mysub directory renamed to newdir")

15 https://www.youtube.com/durgasoftware
Q8) To know Contents of Directory
OS Module provides listdir() to list out the contents of the specified directory. It
won't display the contents of sub directory.

1) import os
2) print(os.listdir("."))

Output
D:\Python_classes>py test.py
['abc.py', 'abc.txt', 'abcd.txt', 'com', 'demo.py', 'durgamath.py', 'emp.csv', '
file1.txt', 'file2.txt', 'file3.txt', 'files.zip', 'log.txt', 'module1.py', 'myl
og.txt', 'newdir', 'newpic.jpg', 'pack1', 'rossum.jpg', 'test.py', '__pycache__']

 The above program display contents of current working directory but not contents of
sub directories.
 If we want the contents of a directory including sub directories then we should go for
walk() function.

Q9) To Know Contents of Directory including Sub Directories


 We have to use walk() function
 [Can you please walk in the directory so that we can aware all contents of that
directory]
 os.walk(path, topdown = True, onerror = None, followlinks = False)
 It returns an Iterator object whose contents can be displayed by using for loop

 path  Directory Path. cwd means .


 topdown = True  Travel from top to bottom
 onerror = None  On error detected which function has to execute.
 followlinks = True  To visit directories pointed by symbolic links

Eg: To display all contents of Current working directory including sub directories:

1) import os
2) for dirpath,dirnames,filenames in os.walk('.'):
3) print("Current Directory Path:",dirpath)
4) print("Directories:",dirnames)
5) print("Files:",filenames)
6) print()

Output
Current Directory Path: .
Directories: ['com', 'newdir', 'pack1', '__pycache__']

16 https://www.youtube.com/durgasoftware
Files: ['abc.txt', 'abcd.txt', 'demo.py', 'durgamath.py', 'emp.csv', 'file1.txt', 'file2.txt', 'file3.
txt', 'files.zip', 'log.txt', 'module1.py', 'mylog.txt', 'newpic.jpg', 'rossum.jpg', 'test.py']

Current Directory Path: .\com


Directories: ['durgasoft', '__pycache__']
Files: ['module1.py', '__init__.py']
...

Note: To display contents of particular directory, we have to provide that directory name
as argument to walk() function.
os.walk("directoryname")

Q) What is the difference between listdir() and walk() Functions?


In the case of listdir(), we will get contents of specified directory but not sub directory
contents. But in the case of walk() function we will get contents of specified directory
and its sub directories also.

Running Other Programs from Python Program:


OS Module contains system() function to run programs and commands.
It is exactly same as system() function in C language.

os.system("commad string")
The argument is any command which is executing from DOS.

Eg:
import os
os.system("dir *.py")
os.system("py abc.py")

How to get Information about a File:


We can get statistics of a file like size, last accessed time,last modified time etc by using
stat() function of os module.
stats = os.stat("abc.txt")

The statistics of a file includes the following parameters:

1) st_mode  Protection Bits


2) st_ino  Inode number
3) st_dev  Device
4) st_nlink  Number of Hard Links
5) st_uid  User id of Owner
6) st_gid  Group id of Owner

17 https://www.youtube.com/durgasoftware
7) st_size  Size of File in Bytes
8) st_atime  Time of Most Recent Access
9) st_mtime  Time of Most Recent Modification
10) st_ctime  Time of Most Recent Meta Data Change

Note: st_atime, st_mtime and st_ctime returns the time as number of milli seconds since
Jan 1st 1970, 12:00 AM. By using datetime module from timestamp() function, we can get
exact date and time.

Q) To Print all Statistics of File abc.txt


1) import os
2) stats=os.stat("abc.txt")
3) print(stats)

Output
os.stat_result(st_mode=33206, st_ino=844424930132788, st_dev=2657980798, st_nlin
k=1, st_uid=0, st_gid=0, st_size=22410, st_atime=1505451446, st_mtime=1505538999
, st_ctime=1505451446)

Q) To Print specified Properties


1) import os
2) from datetime import *
3) stats=os.stat("abc.txt")
4) print("File Size in Bytes:",stats.st_size)
5) print("File Last Accessed Time:",datetime.fromtimestamp(stats.st_atime))
6) print("File Last Modified Time:",datetime.fromtimestamp(stats.st_mtime))

Output
File Size in Bytes: 22410
File Last Accessed Time: 2017-09-15 10:27:26.599490
File Last Modified Time: 2017-09-16 10:46:39.245394

Pickling and Unpickling of Objects:


 Sometimes we have to write total state of object to the file and we have to read total
object from the file.
 The process of writing state of object to the file is called pickling and the process of
reading state of an object from the file is called unpickling.
 We can implement pickling and unpickling by using pickle module of Python.
 pickle module contains dump() function to perform pickling. pickle.dump(object,file)
 pickle module contains load() function to perform unpickling obj=pickle.load(file)
18 https://www.youtube.com/durgasoftware
pickling
pickle.dump
eno: 100 (e1, f)
ename: Durga
esal: 10000
eaddr: HYD
e1 eno: 100
ename: Durga
esal: 10000
eaddr: HYD

eno: 100 abc.txt


ename: Durga pickle load (f)
esal: 10000
eaddr: HYD unpickling
e2

Writing and Reading State of Object by using pickle Module:


1) import pickle
2) class Employee:
3) def __init__(self,eno,ename,esal,eaddr):
4) self.eno=eno;
5) self.ename=ename;
6) self.esal=esal;
7) self.eaddr=eaddr;
8) def display(self):
9) print(self.eno,"\t",self.ename,"\t",self.esal,"\t",self.eaddr)
10) with open("emp.dat","wb") as f:
11) e=Employee(100,"Durga",1000,"Hyd")
12) pickle.dump(e,f)
13) print("Pickling of Employee Object completed...")
14)
15) with open("emp.dat","rb") as f:
16) obj=pickle.load(f)
17) print("Printing Employee Information after unpickling")
18) obj.display()

19 https://www.youtube.com/durgasoftware
Writing Multiple Employee Objects to the File:
emp.py:

1) class Employee:
2) def __init__(self,eno,ename,esal,eaddr):
3) self.eno=eno;
4) self.ename=ename;
5) self.esal=esal;
6) self.eaddr=eaddr;
7) def display(self):
8)
9) print(self.eno,"\t",self.ename,"\t",self.esal,"\t",self.eaddr)

pick.py:

1) import emp,pickle
2) f=open("emp.dat","wb")
3) n=int(input("Enter The number of Employees:"))
4) for i in range(n):
5) eno=int(input("Enter Employee Number:"))
6) ename=input("Enter Employee Name:")
7) esal=float(input("Enter Employee Salary:"))
8) eaddr=input("Enter Employee Address:")
9) e=emp.Employee(eno,ename,esal,eaddr)
10) pickle.dump(e,f)
11) print("Employee Objects pickled successfully")

unpick.py:

1) import emp,pickle
2) f=open("emp.dat","rb")
3) print("Employee Details:")
4) while True:
5) try:
6) obj=pickle.load(f)
7) obj.display()
8) except EOFError:
9) print("All employees Completed")
10) break
11) f.close()

20 https://www.youtube.com/durgasoftware

You might also like