Download And Zip File R
This article explains how one can perform various operations on a zip file using a simple python program.
What is a zip file?
Attention geek! Strengthen your foundations with the Python Programming Foundation Course and learn the basics.
To begin with, your interview preparations Enhance your Data Structures concepts with the Python DS Course. And to begin with your Machine Learning Journey, join the Machine Learning - Basic Level Course
ZIP is an archive file format that supports lossless data compression. By lossless compression, we mean that the compression algorithm allows the original data to be perfectly reconstructed from the compressed data. So, a ZIP file is a single file containing one or more compressed files, offering an ideal way to make large files smaller and keep related files together.
Why do we need zip files?
- To reduce storage requirements.
- To improve transfer speed over standard connections.
To work on zip files using python, we will use an inbuilt python module called zipfile.
1. Extracting a zip file
from
zipfile
import
ZipFile
file_name
=
"my_python_files.zip"
with ZipFile(file_name,
'r'
) as
zip
:
zip
.printdir()
print
(
'Extracting all the files now...'
)
zip
.extractall()
print
(
'Done!'
)
The above program extracts a zip file named "my_python_files.zip" in the same directory as of this python script.
The output of above program may look like this:
Let us try to understand the above code in pieces:
-
from zipfile import ZipFile
ZipFile is a class of zipfile module for reading and writing zip files. Here we import only class ZipFile from zipfile module.
-
with ZipFile(file_name, 'r') as zip:
Here, a ZipFile object is made by calling ZipFile constructor which accepts zip file name and mode parameters. We create a ZipFile object in READ mode and name it as zip.
-
zip.printdir()
printdir() method prints a table of contents for the archive.
-
zip.extractall()
extractall() method will extract all the contents of the zip file to the current working directory. You can also call extract() method to extract any file by specifying its path in the zip file.
For example:zip.extract('python_files/python_wiki.txt')
This will extract only the specified file.
If you want to read some specific file, you can go like this:
data = zip.read(name_of_file_to_read)
2. Writing to a zip file
Consider a directory (folder) with such a format:
Here, we will need to crawl the whole directory and its sub-directories in order to get a list of all file paths before writing them to a zip file.
The following program does this by crawling the directory to be zipped:
from
zipfile
import
ZipFile
import
os
def
get_all_file_paths(directory):
file_paths
=
[]
for
root, directories, files
in
os.walk(directory):
for
filename
in
files:
filepath
=
os.path.join(root, filename)
file_paths.append(filepath)
return
file_paths
def
main():
directory
=
'./python_files'
file_paths
=
get_all_file_paths(directory)
print
(
'Following files will be zipped:'
)
for
file_name
in
file_paths:
print
(file_name)
with ZipFile(
'my_python_files.zip'
,
'w'
) as
zip
:
for
file
in
file_paths:
zip
.write(
file
)
print
(
'All files zipped successfully!'
)
if
__name__
=
=
"__main__"
:
main()
The output of above program looks like this:
Let us try to understand above code by dividing into fragments:
-
def get_all_file_paths(directory): file_paths = [] for root, directories, files in os.walk(directory): for filename in files: filepath = os.path.join(root, filename) file_paths.append(filepath) return file_paths
First of all, to get all file paths in our directory, we have created this function which uses the os.walk() method. In each iteration, all files present in that directory are appended to a list called file_paths.
In the end, we return all the file paths. -
file_paths = get_all_file_paths(directory)
Here we pass the directory to be zipped to the get_all_file_paths() function and obtain a list containing all file paths.
-
with ZipFile('my_python_files.zip','w') as zip:
Here, we create a ZipFile object in WRITE mode this time.
-
for file in file_paths: zip.write(file)
Here, we write all the files to the zip file one by one using write method.
3. Getting all information about a zip file
from
zipfile
import
ZipFile
import
datetime
file_name
=
"example.zip"
with ZipFile(file_name,
'r'
) as
zip
:
for
info
in
zip
.infolist():
print
(info.filename)
print
(
'\tModified:\t'
+
str
(datetime.datetime(
*
info.date_time)))
print
(
'\tSystem:\t\t'
+
str
(info.create_system)
+
'(0 = Windows, 3 = Unix)'
)
print
(
'\tZIP version:\t'
+
str
(info.create_version))
print
(
'\tCompressed:\t'
+
str
(info.compress_size)
+
' bytes'
)
print
(
'\tUncompressed:\t'
+
str
(info.file_size)
+
' bytes'
)
The output of above program may look like this:
for info in zip.infolist():
Here, infolist() method creates an instance of ZipInfo class which contains all the information about the zip file.
We can access all information like last modification date of files, file names, system on which files were created, Zip version, size of files in compressed and uncompressed form, etc.
This article is contributed by Nikhil Kumar. If you like GeeksforGeeks and would like to contribute, you can also write an article using write.geeksforgeeks.org or mail your article to review-team@geeksforgeeks.org. See your article appearing on the GeeksforGeeks main page and help other Geeks.
Please write comments if you find anything incorrect, or you want to share more information about the topic discussed above.
Posted by: downdusk.blogspot.com
Source: https://www.geeksforgeeks.org/working-zip-files-python/