Unzip One Or Multiple Zipped Files in a Archive With Python

Do you need to unzip files with Python keeping or not the file structure? In this short article you can find several examples on extracting single or multiple files to the same folder.

Step 1: Unzip single file from zip archive

To start, let's have a zip file called: archive.zip. The file is available from here: Stock Exchange Data.

It has 3 csv files inside:

  • indexProcessed.csv
  • indexInfo.csv
  • indexData.csv

If you like to extract only one file from it - indexProcessed.csv then you can use next Python snippet:

import zipfile

path = '/home/myuser/Downloads/'

archive = zipfile.ZipFile(f'{path}archive.zip')

for file in archive.namelist():
    if file.startswith('indexProcessed.csv'):
        archive.extract(file, path)

The archive is located in the folder: /home/myuser/Downloads/ and the destination file will be extracted in the same place.

The code is using the library zipfile. It reads the archive and iterates over all files inside.

Then checks if the file starts with indexProcessed.csv if so it will extract it to the destination path.

In order to extract multiple files you can change the if condition to meet your needs.

Step 2: Unzip all files from zip archive

What if you like to** extract all files from a zip archive**. Again we are going to use zipfile. This time instead of method extract we will use extractall:

import zipfile

path = '/home/myuser/Downloads/'

zipfilepath = f'{path}archive.zip'
extractiondir = f'{path}/extract'

zip = zipfile.ZipFile(zipfilepath)
zip.extractall(path=extractiondir)

Again the archive file is located in /home/myuser/Downloads/. The output folder will be /home/myuser/Downloads/extract

Note that this method will extract the whole structure - all files and folder recursively.

Step 3: Unzip all files without keeping the structure

To unzip all files from a zip file without keeping the original structure you can use module os in order to change the destination folder as:

import zipfile
import os

path = '/home/myuser/Downloads/'
archive = f'{path}/archive.zip'

with zipfile.ZipFile(archive) as zipfile:
    for zip_info in zipfile.infolist():
        if zip_info.filename[-1] == '/':
            continue
        zip_info.filename = os.path.basename(zip_info.filename)
        zipfile.extract(zip_info, f'{path}/extract')

The code above will extract all files into the destination folder /home/myuser/Downloads/extract recursively.

Step 4: Unzip all CSV files from zip archive

Finally let's check how to extract only CSV files from archive. This can be done by filtering files by CSV extension. We will use method endswith:

import zipfile

path = '/home/myuser/Downloads/'

archive = zipfile.ZipFile(f'{path}archive.zip')

for file in archive.namelist():
    if file.endswith('.csv'):
        archive.extract(file, path)