Search Shortcut cmd + k | ctrl + k
cozip

Cloud-Optimized ZIP reader for DuckDB

Maintainer(s): csaybar, ryali93

Installing and Loading

INSTALL cozip FROM community;
LOAD cozip;

Example

INSTALL cozip FROM community;
LOAD cozip;

SELECT *
FROM read_cozip('https://huggingface.co/datasets/Major-TOM/Core-VIIRS-Nighttime-Light/resolve/main/2024/MAJORTOM-VIIRS-NTL_2024_median_000.zip')
LIMIT 10;

About cozip

cozip replaces the ZIP Central Directory scan with a Parquet metadata file located through a fixed 51-byte header at byte 0 of the archive. read_cozip(path) reads that Parquet directly through a virtual cozip-subfile filesystem, so range requests flow lazily through the underlying transport. Works on local files and remote URLs (HTTPS, S3, GCS, Azure, HuggingFace), on native and WebAssembly. Every row gets an extra cozip:gdal_vsi column with a /vsisubfile/ path that opens the referenced inner file in GDAL or rasterio without re-downloading the archive.

Added Functions

function_name function_type description comment examples
cozip_offset_size scalar NULL NULL  
cozip_vsi_base scalar NULL NULL  
read_cozip table_macro NULL NULL  

Overloaded Functions

This extension does not add any function overloads.

Added Types

This extension does not add any types.

Added Settings

This extension does not add any settings.