Cloud-Optimized ZIP reader for DuckDB
Installing and Loading
INSTALL cozip FROM community;
LOAD cozip;
Example
INSTALL cozip FROM community;
LOAD cozip;
SELECT *
FROM read_cozip('https://huggingface.co/datasets/Major-TOM/Core-VIIRS-Nighttime-Light/resolve/main/2024/MAJORTOM-VIIRS-NTL_2024_median_000.zip')
LIMIT 10;
About cozip
cozip replaces the ZIP Central Directory scan with a Parquet metadata file located through a fixed 51-byte header at byte 0 of the archive. read_cozip(path) reads that Parquet directly through a virtual cozip-subfile filesystem, so range requests flow lazily through the underlying transport. Works on local files and remote URLs (HTTPS, S3, GCS, Azure, HuggingFace), on native and WebAssembly. Every row gets an extra cozip:gdal_vsi column with a /vsisubfile/ path that opens the referenced inner file in GDAL or rasterio without re-downloading the archive.
Added Functions
| function_name | function_type | description | comment | examples |
|---|---|---|---|---|
| cozip_offset_size | scalar | NULL | NULL | |
| cozip_vsi_base | scalar | NULL | NULL | |
| read_cozip | table_macro | NULL | NULL |
Overloaded Functions
This extension does not add any function overloads.
Added Types
This extension does not add any types.
Added Settings
This extension does not add any settings.