zarr
Zarr is an open format and software ecosystem for the storage of chunked, compressed, N-dimensional arrays. It is designed to enable scalable analytics on large datasets, particularly in cloud environments and high-performance computing, by allowing parallel and selective input/output without requiring a single monolithic file.
In Zarr, an array consists of chunks laid out in a multidimensional grid. Each chunk is stored
Backends for Zarr include local filesystems and object stores such as Amazon S3, Google Cloud Storage, or
Ecosystem and bindings: the Python reference implementation is the zarr library (often referred to as PyZarr),
History and usage: Zarr was developed to address limitations of traditional formats like HDF5 for cloud storage