BGZF
BGZF, short for Block GZip Format, is a gzip-compatible container format designed to provide random access to compressed data. It achieves this by dividing the compressed stream into independently decompressible blocks and by including per-block metadata that supports efficient seeking. Each BGZF block contains a standard gzip wrapper around a DEFLATE-compressed payload, and the uncompressed data in a single block is limited to 64 kilobytes. The blocks are concatenated to form a BGZF file, and the block-level metadata, together with a file-wide index, enables readers to jump directly to a given block without decompressing data preceding it.
To support random access, BGZF files are typically accompanied by an index that maps logical positions to
Usage and tooling commonly appear in bioinformatics. BGZF is widely used for genomics data formats such as