seqlengthsobject
seqlengthsobject is a data structure used in bioinformatics to store the lengths of a set of sequences, typically identifiers such as chromosomes or contigs and their corresponding lengths from a genome assembly or sequencing dataset. It serves as a compact reference for coordinate arithmetic, range validation, and indexing, without storing the full sequences themselves.
Representation and implementations
In the Bioconductor ecosystem (R), the standard representation is a named integer vector called seqlengths, for
Seqlengths can be created from assembly metadata, parsed from sequence headers, or derived from existing genomic
Seqlengthsobjects support coordinate-based operations, interval queries, and alignment workflows where coordinates must be validated against known