seqlengths
Seqlengths is a metadata vector used in genomic data objects to record the lengths of each sequence in a genome or assembly. It is typically implemented as a named integer vector where names are sequence identifiers (such as chromosome or contig names) and values are their lengths in base pairs. In Bioconductor, seqlengths is part of the Seqinfo object that accompanies genomic ranges and other sequence-aware data structures.
Purpose and usage: seqlengths enables correct handling of genomic coordinates, validation of ranges, and coordinate-based operations
Storage and access: For objects like GRanges, the seqinfo slot stores seqlengths; you can access or modify
Considerations: Ensure the seqlengths vector matches the actual sequences used; missing or incorrect lengths can cause
Related concepts: Seqinfo, GRanges, GenomicRanges, and other sequence-aware data structures in Bioconductor rely on seqlengths to