bbox
A bbox, short for bounding box, is a rectangular region that encloses a set of points or an area in two-dimensional space. It is a basic, widely used construct in fields such as computer vision, image processing, geographic information systems (GIS), and document analysis. A bounding box typically encodes the location and size of a region of interest.
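As a minimal sketch of the "encloses a set of points" idea, the smallest axis-aligned rectangle around a point set can be found by taking the minimum and maximum of each coordinate. The function name `bounding_box` and the `(xmin, ymin, xmax, ymax)` return order are illustrative choices here, not part of any particular library.

```python
from typing import Iterable, Tuple

Point = Tuple[float, float]
BBox = Tuple[float, float, float, float]  # assumed order: (xmin, ymin, xmax, ymax)


def bounding_box(points: Iterable[Point]) -> BBox:
    """Return the smallest axis-aligned rectangle enclosing all points."""
    pts = list(points)
    if not pts:
        # An empty point set has no well-defined bounding box.
        raise ValueError("cannot compute the bounding box of an empty point set")
    xs = [p[0] for p in pts]
    ys = [p[1] for p in pts]
    return (min(xs), min(ys), max(xs), max(ys))


if __name__ == "__main__":
    print(bounding_box([(1, 5), (4, 2), (3, 7)]))
```

Raising on an empty input is one possible design choice; some code bases return a sentinel "empty" box instead.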
In computer vision, the axis-aligned bounding box (AABB) is the most common variant. It is defined by
In GIS, a bounding box (or extent) describes the rectangular geographic area that encloses a feature or region, defined by minimum
In document analysis and OCR, bounding boxes delineate regions like words, lines, or figures, enabling segmentation,
Common representations include (xmin, ymin, xmax, ymax) and (left, top, width, height), with conventions varying by
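The two representations named above carry the same information and can be converted into each other. The sketch below assumes image-style coordinates, with the origin at the top-left and y increasing downward, so that `left` equals `xmin` and `top` equals `ymin`; the function names are hypothetical, and other coordinate conventions would need different arithmetic.

```python
from typing import Tuple

Box = Tuple[float, float, float, float]


def corners_to_ltwh(box: Box) -> Box:
    """Convert (xmin, ymin, xmax, ymax) to (left, top, width, height)."""
    xmin, ymin, xmax, ymax = box
    # Assumes y grows downward, so the top edge is the smaller y value.
    return (xmin, ymin, xmax - xmin, ymax - ymin)


def ltwh_to_corners(box: Box) -> Box:
    """Convert (left, top, width, height) to (xmin, ymin, xmax, ymax)."""
    left, top, width, height = box
    return (left, top, left + width, top + height)


if __name__ == "__main__":
    corners = (10, 20, 50, 80)
    ltwh = corners_to_ltwh(corners)
    print(ltwh)
    print(ltwh_to_corners(ltwh) == corners)
```

Because the element order alone does not reveal which convention a tuple uses, it is usually worth stating the format explicitly wherever boxes cross an API boundary.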