API Reference (auto)

`BBoxXYWHNorm`

Bases: BaseModel

Canonical TL-normalized bbox: [x, y, w, h] with invariants.

Convert canonical bbox to pixel XYXY coordinates.

Convert pixel XYXY coordinates to canonical bbox.

Intersection over Union for two xyxy pixel-space boxes.

Returns 0.0 when there is no overlap (including edge-touching).

Area of a pixel-space xyxy rectangle.

Negative extents are clamped to zero to be robust to degenerate inputs.

Intersection rectangle of two xyxy boxes or None if disjoint/touching.

Clamp an xyxy box to image bounds [0,W] x [0,H].

Ordering is preserved so that the output always has x1 <= x2 and y1 <= y2.

Convert normalized TL-xywh to pixel TL-xywh using image dimensions.

Uses simple scaling without rounding to preserve fractional pixel information.

Convert pixel TL-xywh to normalized TL-xywh using image dimensions.

Values are clamped to [0,1] to avoid float precision edge cases, with a minimal positive width/height enforced by the BBoxXYWHNorm model.

Convert pixel-space xyxy → pixel TL-xywh (no clamping).

Convert pixel TL-xywh → pixel-space xyxy (no clamping).

Group detections by frame_number preserving per-frame order.

Returns a dict with integer keys sorted in ascending order.

Return pixel xyxy tuple for a Detection using image dimensions.

Prefers the canonical bbox_norm if present. If only the legacy bbox (pixel xywh) is provided, falls back to converting that to xyxy.