Measures the spatial overlap between a predicted bounding box and one or more reference boxes. This metric is commonly used in object detection and layout tasks to quantify how well the predicted region aligns with the ground truth.
actual_output
: A single predicted bounding box in the format.expected_output
: A list of lists of ground truth boxes, each also in the format.