outputs
ModelForImageDetectionOutput
Bases: TypedDict
The output for image detection models.
Source code in glasses/models/vision/image/detection/outputs.py
bboxes: Tensor
class-attribute
A torch.Tensor of shape (batch_size, num_bboxes, 4).
logits: Tensor
class-attribute
A torch.Tensor of shape (batch_size, num_bboxes, num_classes + 1).