outputs
ModelForImageDetectionOutput
Bases: TypedDict
The output for image detection models.
Source code in glasses/models/vision/image/detection/outputs.py
bboxes: Tensor
class-attribute
A torch.Tensor
of shape (batch_size, num_bboxes, 4)
.
logits: Tensor
class-attribute
A torch.Tensor
of shape (batch_size, num_bboxes, num_classes + 1)
.