Function for translating the min/max ground truth box coordinates to something expected by the SSD network. This is a SSD-specific analog for keras::to_categorical(). For each image in the batch, we compare the ground truth boxes for that image with all the anchor boxes. If the overlap measure exceeds a specific threshold, we write the ground truth box coordinates and class to the specific position of the matched anchor box. Note that the background class will be assigned to all the anchor boxes for which there was no match with any ground truth box. However, an exception to this are the anchor boxes whose overlap measure is higher that the specified negative threshold.
encodeSsd3D( groundTruthLabels, anchorBoxes, imageSize, variances = rep(1, 6), foregroundThreshold = 0.5, backgroundThreshold = 0.2 )
groundTruthLabels | A list of length |
---|---|
anchorBoxes | a list of 2-D arrays where each element comprises the anchor boxes for a specific aspect ratios layer. The row of each 2-D array comprises a single box specified in the for (xmin,xmax,ymin,ymax,zmin,zmax). |
imageSize | 3-D vector specifying the spatial domain of the input images. |
variances | A list of 6 floats > 0 with scaling factors (actually it's
not factors but divisors to be precise) for the encoded predicted box
coordinates. A variance value of 1.0 would apply no scaling at all to the
predictions, while values in (0,1) upscale the encoded predictions and
values greater than 1.0 downscale the encoded predictions. These are the same
variances used to construct the model. Default =
|
foregroundThreshold | float between 0 and 1 determining the min threshold
for matching an anchor box with a ground truth box and, thus, labeling an anchor
box as a non-background class. If an anchor box exceeds the |
backgroundThreshold | float between 0 and 1 determining the max threshold
for labeling an anchor box as |
a 3-D array of shape (batchSize
, numberOfBoxes
,
numberOfClasses
+ 6 + 6 + 6)
where the additional 6's along the third dimension correspond to
the 6 predicted box coordinate offsets, the 6 coordinates for
the anchor boxes, and the 6 variance values.
This particular implementation was heavily influenced by the following python and R implementations:
\url{https://github.com/pierluigiferrari/ssd_keras} \url{https://github.com/rykov8/ssd_keras} \url{https://github.com/gsimchoni/ssdkeras}
Tustison NJ