Independently determining adjustments to bounding shapes for detected objects in image data

US12008792B1 · US · B1

Patent metadata
FieldValue
Publication numberUS-12008792-B1
Application numberUS-202117548380-A
CountryUS
Kind codeB1
Filing dateDec 10, 2021
Priority dateDec 10, 2021
Publication dateJun 11, 2024
Grant dateJun 11, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Adjustments to bounding shapes for detected objects in image data may be independently determined. A bounding shape for an object detected in image data may be obtained. Independently determined adjustments for one or more edges of the bounding shape may be determined according to a multi-scale feature map generated from different resolutions of the image data that is provided as input to different dimension decoders to determine the adjustments to the bounding shape and respective confidence scores for the adjustments. The confidence scores are evaluated with respect to a confidence threshold to determine whether to provide the adjustments to the one or more edges of the bounding shape.

First claim

Opening claim text (preview).

What is claimed is: 1. A system, comprising: at least one processor; and a memory, storing program instructions that when executed by the at least one processor, causes the at least one processor to: receive a bounding box for an object detected in image data; independently determine one or more adjustments to one or more edges of the bounding box according to a multi-scale feature map generated from different resolutions of the image data that is provided as input to different dimension decoders to determine the one or more adjustments to the bounding box and generate respective confidence scores for the one or more adjustments to the bounding box; and determine that the confidence scores for the one or more adjustments to the one or more edges of the bounding box satisfies a confidence threshold; and display an updated version of the bounding box according to the one or more adjustments to the one or more edges of the bounding box after the determination that the confidence scores for the one or more adjustments to the one or more edges of the bounding box satisfy the confidence threshold. 2. The system of claim 1 , wherein the memory stores further program instructions that when executed further cause the at least one processor to: determine that a difference between the bounding box and a version of the bounding described by the one or more adjustments is above a difference threshold; and wherein the display of the updated version of the bounding box according to the one or more adjustments to the one or more edges of the bounding box is performed after the determination that the difference between the bounding box and the version of the bounding box described by the one or more adjustments is above the difference threshold. 3. The system of claim 1 , wherein the bounding box is received as draw operation via a graphical user interface, and wherein the display of the updated version of the bounding box according to the one or more adjustments to the one or more edges of the bounding box automatically updates the drawn bounding box in the graphical user interface. 4. The system of claim 1 , wherein the at least one processor and the memory is implemented as part of a machine learning service offered by a provider network that displays the updated version of the bounding box via a labeling interface for the machine learning service. 5. A method, comprising: obtaining, at an image labeling system, a bounding shape for an object detected in image data; independently determining, by the image labeling system, one or more adjustments to one or more edges of the bounding shape according to a multi-scale feature map generated from different resolutions of the image data that is provided as input to different dimension decoders to determine the one or more adjustments to the one or more edges of the bounding shape and generate respective confidence scores for the one or more adjustments to the bounding shape; and after determining that the respective confidence scores for the one or more adjustments to the one or more edges of the bounding shape satisfy a confidence threshold, providing, by the image labeling system, the one or more adjustments to the one or more edges of the bounding shape. 6. The method of claim 5 , further comprising: determining that a difference between the bounding shape and a version of the bounding shape described by the one or more adjustments is above a difference threshold; and wherein the providing of the one or more adjustments to the one or more edges of the bounding shape is performed after the determination that the difference between the bounding box and the version of the bounding shape described by the one or more adjustments is above the difference threshold. 7. The method of claim 6 , wherein the bounding shape for the object detected in image data is obtained via a validation interface. 8. The method of claim 5 , wherein the bounding shape is obtained as draw operation via a graphical user interface of the image labeling system, and wherein providing the one or more adjustments to the one or more edges of the bounding shape comprises automatically updating the drawn bounding shape in the graphical user interface with an updated version of the bounding shape according to the one or more adjustments to the one or more edges of the bounding shape. 9. The method of claim 5 , wherein another adjustment determined for another edge of the bounding shape different from the one or more edges is not provided after a determination that a confidence score for the other edge does not satisfy the confidence threshold. 10. The method of claim 5 , wherein independently determining the one or more adjustments to the one or more edges of the second bounding shape comprises: cropping the image data, including the bounding shape for the detected object in the cropped image data; applying a convolutional neural network (CNN) machine learning model to extract features of the cropped image data at the different resolutions; combining the extracted features at the different resolutions to generate the multi-scale feature map; applying vertical pooling and a one dimensional decoder to the multi-scale feature map to generate new left and right edges for the bounding shape; applying horizontal pooling and another one dimensional decoder to the multi-scale feature map to generate new top and bottom edges for the bounding shape; and wherein the new left and right edges and the new top and bottom edges are the one or more adjustments to the bounding shape. 11. The method of claim 5 , wherein the bounding shape is a polygon. 12. The method of claim 5 , wherein providing the one or more adjustments to the one or more edges of the bounding shape comprises displaying an updated version of the bounding shape determined according to the one or more adjustments to the one or more edges. 13. The method of claim 5 , wherein providing the one or more adjustments to the one or more edges of the bounding shape comprises storing the one or more adjustments to the one or more edges as a new version of the bounding shape as part of a training data set. 14. One or more non-transitory, computer-readable storage media, storing program instructions that when executed on or across one or more computing devices cause the one or more computing devices to implement: obtaining a bounding shape for an object detected in image data; independently determining one or more adjustments to one or more edges of the bounding shape according to a multi-scale feature map generated from different resolutions of the image data that is provided as input to different dimension decoders to determine the one or more adjustments to the bounding shape and generate respective confidence scores for the one or more adjustments to the bounding shape; and determining that respective confidence scores for the one or more adjustments to the one or more edges of the bounding shape satisfies a confidence threshold; and displaying the one or more adjustments to the one or more edges of the bounding shape after determining that the confidence scores for the one or more adjustments to the one or more edges of the bounding shape satisfy the confidence threshold. 15. The one or more non-transitory, computer-readable storage media of claim 14 , storing further instructions that when executed on or across the one or more computing devices, cause the one or more computing devices to further implement: determining that a difference between the bounding shape and a version of the bounding shape described by the one or more adjustments is above a differ

Assignees

Inventors

Classifications

  • Edge detection · CPC title

  • using neural networks · CPC title

  • G06V10/235Primary

    based on user input or interaction · CPC title

  • Determination of region of interest [ROI] or a volume of interest [VOI] · CPC title

  • G06N3/08Primary

    Learning methods · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12008792B1 cover?
Adjustments to bounding shapes for detected objects in image data may be independently determined. A bounding shape for an object detected in image data may be obtained. Independently determined adjustments for one or more edges of the bounding shape may be determined according to a multi-scale feature map generated from different resolutions of the image data that is provided as input to diffe…
Who is the assignee on this patent?
Amazon Tech Inc
What technology area does this patent fall under?
Primary CPC classification G06V10/235. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jun 11 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).