System and method for deep network fusion for fast and robust object detection

US10657364B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10657364-B2
Application numberUS-201715434880-A
CountryUS
Kind codeB2
Filing dateFeb 16, 2017
Priority dateSep 23, 2016
Publication dateMay 19, 2020
Grant dateMay 19, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method and apparatus are provided. The method includes receiving an image, detecting an object in the image, determining, by a primary object detector, a primary confidence detection score of the object, determining, by a classification network, a confidence scaling factor of the object, and adjusting the primary confidence detection score based on multiplying the primary confidence detection score by the confidence scaling factor.

First claim

Opening claim text (preview).

What is claimed is: 1. A method, comprising: receiving an image; detecting an object in the image; determining, by a primary object detector, a primary confidence detection score of the object; determining, by a classification network, a confidence scaling factor of the object; adjusting the primary confidence detection score based on multiplying the primary confidence detection score by the confidence scaling factor; generating a semantic segmentation mask based on classifying each pixel into one of pre-determined classes; determining an amount of overlap, if any, between a bounding box of the object and the semantic segmentation mask; and adjusting the primary confidence detection score and eliminating a secondary object detection from the semantic segmentation mask if there is no overlap between the bounding box of the object and the semantic segmentation mask, wherein the semantic segmentation mask is a binary mask generated with one color showing activated pixels in the image and another color showing a background in the image. 2. The method of claim 1 , further comprising: scaling the primary confidence detection score based on multiplying the primary confidence detection score by a product of a plurality of confidence scaling factors of the object from a plurality of classification networks. 3. The method of claim 2 , further comprising: setting an object detection to a fixed size using the scaled primary confidence detection score; determining random crops with the fixed size from the scaled primary confidence detection score; and labelling a class of the random crops with respect to ground truth to train the classification network. 4. The method of claim 1 , wherein the confidence scaling factor is based on a ratio of a classification probability of the object to a desired maximum classification probability. 5. The method of claim 1 , wherein the confidence scaling factor is set to at least a desired minimum value. 6. The method of claim 1 , further comprising determining a secondary confidence detection score of the object; maintaining the primary confidence detection score if the secondary confidence detection score is greater than or equal to a threshold; and adjusting the primary confidence detection score if the secondary confidence detection score is less than the threshold. 7. The method of claim 6 , wherein adjusting the primary confidence detection score if the secondary confidence detection score is less than the threshold comprises multiplying the primary confidence detection score by a product of the secondary confidence detection score and a plurality of confidence scaling factors. 8. The method of claim 6 , wherein adjusting the primary confidence detection score if the secondary confidence detection score is less than the threshold comprises multiplying the adjusted primary confidence detection score by a product of the secondary confidence detection score and a plurality of confidence scaling factors. 9. The method of claim 1 , further comprising adjusting the primary confidence detection score in proportion to an amount of overlap if the amount of overlap between a bounding box of the object and a semantic segmentation mask is less than a threshold. 10. The method of claim 1 , further comprising maintaining the primary confidence detection score if an amount of overlap between a bounding box of the object and a semantic segmentation mask is greater than or equal to a threshold. 11. The method of claim 1 , further comprising operating the primary object detector and the classification network in parallel. 12. An apparatus, comprising: an image sensor; a primary object detector; a classification network; and a processor configured to: capture an image from the image sensor, detect an object in the image, determine a primary confidence detection score of the object using the primary object detector, determine a confidence scaling factor of the object using the classification network, adjust the primary confidence detection score based on multiplying the primary confidence detection score by the confidence scaling factor, generate a semantic segmentation mask based on classifying each pixel into one of pre-determined classes, determine an amount of overlap, if any, between a bounding box of the object and the semantic segmentation mask; and adjust the primary confidence detection score and eliminate a secondary object detection from the semantic segmentation mask if there is no overlap between the bounding box of the object and the semantic segmentation mask, wherein the semantic segmentation mask is a binary mask generated with one color showing activated pixels in the image and another color showing a background in the image. 13. The apparatus of claim 12 , wherein the processor is further configured to scale the primary confidence detection score based on multiplying the primary confidence detection score by a product of a plurality of confidence scaling factors of the object from a plurality of classification networks. 14. The apparatus of claim 12 , wherein the confidence scaling factor is based on a ratio of a classification probability of the object to a desired maximum classification probability. 15. The apparatus of claim 12 , wherein the confidence scaling factor is set to at least a desired minimum value. 16. The apparatus of claim 12 , wherein the processor is further configured to determine a secondary confidence detection score of the object, maintain the primary confidence detection score if the secondary confidence detection score is greater than or equal to a threshold, and adjust the primary confidence detection score if the secondary confidence detection score is less than the threshold. 17. The apparatus of claim 16 , wherein adjusting the primary confidence detection score if the secondary confidence detection score is less than the threshold comprises multiplying the primary confidence detection score by a product of the secondary confidence detection score and a plurality of confidence scaling factors. 18. The apparatus of claim 16 , wherein adjusting the primary confidence detection score if the secondary confidence detection score is less than the threshold comprises multiplying the adjusted primary confidence detection score by a product of the secondary confidence detection score and a plurality of confidence scaling factors. 19. The apparatus of claim 12 , wherein the processor is further configured to adjust the primary confidence detection score in proportion to an amount of overlap if the amount of overlap between a bounding box of the object and a semantic segmentation mask is less than a threshold. 20. The apparatus of claim 12 , wherein the processor is further configured to maintain the primary confidence detection score if an amount of overlap between a bounding box of the object and a semantic segmentation mask is greater than or equal to a threshold. 21. The apparatus of claim 12 , wherein the processor is further configured to operate the primary object detector and the classification network in parallel. 22. The apparatus of claim 12 , wherein the processor is further configured to: set an object detection to a fixed size using the scaled primary confidence detection score, determine random crops with the fixed size from the scaled primary confidence detection score, and label a class of the random crops with respect to ground truth to train the classification network.

Assignees

Inventors

Classifications

  • Design verification or optimisation, e.g. using design rule check [DRC], layout versus schematics [LVS] or finite element methods [FEM] (optical proximity correction [OPC] design processes G03F1/36) · CPC title

  • Still image; Photographic image · CPC title

  • Human being; Person · CPC title

  • Artificial neural networks [ANN] · CPC title

  • Region-based segmentation · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10657364B2 cover?
A method and apparatus are provided. The method includes receiving an image, detecting an object in the image, determining, by a primary object detector, a primary confidence detection score of the object, determining, by a classification network, a confidence scaling factor of the object, and adjusting the primary confidence detection score based on multiplying the primary confidence detection…
Who is the assignee on this patent?
Samsung Electronics Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06T7/73. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 19 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 7 related publications on this page (citations in our corpus or others sharing the same primary CPC).