Accelerated non-maximum suppression in machine learning applications

US11989948B1 · US · B1

Patent metadata
Publication number: US-11989948-B1
Application number: US-202117500167-A
Country: US
Kind code: B1
Filing date: Oct 13, 2021
Priority date: Oct 13, 2021
Publication date: May 21, 2024
Grant date: May 21, 2024

Abstract

Official abstract text for this publication.

Apparatuses, systems, and techniques to perform non-maximum suppression (NMS) with a bit-reduced radix sort to remove redundant bounding boxes are described. In at least one embodiment, one or more circuits perform i) a bit-reduced radix sort operation to sort a list of confidence scores associated with a set of bounding boxes corresponding to one or more objects within one or more digital images and ii) a non-maximum suppression (NMS) operation on the sorted list to remove one or more redundant bounding boxes from the set.
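
The suppression step the abstract refers to can be illustrated with a minimal greedy NMS sketch. This is not the patented implementation: the intersection-over-union (IoU) overlap test, the `iou_threshold` value of 0.5, the `(x1, y1, x2, y2)` box layout, and the function names are all assumptions made for illustration, and Python's built-in `sorted` stands in for the bit-reduced radix sort.

```python
def iou(a, b):
    """Intersection-over-union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def greedy_nms(boxes, scores, iou_threshold=0.5):
    """Return indices of the boxes kept after greedy suppression.

    Boxes are visited in descending score order; a box is kept only if it
    does not overlap any already-kept box by more than iou_threshold.
    """
    # Stand-in sort (assumption): any descending argsort works here.
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    keep = []
    for i in order:
        if all(iou(boxes[i], boxes[j]) <= iou_threshold for j in keep):
            keep.append(i)
    return keep


boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
scores = [0.9, 0.8, 0.7]
kept = greedy_nms(boxes, scores)  # the second box overlaps the first and is dropped
```

Because the loop only needs the visiting order, the sort can be swapped for a faster one without touching the suppression logic, which is the part of the pipeline the abstract targets.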

First claim

Opening claim text (preview).

What is claimed is:

1. A method of performing a non-maximum suppression (NMS) algorithm, the method comprising: identifying a plurality of bounding boxes corresponding to one or more objects associated with one or more digital images, wherein each of the plurality of bounding boxes is associated with a confidence score in a first set of confidence scores; transforming the first set of confidence scores into a second set of confidence scores within a specified interval, wherein a first portion of each confidence score in the second set of confidence scores is the same; sorting the confidence scores in the second set of confidence scores in a descending order according to a remaining portion of each confidence score in the second set of confidence scores; and performing a first suppression operation on the second set of confidence scores to remove one or more redundant bounding boxes from the plurality of bounding boxes.

2. The method of claim 1, wherein each of the second set of confidence scores comprises the first portion, a second portion, and a third portion, wherein the first portion of each of the second set of confidence scores is the same and the second portion of each of the second set of confidence scores is the same.

3. The method of claim 1, wherein each of the first set of confidence scores is represented as a floating-point number according to the IEEE 754 format, wherein the floating-point number comprises a sign bit, a biased-exponent field, and a mantissa field, wherein each of the second set of confidence scores comprises same values for the sign bit and the biased-exponent field after the transforming.

4. The method of claim 3, wherein sorting the confidence scores in the second set of confidence scores comprises executing an argsort function on only mantissa bits of the mantissa fields of the second set of confidence scores.

5. The method of claim 3, wherein sorting the confidence scores in the second set of confidence scores comprises executing an argsort function on only a portion of mantissa bits of the mantissa fields of the second set of confidence scores.

6. The method of claim 3, wherein each of the first set of confidence scores is in an interval between zero and one, wherein the specified interval is between one and two, which is clipped by a number of mantissa bits in the mantissa field.

7. The method of claim 1, wherein each of the first set of confidence scores is in an interval between zero and one, wherein the specified interval is a half-open interval between one and two but excluding two.

8. The method of claim 1, wherein the second set of confidence scores is part of a plurality of classes, and wherein the first suppression operation comprises selecting a specified number of bounding boxes having the highest confidence scores in the second set of confidence scores.

9. The method of claim 1, wherein the first set of confidence scores is part of a first class of objects, and wherein the method further comprises: identifying a second plurality of bounding boxes corresponding to one or more objects associated with the one or more digital images, wherein each of the second plurality of bounding boxes is associated with a confidence score in a third set of confidence scores, wherein the third set of confidence scores is part of a second class of objects; transforming the third set of confidence scores into a fourth set of confidence scores within the specified interval, wherein a first portion of each confidence score in the fourth set of confidence scores is the same; sorting the confidence scores in the fourth set of confidence scores in a descending order according to a remaining portion of each confidence score in the fourth set of confidence scores; and performing a second suppression operation on the fourth set of confidence scores to remove one or more redundant bounding boxes from the second plurality of bounding boxes.

10. The method of claim 9, further comprising: combining the remaining bounding boxes associated with the second set of confidence scores and the fourth set of confidence scores into a third plurality of bounding boxes, wherein each of the third plurality of bounding boxes is associated with a confidence score in a fifth set of confidence scores; sorting the confidence scores in the fifth set of confidence scores in a descending order according to a remaining portion of each confidence score in the fifth set of confidence scores; and performing a third suppression operation on the fourth set of confidence scores to remove one or more redundant bounding boxes from the second plurality of bounding boxes.

11. A system comprising: a memory device; and a processing unit coupled to the memory device, the processing unit to: identify a plurality of bounding boxes corresponding to one or more objects associated with one or more digital images, wherein each of the plurality of bounding boxes is associated with a confidence score in a first set of confidence scores; transform the first set of confidence scores into a second set of confidence scores within a specified interval, wherein a first portion of each confidence score in the second set of confidence scores is the same; sort the confidence scores of the second set of confidence scores in a descending order according to a remaining portion of each confidence score in the second set of confidence scores; and perform a first suppression operation on the second set of confidence scores in the descending order to remove one or more redundant bounding boxes from the plurality of bounding boxes.

12. The system of claim 11, wherein each of the second set of confidence scores comprises the first portion, a second portion, and a third portion, wherein the first portion of each of the second set of confidence scores is the same and the second portion of each of the second set of confidence scores is the same.

13. The system of claim 11, wherein each of the first set of confidence scores is represented as a floating-point number according to the IEEE 754 format, wherein the floating-point number comprises a sign bit, a biased-exponent field, and a mantissa field, wherein each of the second set of confidence scores comprises same values for the sign bit and the biased-exponent field after the transforming.

14. The system of claim 13, wherein the processing unit is to sort the confidence scores in the second set of confidence scores by executing an argsort function on only mantissa bits of the mantissa fields of the second set of confidence scores.

15. The system of claim 13, wherein the processing unit is to sort the confidence scores in the second set of confidence scores by executing an argsort function on only a portion of mantissa bits of the mantissa fields of the second set of confidence scores.

16. The system of claim 13, wherein each of the first set of confidence scores is in an interval between zero and one, wherein the specified interval is between one and two, which is clipped by a number of mantissa bits in the mantissa field.

17. The system of claim 11, wherein the second set of confidence scores is part of a plurality of classes, and wherein the first suppression operation comprises selecting a specified number of bounding boxes having the highest confidence scores in the second set of confidence scores.

18. The system of claim 11, wherein the first set of confidence scores is part of a first class of objects, and wherein the processing unit is further to: identify a second plurality of bounding boxes corresponding to one or more
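
Claims 3 through 7 describe shifting scores from the interval between zero and one into the half-open interval between one and two, so that every score shares the same IEEE 754 sign bit and biased exponent and only mantissa bits need to be sorted. A rough sketch of that idea follows, assuming 32-bit floats and a least-significant-digit radix sort with 8-bit digits. The helper names (`mantissa_key`, `radix_argsort_desc`, `sort_scores_desc`), the `1.0 + score` mapping, and the digit width are illustrative assumptions, not details taken from the patent.

```python
import struct

MANTISSA_BITS = 23            # mantissa width of IEEE 754 binary32
BELOW_TWO_BITS = 0x3FFFFFFF   # bit pattern of the largest float32 below 2.0


def mantissa_key(score, kept_bits=MANTISSA_BITS):
    """Map a score in [0, 1] into [1, 2) and return its top mantissa bits."""
    score = min(max(score, 0.0), 1.0)
    # Reinterpret the float32 value 1.0 + score as an unsigned 32-bit integer.
    bits = struct.unpack("<I", struct.pack("<f", 1.0 + score))[0]
    # Clip so that a score of exactly 1.0 stays below 2.0 (same exponent).
    bits = min(bits, BELOW_TWO_BITS)
    mantissa = bits & ((1 << MANTISSA_BITS) - 1)
    return mantissa >> (MANTISSA_BITS - kept_bits)


def radix_argsort_desc(keys, key_bits, digit_bits=8):
    """Stable LSD radix argsort of unsigned keys, largest key first."""
    order = list(range(len(keys)))
    mask = (1 << digit_bits) - 1
    top = (1 << key_bits) - 1
    for shift in range(0, key_bits, digit_bits):
        buckets = [[] for _ in range(mask + 1)]
        for i in order:
            # Sorting the inverted key ascending yields descending order.
            buckets[((top - keys[i]) >> shift) & mask].append(i)
        order = [i for bucket in buckets for i in bucket]
    return order


def sort_scores_desc(scores, kept_bits=MANTISSA_BITS):
    """Argsort scores in descending order using only mantissa bits."""
    keys = [mantissa_key(s, kept_bits) for s in scores]
    return radix_argsort_desc(keys, kept_bits)


order = sort_scores_desc([0.2, 0.9, 0.5, 1.0, 0.0])
```

With 8-bit digits, sorting all 23 mantissa bits takes three passes, where a full 32-bit key would take four. Passing a smaller `kept_bits` (the partial-mantissa case of claims 5 and 15) reduces the pass count further, at the cost of treating scores that differ only in the discarded low bits as ties.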

Assignees

Nvidia Corp

Inventors

Classifications

  • G06V20/58 (Primary)

    Recognition of moving objects or obstacles, e.g. vehicles or pedestrians; Recognition of traffic objects, e.g. traffic signs, traffic lights or roads · CPC title

  • Determination of region of interest [ROI] or a volume of interest [VOI] · CPC title

  • using classification, e.g. of video objects · CPC title

  • using neural networks · CPC title

  • Sorting, i.e. extracting data from one or more carriers, rearranging the data in numerical or other ordered sequence, and rerecording the sorted data on the original carrier or on a different carrier or set of carriers {sorting methods in general} (G06F7/36 takes precedence) · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

Who is the assignee on this patent?
Nvidia Corp
What technology area does this patent fall under?
Primary CPC classification G06V20/58. Mapped technology areas include Physics.
When was this patent published?
Publication date: May 21, 2024 (kind code B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).