Real-time low latency computer vision/machine learning compute accelerator with smart convolutional neural network scheduler

US11816871B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11816871-B2
Application numberUS-202017138817-A
CountryUS
Kind codeB2
Filing dateDec 30, 2020
Priority dateDec 30, 2020
Publication dateNov 14, 2023
Grant dateNov 14, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods and devices are provided for processing image data on a sub-frame portion basis using layers of a convolutional neural network. The processing device comprises memory and a processor. The processor is configured to receive frames of image data comprising sub-frame portions, schedule a first sub-frame portion of a first frame to be processed by a first layer of the convolutional neural network when the first sub-frame portion is available for processing, process the first sub-frame portion by the first layer and continue the processing of the first sub-frame portion by the first layer when it is determined that there is sufficient image data available for the first layer to continue processing of the first sub-frame portion. Processing on a sub-frame portion basis continues for subsequent layers such that processing by a layer can begin as soon as sufficient data is available for the layer.

First claim

Opening claim text (preview).

What is claimed is: 1. An image processing method comprising: receiving a plurality of frames of image data, the plurality of frames comprising sub-frame portions of image data; scheduling a first sub-frame portion of a first frame to be processed by a first layer of a convolutional neural network when the first sub-frame portion is available for processing; processing the first sub-frame portion by the first layer; and continuing the processing of the first sub-frame portion by the first layer when it is determined that there is sufficient image data available for the first layer to continue processing of the first sub-frame portion. 2. The method of claim 1 , wherein the plurality of frames of image data are divided into a plurality of sub-frame portions, and the sub-frame portions are one of slices and tiles. 3. The method of claim 1 , wherein the first sub-frame portion comprises data for a plurality of pixels, and the first sub-frame portion is available for processing when data for each of the pixels of the first sub-frame portion is in memory. 4. The method of claim 1 , further comprising determining that there is sufficient image data available for the first layer to continue processing the first sub-frame portion when a second sub-frame portion is available for processing by the first layer. 5. The method of claim 4 , wherein the first layer of the convolutional neural network is a convolution layer, and the processing of the first sub-frame portion by the convolutional layer is continued by convolving the first sub-frame portion and the second sub-frame portion. 6. The method of claim 1 , wherein the convolutional neural network comprises a plurality of different layers, a sub-frame portion of the first frame is processed prior to any sub-frame portion of a second frame being processed, and when the sub-frame portion of the first frame and a sub-frame portion of the second frame are concurrently available for processing by any of the different layers, the sub-frame portion of the first frame is scheduled for processing prior to scheduling the sub-frame portion of the second frame for processing. 7. The method of claim 6 , wherein when the sub-frame portion of the second frame is available for processing by the first layer, but no sub-frame portions of the first frame are available for processing by the first layer, the sub-frame portion of the second frame is processed by the first layer, and after the processing of the sub-frame portion the second frame is completed and when an additional sub-frame portion of the first frame and an additional sub-frame portion of the second frame are concurrently available for processing, the additional sub-frame portion of the first frame is scheduled for processing by the first layer. 8. The method of claim 1 , wherein the convolutional neural network comprises a plurality of different layers which comprise a convolution layer, a max pooling layer and a rectified linear unit layer. 9. The method of claim 8 , further comprising: processing the plurality of frames of image data on a sub-frame portion basis using the plurality of different layers; and displaying the plurality of frames of image data. 10. An image processing device comprising: memory; and a processor configured to: receive a plurality of frames of image data, the plurality of frames comprising sub-frame portions of image data; schedule a first sub-frame portion of a first frame to be processed by a first layer of a convolutional neural network when the first sub-frame portion is available for processing; process the first sub-frame portion by the first layer; and continue the processing of the first sub-frame portion by the first layer when it is determined that there is sufficient image data available for the first layer to continue processing of the first sub-frame portion. 11. The image processing device of claim 10 , wherein the plurality of frames of image data are divided into a plurality of sub-frame portions, and the sub-frame portions are one of slices and tiles. 12. The image processing device of claim 10 , wherein, wherein the first sub-frame portion comprises data for a plurality of pixels, and the first sub-frame portion is available for processing when data for each of the pixels of the first sub-frame portion is in memory. 13. The image processing device of claim 10 , wherein the processor is configured to determine that there is sufficient image data available for the first layer to continue processing the first sub-frame portion when a second sub-frame portion is available for processing by the first layer. 14. The image processing device of claim 13 , wherein the first layer of the convolutional neural network is a convolution layer, and the processing of the first sub-frame portion by the convolutional layer is continued by convolving the first sub-frame portion and the second sub-frame portion. 15. The image processing device of claim 10 , wherein the convolutional neural network comprises a plurality of different layers, a sub-frame portion of the first frame is processed prior to any sub-frame portion of a second frame being processed, and when the sub-frame portion of the first frame and a sub-frame portion of the second frame are concurrently available for processing by any of the different layers, the processor is configured to schedule the sub-frame portion of the first frame for processing prior to scheduling the sub-frame portion of the second frame for processing. 16. The image processing device of claim 15 , wherein when the sub-frame portion of the second frame is available for processing, but no sub-frame portions of the first frame are available for processing, the processor is configured to schedule the sub-frame portion of the second frame for processing by one of the different layers, and after the processing of the sub-frame portion the second frame is completed and when an additional sub-frame portion of the first frame and an additional sub-frame portion of the second frame are concurrently available for processing, the processor is configured to schedule the additional sub-frame portion of the first frame for processing by one of the different layers. 17. The image processing device of claim 10 , wherein the convolutional neural network comprises a plurality of different layers which comprise a convolution layer, a max pooling layer and a rectified linear unit layer. 18. The image processing device of claim 17 , further comprising a display device, wherein the plurality of frames of image data are processed on a sub-frame portion basis using the plurality of different layers; and the plurality of frames of image data are displayed at the display device. 19. A non-transitory computer readable medium comprising instructions for causing a computer to execute a video encoding method comprising: receiving a plurality of frames of image data, the plurality of frames comprising sub-frame portions of image data; scheduling a first sub-frame portion of a first frame to be processed by a first layer of a convolutional neural network when the first sub-frame portion is available for processing; processing the first sub-frame portion by the first layer; and continuing the processing of the first sub-frame portion by the first layer when it is determined that there is sufficient image data available for the first layer to continue processing of the first sub-frame portion. 20. The non-transitory computer readable medium of claim 19 , wh

Assignees

Inventors

Classifications

  • G06T9/002Primary

    using neural networks · CPC title

  • Neural networks · CPC title

  • using neural networks · CPC title

  • G01F1/66Primary

    by measuring frequency, phase shift or propagation time of electromagnetic or other waves, e.g. using ultrasonic flowmeters · CPC title

  • of a piezoelectric element · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11816871B2 cover?
Methods and devices are provided for processing image data on a sub-frame portion basis using layers of a convolutional neural network. The processing device comprises memory and a processor. The processor is configured to receive frames of image data comprising sub-frame portions, schedule a first sub-frame portion of a first frame to be processed by a first layer of the convolutional neural n…
Who is the assignee on this patent?
Advanced Micro Devices Inc, Ati Technologies Ulc
What technology area does this patent fall under?
Primary CPC classification G06T9/002. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Nov 14 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).