Method and device for processing multi-channel feature map images

US10733767B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10733767-B2
Application numberUS-201815967039-A
CountryUS
Kind codeB2
Filing dateApr 30, 2018
Priority dateMay 31, 2017
Publication dateAug 4, 2020
Grant dateAug 4, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A convolutional neural network-based image processing method is provided. The method includes: receiving, in a second layer, multi-channel feature map images generated by applying a convolution operation to an input image of a convolutional neural network having a plurality of layers with a plurality of filter kernels of a first layer; analyzing a dynamic range of the multi-channel feature map images; re-ordering the multi-channel feature map images, based on the dynamic range; and processing the re-ordered multi-channel feature map images in the second layer.

First claim

Opening claim text (preview).

What is claimed is: 1. A convolutional neural network-based image processing method comprising: receiving, in a second layer, multi-channel feature map images generated by applying a convolution operation to an input image of a convolutional neural network having a plurality of layers with a plurality of filter kernels of a first layer; analyzing a dynamic range of the multi-channel feature map images; re-ordering the multi-channel feature map images, based on the dynamic range; and processing the re-ordered multi-channel feature map images in the second layer. 2. The convolutional neural network-based image processing method of claim 1 , wherein the analyzing comprises obtaining the dynamic range for each of the multi-channel feature map images, based on a maximum output value and a minimum output value of each of the plurality of filter kernels of the first layer. 3. The convolutional neural network-based image processing method of claim 1 , wherein the re-ordering comprises aligning the multi-channel feature map images in a descending order or an ascending order in a channel direction, based on the dynamic range. 4. The convolutional neural network-based image processing method of claim 1 , wherein the processing comprises performing inter-channel inter-prediction on the re-ordered multi-channel feature map images. 5. The convolutional neural network-based image processing method of claim 1 , wherein the processing comprises performing encoding and decoding operations on the re-ordered multi-channel feature map images, wherein the encoding and decoding operations in the second layer are performed according to a pipelined process with a convolution operation in a layer subsequent to the second layer among the plurality of layers. 6. The convolutional neural network-based image processing method of claim 1 , wherein the processing comprises determining a coding unit having a pixel depth in a channel direction of the multi-channel feature map images, the pixel depth corresponding to a number of channels in the channel direction in the multi-channel feature map images. 7. The convolutional neural network-based image processing method of claim 6 , wherein the coding unit corresponds to a basic unit of a filter kernel operation in a layer subsequent to the second layer among the plurality of layers. 8. The convolutional neural network-based image processing method of claim 6 , wherein a size of the coding unit is X×Y×Z, where X, Y, and Z are integers. 9. The convolutional neural network-based image processing method of claim 6 , wherein the processing further comprises: dividing the coding unit into sub-coding units in the channel direction; and performing encoding on each of the sub-coding units. 10. The convolutional neural network-based image processing method of claim 9 , wherein the performing of encoding on each of the sub-coding units comprises: encoding information on whether an encoding mode of a current sub-coding unit is equal to an encoding mode of a previous sub-coding unit; and encoding information on the encoding mode of the current sub-coding unit when the encoding mode of the current sub-coding unit is not equal to the encoding mode of the previous sub-coding unit. 11. A convolutional neural network-based image processing device comprises: a storage; and a controller configured to control the image processing device to receive, in a second layer, multi-channel feature map images generated by applying a convolution operation to an input image of a convolutional neural network having a plurality of layers with a plurality of filter kernels of a first layer, to analyze a dynamic range of the multi-channel feature map images, to re-order the multi-channel feature map images, based on the dynamic range, to process the re-ordered multi-channel feature map images in the second layer, and to store the processed re-ordered multi-channel feature map images in the storage. 12. The convolutional neural network-based image processing device of claim 11 , wherein the controller is further configured to control the image processing device to obtain the dynamic range for each of the multi-channel feature map images, based on a maximum output value and a minimum output value of each of the plurality of filter kernels of the first layer. 13. The convolutional neural network-based image processing device of claim 11 , wherein the controller is further configured to control the image processing device to align the multi-channel feature map images in a descending order or an ascending order in a channel direction, based on the dynamic range. 14. The convolutional neural network-based image processing device of claim 11 , wherein the controller is further configured to control the image processing device to perform inter-channel inter-prediction on the re-ordered multi-channel feature map images. 15. The convolutional neural network-based image processing device of claim 11 , wherein the controller is further configured to control the image processing device to perform encoding and decoding operations on the re-ordered multi-channel feature map images, wherein the controller is further configured to control the image processing device to perform the encoding and decoding operations in the second layer according to a pipelined process with a convolution operation in a layer subsequent to the second layer among the plurality of layers. 16. The convolutional neural network-based image processing device of claim 11 , wherein the controller is further configured to control the image processing device to determine a coding unit having a pixel depth in a channel direction of the multi-channel feature map images, the pixel depth corresponding to a number of channels in the channel direction in the multi-channel feature map images. 17. The convolutional neural network-based image processing device of claim 16 , wherein the coding unit corresponds to a basic unit of a filter kernel operation in a layer subsequent to the second layer among the plurality of layers. 18. The convolutional neural network-based image processing device of claim 16 , wherein the controller is further configured to control the image processing device to control the image processing device to divide the coding unit into sub-coding units in the channel direction, and to perform encoding on each of the sub-coding units. 19. The convolutional neural network-based image processing device of claim 18 , wherein the controller is configured to control the image processing device to encode information on whether an encoding mode of a current sub-coding unit is equal to an encoding mode of a previous sub-coding unit, and to encode information on the encoding mode of the current sub-coding unit when the encoding mode of the current sub-coding unit is not equal to the encoding mode of the previous sub-coding unit. 20. A non-transitory computer-readable recording medium having stored thereon program commands which, when executed, cause a convolution neural network-based image process device to perform: receiving, in a second layer, multi-channel feature map images generated by applying a convolution operation to an input image of a convolutional neural network having a plurality of layers with a plurality of filter kernels of a first layer; analyzing a dynamic range of the received multi-channel feature map images; re-ordering the received multi-channel feature map images based on the dynamic range; and processing the re-ordered multi-channel feature map images in

Assignees

Inventors

Classifications

  • using neural networks · CPC title

  • Combinations of networks · CPC title

  • Integrating the filters into a hierarchical structure, e.g. convolutional neural networks [CNN] · CPC title

  • Convolutional networks [CNN, ConvNet] · CPC title

  • Auto-encoder networks; Encoder-decoder networks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10733767B2 cover?
A convolutional neural network-based image processing method is provided. The method includes: receiving, in a second layer, multi-channel feature map images generated by applying a convolution operation to an input image of a convolutional neural network having a plurality of layers with a plurality of filter kernels of a first layer; analyzing a dynamic range of the multi-channel feature map …
Who is the assignee on this patent?
Samsung Electronics Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06T9/002. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 04 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).