Method and apparatus for performing artificial intelligence encoding and artificial intelligence decoding

US11270469B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11270469-B2
Application numberUS-202117333845-A
CountryUS
Kind codeB2
Filing dateMay 28, 2021
Priority dateJun 11, 2020
Publication dateMar 8, 2022
Grant dateMar 8, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An apparatus for performing artificial intelligence (AI) encoding on an image includes: a memory storing one or more instructions; and a processor configured to execute the one or more instructions stored in the memory to: determine a resolution of an original image; when the resolution of the original image is higher than a predetermined value, obtain a first image by performing AI downscaling on the original image via a downscaling deep neural network (DNN); when the resolution of the original image is lower than or equal to the predetermined value, obtain a first image by performing AI one-to-one preprocessing on the original image via a one-to-one preprocessing DNN for upscaling; generate image data by performing first encoding on the first image; and transmit the image data and AI data including information related to the AI downscaling or information related to the AI one-to-one preprocessing.

First claim

Opening claim text (preview).

What is claimed is: 1. An apparatus for performing artificial intelligence (AI) encoding on an image, the apparatus comprising: a memory storing one or more instructions; and at least one processor configured to execute the one or more instructions stored in the memory to: determine a resolution of an original image; based on the resolution of the original image being greater than a predetermined value, obtain a first image by performing AI downscaling on the original image via a downscaling deep neural network (DNN); based on the resolution of the original image being less than or being equal to the predetermined value, obtain the first image by performing AI one-to-one preprocessing on the original image via a one-to-one preprocessing DNN for upscaling; generate image data by performing first encoding on the first image; and transmit the image data and AI data including information related to the AI downscaling or information related to the AI one-to-one preprocessing, wherein the AI data comprises information for selecting DNN setting information of an upscaling DNN for AI upscaling of a second image that is generated by performing first decoding on the image data, wherein DNN setting information of the downscaling DNN is obtained via first joint training of the downscaling DNN and the upscaling DNN, and wherein DNN setting information of the one-to-one preprocessing DNN is obtained via second joint training of the one-to-one preprocessing DNN and the upscaling DNN by using the DNN setting information of the upscaling DNN obtained via the first joint training. 2. The apparatus of claim 1 , wherein the one-to-one preprocessing DNN is configured to enhance features of the original image while maintaining a same resolution as that of the original image. 3. The apparatus of claim 1 , wherein the downscaling DNN and the upscaling DNN are jointly trained based on first loss information corresponding to a result of comparing a first training image output from the upscaling DNN with an original training image that has not undergone the AI downscaling. 4. The apparatus of claim 3 , wherein the downscaling DNN is trained by further taking into account at least one of second loss information, which corresponds to a result of comparing a reduced training image corresponding to the original training image with a second training image output from the downscaling DNN, or third loss information corresponding to a degree of spatial complexity of the second training image. 5. The apparatus of claim 1 , wherein the one-to-one preprocessing DNN is trained by taking into account fourth loss information corresponding to a result of comparing the original training image with a fourth training image obtained via the one-to-one preprocessing DNN and the upscaling DNN by taking, as an input, a reduced training image corresponding to the original training image. 6. The apparatus of claim 1 , wherein the predetermined value is a 2K resolution. 7. The apparatus of claim 1 , wherein the predetermined value is a 4K resolution. 8. The apparatus of claim 1 , wherein, based on a parameter of one of the upscaling DNN and the downscaling DNN being updated during the first joint training, a corresponding parameter of the other one of the upscaling DNN and the downscaling DNN is updated accordingly. 9. The apparatus of claim 8 , wherein, based on a parameter of the upscaling DNN being updated during the first joint training, a corresponding parameter of the one-to-one preprocessing DNN is updated accordingly during the second joint training. 10. A method of performing artificial intelligence (AI) encoding on an image, the method comprising: determining a resolution of an original image; determining, based on the resolution of the original image, whether to obtain a first image by performing AI one-to-one processing on the original image via a one-to-one preprocessing deep neural network (DNN) for upscaling, or by performing AI downscaling on the original image via a downscaling DNN; based on the resolution of the original image being less than or being equal to a predetermined value, obtaining the first image by performing the AI one-to-one preprocessing on the original image via the one-to-one preprocessing DNN for upscaling; based on the resolution of the original image being greater than the predetermined value, obtaining the first image by performing the AI downscaling on the original image via the downscaling DNN; generating image data by performing first encoding on the first image; and transmitting the image data and AI data including information related to the AI one-to-one preprocessing or information related to the AI downscaling, wherein the AI data comprises information for selecting DNN setting information of an upscaling DNN for AI upscaling of a second image that is generated by performing first decoding on the image data, wherein DNN setting information of the downscaling DNN is obtained via first joint training of the downscaling DNN and the upscaling DNN, and wherein DNN setting information of the one-to-one preprocessing DNN is obtained via second joint training of the one-to-one preprocessing DNN and the upscaling DNN by using the DNN setting information of the upscaling DNN obtained via the first joint training. 11. The method of claim 10 , wherein the one-to-one preprocessing DNN is configured to enhance features of the original image while maintaining a same resolution as that of the original image. 12. The method of claim 10 , wherein the downscaling DNN and the upscaling DNN are jointly trained based on first loss information corresponding to a result of comparing a first training image output from the upscaling DNN with an original training image that has not undergone the AI downscaling. 13. The method of claim 12 , wherein the downscaling DNN is trained by further taking into account at least one of second loss information, which corresponds to a result of comparing a reduced training image corresponding to the original training image with a second training image output from the downscaling DNN, or third loss information corresponding to a degree of spatial complexity of the second training image. 14. The method of claim 10 , wherein the one-to-one preprocessing DNN is trained by taking into account fourth loss information corresponding to a result of comparing the original training image with a fourth training image obtained via the one-to-one preprocessing DNN and the upscaling DNN by taking, as an input, a reduced training image corresponding to the original training image. 15. The method of claim 10 , wherein the predetermined value is a 2K resolution or a 4K resolution. 16. A non-transitory computer-readable recording medium having recorded thereon a program for performing the method of claim 10 .

Assignees

Inventors

Classifications

  • Activation functions · CPC title

  • Combinations of networks · CPC title

  • Supervised learning · CPC title

  • Convolutional networks [CNN, ConvNet] · CPC title

  • Integrating the filters into a hierarchical structure, e.g. convolutional neural networks [CNN] · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11270469B2 cover?
An apparatus for performing artificial intelligence (AI) encoding on an image includes: a memory storing one or more instructions; and a processor configured to execute the one or more instructions stored in the memory to: determine a resolution of an original image; when the resolution of the original image is higher than a predetermined value, obtain a first image by performing AI downscaling…
Who is the assignee on this patent?
Samsung Electronics Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06T9/002. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Mar 08 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).