Method and apparatus for performing artificial intelligence encoding and artificial intelligence decoding

US11816872B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11816872-B2
Application numberUS-202117503420-A
CountryUS
Kind codeB2
Filing dateOct 18, 2021
Priority dateJun 11, 2020
Publication dateNov 14, 2023
Grant dateNov 14, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An apparatus for performing artificial intelligence (AI) encoding on an image includes: a memory storing one or more instructions; and a processor configured to execute the one or more instructions stored in the memory to: determine a resolution of an original image; when the resolution of the original image is higher than a predetermined value, obtain a first image by performing AI downscaling on the original image via a downscaling deep neural network (DNN); when the resolution of the original image is lower than or equal to the predetermined value, obtain a first image by performing AI one-to-one preprocessing on the original image via a one-to-one preprocessing DNN for upscaling; generate image data by performing first encoding on the first image; and transmit the image data and AI data including information related to the AI downscaling or information related to the AI one-to-one preprocessing.

First claim

Opening claim text (preview).

What is claimed is: 1. An apparatus for performing artificial intelligence (AI) decoding on an image, the apparatus comprising: a memory storing one or more instructions; and at least one processor configured to execute the one or more instructions stored in the memory to: obtain AI data and image data, the image data generated as a result of first encoding of a first image, and the AI data related to AI downscaling of an original image to the first image or AI one-to-one preprocessing of the original image to the first image; obtain a second image corresponding to the first image by performing first decoding on the image data; obtain, based on the AI data, deep neural network (DNN) setting information for AI upscaling of the second image, from among a plurality of pieces of DNN setting information; and generate a third image by performing the AI upscaling on the second image via an upscaling DNN operating based on the obtained DNN setting information, wherein the plurality of pieces of DNN setting information are obtained via: first joint training of the upscaling DNN and a downscaling DNN used for the AI downscaling of the original image, and second joint training of a one-to-one preprocessing DNN used for the AI one-to-one preprocessing of the original image and the upscaling DNN, the second joint training being performed using DNN setting information for the AI upscaling obtained as a result of the first joint training. 2. The apparatus of claim 1 , wherein the one-to-one preprocessing DNN is configured to enhance features of the original image while maintaining a same resolution as that of the original image. 3. A method for performing artificial intelligence (AI) decoding on an image, the method comprising: obtaining AI data and image data, the image data generated as a result of first encoding of a first image, and the AI data related to AI downscaling of an original image to the first image or AI one-to-one preprocessing of the original image to the first image; obtaining a second image corresponding to the first image by performing first decoding on the image data; obtaining, based on the AI data, deep neural network (DNN) setting information for AI upscaling of the second image, from among a plurality of pieces of DNN setting information; and generating a third image by performing the AI upscaling on the second image via an upscaling DNN operating based on the obtained DNN setting information, wherein the plurality of pieces of DNN setting information are obtained via: first joint training of the upscaling DNN and a downscaling DNN used for the AI downscaling of the original image, and second joint training of a one-to-one preprocessing DNN used for the AI one-to-one preprocessing of the original image and the upscaling DNN, the second joint training being performed using DNN setting information for the AI upscaling obtained as a result of the first joint training. 4. The method of claim 3 , wherein the one-to-one preprocessing DNN is configured to enhance features of the original image while maintaining a same resolution as that of the original image. 5. A non-transitory computer-readable recording medium having recorded thereon a program for performing the method of claim 3 .

Assignees

Inventors

Classifications

  • Supervised learning · CPC title

  • Convolutional networks [CNN, ConvNet] · CPC title

  • G06T9/002Primary

    using neural networks · CPC title

  • Learning methods · CPC title

  • using neural networks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11816872B2 cover?
An apparatus for performing artificial intelligence (AI) encoding on an image includes: a memory storing one or more instructions; and a processor configured to execute the one or more instructions stored in the memory to: determine a resolution of an original image; when the resolution of the original image is higher than a predetermined value, obtain a first image by performing AI downscaling…
Who is the assignee on this patent?
Samsung Electronics Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06T9/002. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Nov 14 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).