Apparatus and method for performing scalable video decoding

US11610283B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11610283-B2
Application numberUS-202016831805-A
CountryUS
Kind codeB2
Filing dateMar 27, 2020
Priority dateMar 28, 2019
Publication dateMar 21, 2023
Grant dateMar 21, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Provided are a method and an apparatus for performing scalable video decoding, wherein the method and the apparatus down-sample input video, determine the down-sampled input video as base layer video, generate prediction video for enhancement layer video by applying an up-scaling filter to the base layer video, and code the base layer video and the prediction video, wherein the up-scaling filter is a convolution filter of a deep neural network.

First claim

Opening claim text (preview).

What is claimed is: 1. A method of performing scalable video decoding, the method comprising: generating base layer video by down-sampling input video; generating prediction video for enhancement layer video by selectively applying a fixed up-scaling filter, which has fixed filter coefficient values, and a convolution filter of a deep neural network to the base layer video; and coding the base layer video and the prediction video, wherein the generating of the prediction video for the enhancement layer video comprises generating the prediction video for the enhancement layer video by applying one convolution filter corresponding to a compression and distortion degree of the generated base layer video to the base layer video among a plurality of convolution filters of a plurality of deep neural networks which are pre-trained according to a plurality of compression and distortion degrees. 2. The method of claim 1 , wherein the generating of the prediction video comprises generating the prediction video for the enhancement layer video by applying a bi-cubic interpolation to a chrominance component of the base layer video and applying the convolution filter of the deep neural network to a luminance component of the base layer video. 3. The method of claim 1 , wherein the deep neural network is a deep neural network trained on the basis of a difference between video scaled up from low-resolution luminance input video and high-resolution original video. 4. The method of claim 1 , wherein the deep neural network comprises a plurality of residual blocks in which two convolution layers and two activation functions are alternately connected. 5. The method of claim 4 , wherein the activation functions comprise leaky rectified linear units (LReLUs). 6. The method of claim 1 , wherein the deep neural network comprises a pixel shuffle layer. 7. A non-transitory computer-readable recording medium recording thereon a program for executing the method of claim 1 in a computer. 8. An apparatus for performing scalable video decoding, the apparatus comprising: a controller configured to generate base layer video by down-sampling input video, generate prediction video for enhancement layer video by selectively applying a fixed up-scaling filter, which has fixed filter coefficient values, and a convolution filter of a deep neural network to the base layer video, and code the base layer video and the prediction video, wherein the controller is configured to generate the prediction video for the enhancement layer video by applying one convolution filter corresponding to a compression and distortion degree of the generated base layer video to the base layer video among a plurality of convolution filters of a plurality of deep neural networks which are pre-trained according to a plurality of compression and distortion degrees. 9. The apparatus of claim 8 , wherein the controller generates the prediction video for the enhancement layer video by applying a bi-cubic interpolation to a chrominance component of the base layer video and applying the convolution filter of the deep neural network to a luminance component of the base layer video. 10. The apparatus of claim 8 , wherein the deep neural network is a deep neural network trained on the basis of a difference between video scaled up from low-resolution luminance input video and high-resolution original video. 11. The apparatus of claim 8 , wherein the deep neural network comprises a plurality of residual blocks in which two convolution layers and two activation functions are alternately connected. 12. The apparatus of claim 11 , wherein the activation functions comprise leaky rectified linear units (LReLUs). 13. The apparatus of claim 8 , wherein the deep neural network comprises a pixel shuffle layer.

Assignees

Inventors

Classifications

  • Convolutional networks [CNN, ConvNet] · CPC title

  • involving spatial sub-sampling or interpolation, e.g. alteration of picture size or resolution · CPC title

  • H04N19/33Primary

    in the spatial domain · CPC title

  • Learning methods · CPC title

  • the unit being a colour or a chrominance component · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11610283B2 cover?
Provided are a method and an apparatus for performing scalable video decoding, wherein the method and the apparatus down-sample input video, determine the down-sampled input video as base layer video, generate prediction video for enhancement layer video by applying an up-scaling filter to the base layer video, and code the base layer video and the prediction video, wherein the up-scaling filte…
Who is the assignee on this patent?
Agency Defense Dev
What technology area does this patent fall under?
Primary CPC classification H04N19/33. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Mar 21 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).