Feature-Domain Residual for Video Coding for Machines

US2021314573A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2021314573-A1
Application numberUS-202117218967-A
CountryUS
Kind codeA1
Filing dateMar 31, 2021
Priority dateApr 7, 2020
Publication dateOct 7, 2021
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An apparatus includes at least one processor; and at least one non-transitory memory including computer program code; wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus at least to: decode encoded data to generate decoded data, the encoded data having a bitrate lower than that of original data, and extract features from the decoded data; decode encoded residual features to generate decoded residual features; and generate enhanced decoded features as a result of combining the decoded residual features with the features extracted from the decoded data.

First claim

Opening claim text (preview).

1 . An apparatus comprising: at least one processor; and at least one non-transitory memory including computer program code; wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus at least to: encode original data with a first codec to generate encoded data with a bitrate lower than that of the original data, and decoded data; encode the original data with at least one second learned codec to generate encoded residual features and decoded residual features; and generate enhanced decoded features as a result of combining the decoded residual features with features extracted from the decoded data generated with the first codec. 2 . The apparatus of claim 1 , wherein at least one machine processes or analyzes the decoded data using the enhanced decoded features. 3 . The apparatus of claim 1 , wherein the at least one memory and the computer program code are further configured to, with the at least one processor, cause the apparatus at least to: generate enhanced decoded video resulting from combining the decoded data with the enhanced decoded features; wherein at least one machine processes or analyzes the decoded data using the enhanced decoded video. 4 . The apparatus of claim 3 , wherein the enhanced decoded video is generated using a neural network. 5 . The apparatus of claim 1 , wherein the at least one memory and the computer program code are further configured to, with the at least one processor, cause the apparatus at least to: rather than generating the enhanced decoded features, generate enhanced decoded video resulting from combining the decoded data with the decoded residual features; wherein at least one machine processes or analyzes the decoded data using the enhanced decoded video. 6 . The apparatus of claim 5 , wherein the enhanced decoded video is generated using a neural network. 7 . The apparatus of claim 1 , wherein the residual features are encoded using at least one neural network, and the residual features are decoded using at least one neural network. 8 . The apparatus of claim 1 , wherein the features extracted from the decoded data generated with the first codec are extracted using a neural network. 9 . The apparatus of claim 1 , wherein the at least one memory and the computer program code are further configured to, with the at least one processor, cause the apparatus at least to: extract features from the original data; extract features from the decoded data; and generate the residual features, prior to being encoded, as a result of computing a difference between the features extracted from the decoded data and the features extracted from the original data. 10 .- 20 . (canceled) 21 . An apparatus comprising: at least one processor; and at least one non-transitory memory including computer program code; wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus at least to: decode encoded data to generate decoded data, the encoded data having a bitrate lower than that of original data, and extract features from the decoded data; decode encoded residual features to generate decoded residual features; and generate enhanced decoded features as a result of combining the decoded residual features with the features extracted from the decoded data. 22 . The apparatus of claim 21 , wherein the at least one memory and the computer program code are further configured to, with the at least one processor, cause the apparatus at least to: process or analyze the enhanced decoded features using at least one task neural network. 23 . The apparatus of claim 21 , wherein the at least one memory and the computer program code are further configured to, with the at least one processor, cause the apparatus at least to: generate enhanced decoded video as a result of combining the decoded data with the enhanced decoded features; wherein the combining of the decoded data with the enhanced decoded features to generate the enhanced decoded video is performed using a neural network; and process or analyze the enhanced decoded video using at least one task neural network. 24 . The apparatus of claim 21 , wherein the at least one memory and the computer program code are further configured to, with the at least one processor, cause the apparatus at least to: generate enhanced decoded video as a result of combining the decoded data with the decoded residual features; wherein the combining of the decoded data with the decoded residual features to generate the enhanced decoded video is performed using a neural network; and process or analyze the enhanced decoded video using at least one task neural network. 25 . The apparatus of claim 21 , wherein the features are extracted from the decoded data using a neural network; and the encoded residual features are decoded using a neural network. 26 . The apparatus of claim 21 , wherein the combining of the decoded residual features with the features extracted from the decoded data to generate the enhanced decoded features is a summation of the decoded residual features and the features extracted from the decoded data. 27 . The apparatus of claim 21 , wherein the encoded residual features are a difference between features extracted from the original data, and features extracted from preliminary decoded data or the features extracted from the decoded data. 28 . The apparatus of claim 21 , wherein the decoded residual features are decoded using entropy decoding and dequantization. 29 . The apparatus of claim 21 , wherein the decoded residual features are decoded using an image of a video decoder, the decoding of the residual features comprising converting decoded feature map images to the decoded residual features. 30 . The apparatus of claim 21 , wherein the original data is video data. 31 . A method comprising: decoding encoded data to generate decoded data, the encoded data having a bitrate lower than that of original data, and extracting features from the decoded data; decoding encoded residual features to generate decoded residual features; and generating enhanced decoded features as a result of combining the decoded residual features with the features extracted from the decoded data. 32 . (canceled)

Assignees

Inventors

Classifications

  • using hierarchical techniques, e.g. scalability (H04N19/63 takes precedence) · CPC title

  • H04N19/50Primary

    using predictive coding (H04N19/61 takes precedence) · CPC title

  • using neural networks · CPC title

  • using parallelised computational arrangements · CPC title

  • H04N19/146Primary

    Data rate or code amount at the encoder output · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2021314573A1 cover?
An apparatus includes at least one processor; and at least one non-transitory memory including computer program code; wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus at least to: decode encoded data to generate decoded data, the encoded data having a bitrate lower than that of original data, and extract featur…
Who is the assignee on this patent?
Nokia Technologies Oy
What technology area does this patent fall under?
Primary CPC classification H04N19/50. Mapped technology areas include Electricity.
When was this patent published?
Publication date Thu Oct 07 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).