Implicit image and video compression using machine learning systems

US2022385907A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2022385907-A1
Application numberUS-202117645018-A
CountryUS
Kind codeA1
Filing dateDec 17, 2021
Priority dateMay 21, 2021
Publication dateDec 1, 2022
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Techniques are described for compressing and decompressing data using machine learning systems. An example process can include receiving a plurality of images for compression by a neural network compression system. The process can include determining, based on a first image from the plurality of images, a first plurality of weight values associated with a first model of the neural network compression system. The process can include generating a first bitstream comprising a compressed version of the first plurality of weight values. The process can include outputting the first bitstream for transmission to a receiver.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method of processing media data, comprising: receiving a plurality of images for compression by a neural network compression system; determining, based on a first image from the plurality of images, a first plurality of weight values associated with a first model of the neural network compression system; generating a first bitstream comprising a compressed version of the first plurality of weight values; and outputting the first bitstream for transmission to a receiver. 2 . The method of claim 1 , wherein at least one layer of the first model includes a positional encoding of a plurality of coordinates associated with the first image. 3 . The method of claim 2 , wherein the first model is configured to determine one or more pixel values corresponding to the plurality of coordinates associated with the first image. 4 . The method of claim 1 , further comprising: determining, based on a second image from the plurality of images, a second plurality of weight values for use by a second model associated with the neural network compression system; generating a second bitstream comprising a compressed version of the second plurality of weight values; and outputting the second bitstream for transmission to a receiver. 5 . The method of claim 4 , wherein the second model is configured to determine an optical flow between the first image and the second image. 6 . The method of claim 5 , further comprising: determining, based on the optical flow, at least one updated weight value from the first plurality of weight values. 7 . The method of claim 1 , further comprising: quantizing the first plurality of weight values under a weight prior to yield a plurality of quantized weight values, wherein the first bitstream comprises a compressed version of the plurality of quantized weight values. 8 . The method of claim 7 , wherein the weight prior is selected to minimize a rate loss associated with sending the first bitstream to the receiver. 9 . The method of claim 7 , wherein generating the first bitstream comprises: entropy encoding the first plurality of weight values using the weight prior. 10 . The method of claim 7 , wherein the first plurality of weight values is quantized using fixed-point quantization. 11 . The method of claim 10 , wherein the fixed-point quantization is implemented using a machine learning algorithm. 12 . The method of claim 1 , further comprising: selecting, based on the first image, a model architecture corresponding to the first model. 13 . The method of claim 12 , further comprising: generating a second bitstream comprising a compressed version of the model architecture; and outputting the second bitstream for transmission to the receiver. 14 . The method of claim 12 , wherein selecting the model architecture comprises: tuning, based on the first image, a plurality of weight values associated with one or more model architectures, wherein each of the one or more model architectures is associated with one or more model characteristics; determining at least one distortion between the first image and reconstructed data output corresponding to each of the one or more model architectures; and selecting the model architecture from the one or more model architectures based on the at least one distortion. 15 . The method of claim 14 , wherein the one or more model characteristics include at least one of a width, a depth, a resolution, a size of a convolution kernel, and an input dimension. 16 . An apparatus comprising: at least one memory; and at least one processor coupled to the at least one memory and configured to: receive a plurality of images for compression by a neural network compression system; determine, based on a first image from the plurality of images, a first plurality of weight values associated with a first model of the neural network compression system; generate a first bitstream comprising a compressed version of the first plurality of weight values; and output the first bitstream for transmission to a receiver. 17 . The apparatus of claim 16 , wherein at least one layer of the first model includes a positional encoding of a plurality of coordinates associated with the first image. 18 . The apparatus of claim 17 , wherein the first model is configured to determine one or more pixel values corresponding to the plurality of coordinates associated with the first image. 19 . The apparatus of claim 16 , wherein the at least one processor is further configured to: determine, based on a second image from the plurality of images, a second plurality of weight values for use by a second model associated with the neural network compression system; generate a second bitstream comprising a compressed version of the second plurality of weight values; and output the second bitstream for transmission to a receiver. 20 . The apparatus of claim 19 , wherein the second model is configured to determine an optical flow between the first image and the second image. 21 . The apparatus of claim 20 , wherein the at least one processor is further configured to: determine, based on the optical flow, at least one updated weight value from the first plurality of weight values. 22 . The apparatus of claim 16 , wherein the at least one processor is further configured to: quantize the first plurality of weight values under a weight prior to yield a plurality of quantized weight values, wherein the first bitstream comprises a compressed version of the plurality of quantized weight values. 23 . The apparatus of claim 22 , wherein the weight prior is selected to minimize a rate loss associated with sending the first bitstream to the receiver. 24 . The apparatus of claim 22 , wherein to generate the first bitstream the at least one processor is further configured to: entropy encode the first plurality of weight values using the weight prior. 25 . The apparatus of claim 22 , wherein the first plurality of weight values are quantized using fixed-point quantization. 26 . The apparatus of claim 25 , wherein the fixed-point quantization is implemented using a machine learning algorithm. 27 . The apparatus of claim 16 , wherein the at least one processor is further configured to: select, based on the first image, a model architecture corresponding to the first model. 28 . The apparatus of claim 27 , wherein the at least one processor is further configured to: generate a second bitstream comprising a compressed version of the model architecture; and output the second bitstream for transmission to the receiver. 29 . The apparatus of claim 27 , wherein to select the model architecture the at least one processor is further configured to: tune, based on the first image, a plurality of weight values associated with one or more model architectures, wherein each of the one or more model architectures is associated with one or more model characteristics; determine at least one distortion between the first image and reconstructed data output corresponding to each of the one or more model architectures; and select the model architecture from the one or more model architectures based on the at least one distortion. 30 . The apparatus of claim 29 , wherein the one or more model characteristics include a

Assignees

Inventors

Classifications

  • Combinations of networks · CPC title

  • Motion estimation other than block-based · CPC title

  • Learning methods · CPC title

  • Entropy coding, e.g. variable length coding [VLC] or arithmetic coding · CPC title

  • Data rate or code amount at the encoder output · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2022385907A1 cover?
Techniques are described for compressing and decompressing data using machine learning systems. An example process can include receiving a plurality of images for compression by a neural network compression system. The process can include determining, based on a first image from the plurality of images, a first plurality of weight values associated with a first model of the neural network compr…
Who is the assignee on this patent?
Qualcomm Inc
What technology area does this patent fall under?
Primary CPC classification H04N19/124. Mapped technology areas include Electricity.
When was this patent published?
Publication date Thu Dec 01 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).