Training method, image encoding method, image decoding method and apparatuses thereof

US11330264B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11330264-B2
Application numberUS-202117182433-A
CountryUS
Kind codeB2
Filing dateFeb 23, 2021
Priority dateMar 23, 2020
Publication dateMay 10, 2022
Grant dateMay 10, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Embodiments of this disclosure provide a training method, an image encoding method, an image decoding method and apparatuses thereof. The image encoding apparatus includes: an image encoder configured to encode input image data to obtain a latent variable; a quantizer configured to perform quantizing processing on the latent variable according to a quantization step to generate a quantized latent variable; and an entropy encoder configured to perform entropy coding on the quantized latent variable by using an entropy model to form a bit stream.

First claim

Opening claim text (preview).

What is claimed is: 1. A training device for an image processing apparatus, in which an image encoder and an image decoder are trained by using a training image, the training device comprises: a memory to store a plurality of instructions; and a processor coupled to the memory and configured to: acquire a latent variable obtained by the image encoder by encoding input training image data; acquire first restored image data obtained by the image decoder by decoding the latent variable and second restored image data obtained by the image decoder by decoding a sum of the latent variable and a noise; and train the image encoder and the image decoder according to a cost function, the cost function being related to a deviation between the input training image data and the first restored image data and a deviation between the first restored image data and the second restored image data. 2. An image encoding apparatus, comprising: an image encoder configured to encode input image data to obtain a latent variable, the image encoder encoding the input image data according to training by the training device as claimed in claim 1 ; a quantizer configured to perform quantizing processing on the latent variable according to a quantization operation to generate a quantized latent variable; and an entropy encoder configured to perform entropy coding on the quantized latent variable by using an entropy model to form a bit stream. 3. The image encoding apparatus according to claim 2 , wherein the image encoding apparatus further comprises: a quantization adjuster configured to adjust the quantization operation to adjust a bit rate of the bit stream. 4. The image encoding apparatus according to claim 2 , wherein, the quantizing processing of the quantizer is non-uniform quantizing processing. 5. The image encoding apparatus according to claim 4 , wherein, the non-uniform quantizing processing comprises: taking a latent variable to which a probability distribution peak value of the latent variable corresponds as a zero point, a latent variable of a first range containing the zero point corresponding to a first quantized latent variable; and for other quantized latent variables than the first quantized latent variables, the other quantized latent variables corresponding to latent variables of a second range, the second range being less than the first range. 6. The image encoding apparatus according to claim 5 , wherein, the probability distribution peak value of the latent variable is obtained based on the entropy model. 7. An image decoding apparatus, comprising: an entropy decoder configured to perform entropy decoding on a bit stream by using an entropy model to form a quantized latent variable; a de-quantizer configured to perform de-quantizing processing on the quantized latent variable according to a quantization operation to generate a reconstructed latent variable; and an image decoder configured to perform decoding processing on the reconstructed latent variable to obtain restored image data, the image decoder performing the decoding processing according to training by the training device as claimed in claim 1 . 8. The image decoding apparatus according to claim 7 , wherein, the de-quantizer performs the de-quantizing processing according to the quantization operation. 9. The image decoding apparatus according to claim 7 , wherein the image decoding apparatus further comprises: a quantization adjuster configured to adjust the quantization operation.

Assignees

Inventors

Classifications

  • Data rate or code amount at the encoder output · CPC title

  • H04N19/124Primary

    Quantisation · CPC title

  • the unit being bits, e.g. of the compressed video stream · CPC title

  • involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements {(video transcoding H04N19/40; media packet handling at the source H04L65/762)} · CPC title

  • involving reformatting operations of video signals for household redistribution, storage or real-time display {(details of conversion of video standards at pixel level H04N7/01; video transcoding H04N19/40; adapting incoming signals to the display format of the display terminal G09G5/005; media handling at the source in data packet switching networks H04L65/764)} · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11330264B2 cover?
Embodiments of this disclosure provide a training method, an image encoding method, an image decoding method and apparatuses thereof. The image encoding apparatus includes: an image encoder configured to encode input image data to obtain a latent variable; a quantizer configured to perform quantizing processing on the latent variable according to a quantization step to generate a quantized late…
Who is the assignee on this patent?
Fujitsu Ltd
What technology area does this patent fall under?
Primary CPC classification H04N19/124. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue May 10 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 7 related publications on this page (citations in our corpus or others sharing the same primary CPC).