Method for learned image compression and related autoencoder

US12587664B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12587664-B2
Application numberUS-202418758390-A
CountryUS
Kind codeB2
Filing dateJun 28, 2024
Priority dateSep 11, 2023
Publication dateMar 24, 2026
Grant dateMar 24, 2026

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for learned image compression implemented in an autoencoder includes: a) extracting from an image a latent space by the learnable encoder; b) quantizing the latent space by a quantizer to obtain a quantized latent space; c) entropy coding the quantized latent space by an entropy encoder to obtain a bitstream, wherein an entropy model used to encode the latent space is represented by a probability distribution; d) entropy decoding the bitstream by an entropy decoder to obtain an entropy decoded bitstream; e) feeding the entropy decoded bitstream to the decoder; f) recover a reconstructed image by the decoder; g) training the autoencoder via standard gradient descent of the backpropagated error gradient by finding learnable parameters of the learnable encoder and of the decoder that minimize a rate distortion cost function, wherein the entropy encoder is based on a differentiable formulation of a soft frequency counter.

First claim

Opening claim text (preview).

The invention claimed is: 1 . A method for learned image compression implemented in an autoencoder comprising a learnable encoder (f a ) and a decoder (f s ), said method comprising the steps of: a) extracting from an image (x) a latent space (y) by means of said learnable encoder (f a ); b) quantizing said latent space (y) by means of a quantizer (U|Q) to obtain a quantized latent space (ŷ); c) entropy coding said quantized latent space (ŷ) by means of an entropy encoder to obtain a bitstream, wherein an entropy model used to encode said latent space (y) is represented by a probability distribution p ŷ ; d) entropy decoding said bitstream by means of an entropy decoder to obtain an entropy decoded bitstream; e) feeding said entropy decoded bitstream to said decoder (f s ); f) recover a reconstructed image ({circumflex over (x)}) by means of said decoder (f s ); g) training said autoencoder via standard gradient descent of the backpropagated error gradient by finding learnable parameters (θ f ,θ g ) of said learnable encoder (f a ) and of said decoder (f s ) that minimize a rate distortion cost function L, wherein said entropy encoder is based on a differentiable formulation of a soft frequency counter (SFC). 2 . The method according to claim 1 , wherein said latent space (y) comprises a number N c of latent space channels having a dimension N d , and wherein, given a j-th channel of said latent space (y) and a quantization level l i j of said j-th channel, the soft frequency counter (SFC) associates every value of said latent space y n j to a weight inversely proportional to the distance with l i j , where n varies within the same channel and ranges from 1 to N d . 3 . The method according to claim 2 , wherein said soft frequency counter (SFC) relies on a scalar function φ i j and wherein a first order entropy H {tilde over (p)} of a probability distribution {acute over (p)} j for every single channel of said latent space is: H p ~ = - 1 N c ⁢ ∑ j = 1 N c H p ~ j = - 1 N c ⁢ ∑ j = 1 N c ∑ i = 1 L SFC ⁡ ( l i j ) ⁢ log 2 [ SFC ⁡ ( l i j ) ] where SFC ⁡ ( l i j ) = ∑ n = 1 N d ϕ i j ( y n j ) ∑ m = 1 L ∑ n = 1 N d ϕ m j ( y n j ) and ϕ i j

Assignees

Inventors

Classifications

  • G06N3/0455Primary

    Auto-encoder networks; Encoder-decoder networks · CPC title

  • Entropy coding, e.g. variable length coding [VLC] or arithmetic coding · CPC title

  • Quantisation · CPC title

  • according to rate distortion criteria (rate-distortion as a criterion for motion estimation H04N19/567) · CPC title

  • Adaptive entropy coding, e.g. adaptive variable length coding [AVLC] or context adaptive binary arithmetic coding [CABAC] · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12587664B2 cover?
A method for learned image compression implemented in an autoencoder includes: a) extracting from an image a latent space by the learnable encoder; b) quantizing the latent space by a quantizer to obtain a quantized latent space; c) entropy coding the quantized latent space by an entropy encoder to obtain a bitstream, wherein an entropy model used to encode the latent space is represented by a …
Who is the assignee on this patent?
Sisvel Tech S R L, Univ Degli Studi Di Torino, Inst Mines Telecom
What technology area does this patent fall under?
Primary CPC classification G06N3/0455. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Mar 24 2026 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).