System, method, and computer program for content adaptive online training for multiple blocks in neural image compression

US12347149B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12347149-B2
Application numberUS-202217950569-A
CountryUS
Kind codeB2
Filing dateSep 22, 2022
Priority dateDec 13, 2021
Publication dateJul 1, 2025
Grant dateJul 1, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Content-adaptive online training for end-to-end (E2E) neural image compression (NIC) using a neural network performed by at least one processor, is provided, including receiving an input image, to an E2E NIC framework, including one or more blocks, preprocessing a first neural network of the E2E NIC framework, based on the one or more blocks, computing updated parameters using the preprocessed first neural network, encoding the one or more blocks and the updated parameters, updating the first neural network based on the encoded updated parameters, and generating a compressed representation of the encoded one or more blocks using the updated first neural network.

First claim

Opening claim text (preview).

What is claims is: 1. A method of content-adaptive online training for end-to-end (E2E) neural image compression (NIC) using a neural network performed by at least one processor, the method comprising: receiving, by an E2E NIC framework, an input image including one or more blocks; preprocessing a first neural network of the E2E NIC framework, based on the one or more blocks; computing updated parameters using the preprocessed first neural network, wherein the updated parameters include a learning rate and a number of steps, and wherein the learning rate and the number of steps are selected based on characteristics of the input image; encoding the one or more blocks and the updated parameters; updating the first neural network based on the encoded updated parameters; and generating a compressed representation of the encoded one or more blocks using the updated first neural network. 2. The method according to claim 1 , further comprising: splitting the input image into the one or more blocks; and compressing the one or more blocks individually. 3. The method according to claim 1 , further comprising; decoding the compressed representation using arithmetic decoding; and generating a reconstructed image based on the decoded compressed representation using a second neural network. 4. The method according to claim 1 , further comprising compressing the updated parameters. 5. The method according to claim 1 , wherein the characteristics of the input image are one of a RGB variance of the input image and an RD performance of the input image. 6. The method according to claim 1 , wherein when preprocessing the first neural network, the first neural network is fine-tuned using the one or more blocks. 7. An apparatus for content-adaptive online training for end-to-end (E2E) neural image compression (NIC) using a neural network, the apparatus comprising: at least one memory configured to store computer program code; and at least one processor configured to read the computer program code and operate as instructed by the computer program code, the computer program code including: receiving code configured to cause the at least one processor to receive, by an E2E NIC framework, an input including one or more blocks; preprocessing code configured to cause the at least one processor to preprocess a first neural network of the E2E NIC framework, based on the one or more blocks; computing code configured to cause the at least one processor to compute updated parameters using the preprocessed first neural network, wherein the updated parameters include a learning rate and a number of steps, and wherein the learning rate and the number of steps are selected based on characteristics of the input image; encoding code configured to cause the at least one processor to encode the one or more blocks and the updated parameters; updating code configured to cause the at least one processor to update the first neural network based on the encoded updated parameters; and first generating code configured to cause the at least one processor to generate a compressed representation of the encoded one or more blocks using the updated first neural network. 8. The apparatus of claim 7 , the computer program code further including: splitting code configured to cause the at least one processor to split the input image into the one or more blocks; and compressing code configured to cause the at least one processor to compress the one or more blocks individually. 9. The apparatus of claim 7 , the computer program code further including: decoding code configured to cause the at least one processor to decode the compressed representation using arithmetic decoding; and second generating code configured to cause the at least one processor to generate a reconstructed image based on the decoded compressed representation using a second neural network. 10. The apparatus of claim 7 , the computer program code further including compressing code configured to cause the at least one processor to compress the updated parameters. 11. The apparatus of claim 7 , wherein the characteristics of the input image are one of a RGB variance of the input image and an RD performance of the input image. 12. The apparatus of claim 7 , wherein when preprocessing the first neural network, the first neural network is fine-tuned using the one or more blocks. 13. A non-transitory computer-readable medium storing instructions that, when executed by at least one processor of an apparatus for content-adaptive online training for end-to-end (E2E) neural image compression (NIC) using a neural network, cause the at least one processor to: receive, by an E2E NIC framework, an input image including one or more blocks; preprocess a first neural network of the E2E NIC framework, based on the one or more blocks; compute updated parameters using the preprocessed first neural network; encode the one or more blocks and the updated parameters, wherein the updated parameters include a learning rate and a number of steps, and wherein the learning rate and the number of steps are selected based on characteristics of the input image; update the first neural network based on the encoded updated parameters; and generate a compressed representation of the encoded one or more blocks using the updated first neural network. 14. The non-transitory computer-readable medium of claim 13 , wherein the instructions further cause the at least one processor to: split the input image into the one or more blocks; and compress the one or more blocks individually. 15. The non-transitory computer-readable medium of claim 13 , wherein the instructions further cause the at least one processor to: decode the compressed representation using arithmetic decoding; and generate a reconstructed image based on the decoded compressed representation using a second neural network. 16. The non-transitory computer-readable medium of claim 13 , wherein the instructions further cause the at least one processor to compress the updated parameters. 17. The non-transitory computer-readable medium of claim 13 , wherein the characteristics of the input image are one of a RGB variance of the input image and an RD performance of the input image. 18. The non-transitory computer-readable medium of claim 13 , wherein when preprocessing the first neural network, the first neural network is fine-tuned using the one or more blocks.

Assignees

Inventors

Classifications

  • Combinations of networks · CPC title

  • using neural networks · CPC title

  • Artificial neural networks [ANN] · CPC title

  • G06T9/002Primary

    using neural networks · CPC title

  • the region being a block, e.g. a macroblock · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12347149B2 cover?
Content-adaptive online training for end-to-end (E2E) neural image compression (NIC) using a neural network performed by at least one processor, is provided, including receiving an input image, to an E2E NIC framework, including one or more blocks, preprocessing a first neural network of the E2E NIC framework, based on the one or more blocks, computing updated parameters using the preprocessed …
Who is the assignee on this patent?
Tencent America LLC
What technology area does this patent fall under?
Primary CPC classification G06T9/002. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 01 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).