Training a data coding system comprising a feature extractor neural network

US12170779B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12170779-B2
Application numberUS-202117917153-A
CountryUS
Kind codeB2
Filing dateMar 30, 2021
Priority dateApr 9, 2020
Publication dateDec 17, 2024
Grant dateDec 17, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Example embodiments provide a system for training a data coding pipeline including a feature extractor neural network, an encoder neural network, and a decoder neural network configured to reconstruct input data based on encoded features. A plurality of losses corresponding to different tasks may be determined for the coding pipeline. Tasks may be performed based on an output of the coding pipeline. A weight update may be determined for at least a subset of the coding pipeline based on the plurality of losses. The weight update may be configured to reduce a number of iterations for fine-tuning the coding pipeline for one of the tasks. This enables faster adaptation of the coding pipeline for one of the tasks after deployment of the coding pipeline. Apparatuses, methods, and computer programs are disclosed. Apparatuses, methods, and computer programs are disclosed.

First claim

Opening claim text (preview).

The invention claimed is: 1. An apparatus, comprising: at least one processor; and at least one memory storing instructions that, when executed by the at least one processor, cause the apparatus to: obtain a data coding pipeline comprising a feature extractor neural network configured to extract features from input data, an encoder neural network configured to encode extracted features from the feature extractor neural network, and a decoder neural network configured to reconstruct the input data based on output of the encoder neural network; determine a plurality of losses for the coding pipeline, the plurality of losses corresponding to at least a plurality of tasks; determine a weight update for at least a subset of the coding pipeline based on the plurality of losses, wherein the weight update is configured to reduce a number of iterations for fine-tuning the coding pipeline for at least one of the plurality of tasks; and update at least the subset of the coding pipeline based on the weight update. 2. The apparatus according to claim 1 , wherein the at least one memory further stores instructions that, when executed by the at least one processor, a cause the apparatus to: determine the weight update based on an average performance of the plurality of tasks. 3. The apparatus according to claim 1 , wherein the at least one memory further stores instructions that, when executed by the at least one processor, a cause the apparatus to: receive the input data for encoding; extract features form the input data; encode the extracted features with at least the encoder neural network to obtain an encoded bitstream; and transmit the encoded bitstream to a decoder device. 4. The apparatus according to claim 1 , wherein the at least one memory further stores instructions that, when executed by the at least one processor, a cause the apparatus to: fine-tune the feature extractor neural network based on the plurality of losses corresponding to at least the plurality of tasks. 5. The apparatus according to claim 4 , wherein the at least one memory further stores instructions that, when executed by the at least one processor, a cause the apparatus to: fine-tune the encoder neural network based on the plurality of losses corresponding to at least the plurality of tasks. 6. The apparatus according to claim 4 , wherein the at least one memory further stores instructions that, when executed by the at least one processor, a cause the apparatus to: fine-tune a plurality of task neural networks configured to perform the plurality of tasks based on the plurality of losses. 7. The apparatus according to claim 6 , wherein fine-tuning the plurality of task neural networks is based on a compression term configured to reduce entropy of a second plurality of weight updates associated with the plurality of task neural networks. 8. The apparatus according to claim 7 , wherein the at least one memory further stores instructions that, when executed by the at least one processor, a cause the apparatus to: compress the second plurality of weight updates; and/or transmit the second plurality of weight updates to a decoder device or at least one client encoder device. 9. The apparatus according to claim 4 , wherein the at least one memory further stores instructions that, when executed by the at least one processor, a cause the apparatus to: determine the plurality of losses based on outputs of a plurality of task neural networks and ground-truth data for the plurality of tasks. 10. The apparatus according to claim 4 , wherein the at least one memory further stores instructions that, when executed by the at least one processor cause the apparatus at least to: compute the plurality of losses based on outputs of a plurality of task neural networks and outputs of a plurality of reference neural networks configured to perform the plurality of tasks based on the input data. 11. The apparatus according to claim 1 , wherein the at least one memory further stores instructions that, when executed by the at least one processor, a cause the apparatus to: encode at least one portion of the input data; and transmit the encoded at least one portion of the data to a decoder device. 12. An apparatus, comprising: at least one processor; and at least one memory including storing instructions that, when executed by the at least one processor, cause the apparatus to perform: receive encoded features of input data; receive at least one encoded portion of the input data; decode the at least one encoded portion of the input data to form a decoded portion of the input data; extract features from the decoded portion of the input data with a feature extractor neural network; determine, with a decoder neural network, decoded features based on the encoded features; perform a plurality of tasks with a plurality of task neural networks based on the decoded features; perform the plurality of tasks with a plurality of reference task neural networks based on the extracted features, wherein the plurality of task neural networks are independent of the plurality of reference task neural networks; determine a plurality of losses based on outputs of the plurality of task neural networks and outputs of the plurality of reference task neural networks; determine a weight update based on the plurality of losses; and fine-tune the feature extractor neural network based on the weight update, wherein the weight update is configured to reduce a number of iterations for the fine-tuning. 13. The apparatus according to claim 12 , wherein the at least one memory further stores instructions that, when executed by the at least one processor, cause the apparatus to: compress the weight update; and/or transmit the weight update to an encoder device. 14. A method, comprising: obtaining a data coding pipeline comprising a feature extractor neural network configured to extract features from input data, an encoder neural network configured to encode extracted features from the feature extractor neural network, and a decoder neural network configured to reconstruct the input data based on output of the encoder neural network; determining a plurality of losses for the coding pipeline, the plurality of losses corresponding to at least a plurality of tasks; determining a weight update for at least a subset of the coding pipeline based on the plurality of losses, wherein the weight update is configured to reduce a number of iterations for fine-tuning the coding pipeline for at least one of the plurality of tasks; and updating at least the subset of the coding pipeline based on the weight update. 15. The method according to claim 14 , further comprising: fine-tuning the feature extractor neural network based on the plurality of losses corresponding to at least the plurality of tasks. 16. The method according to claim 15 , further comprising: determining the plurality of losses based on outputs of a plurality of task neural networks and ground-truth data for the plurality of tasks. 17. The method according to claim 15 , further comprising: computing the plurality of losses based on outputs of a plurality of task neural networks and outputs of a plurality of reference neural networks configured to perform the plurality of tasks based on the input data. 18. A method, comprising: receiving encoded features of input data; receiving at least one encoded portion of the input data; decoding the at least one encoded portion of the input data to form a decoded portion of the input data; extr

Assignees

Inventors

Classifications

  • Convolutional networks [CNN, ConvNet] · CPC title

  • Quantised networks; Sparse networks; Compressed networks · CPC title

  • Supervised learning · CPC title

  • Hyperparameter optimisation; Meta-learning; Learning-to-learn · CPC title

  • Auto-encoder networks; Encoder-decoder networks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12170779B2 cover?
Example embodiments provide a system for training a data coding pipeline including a feature extractor neural network, an encoder neural network, and a decoder neural network configured to reconstruct input data based on encoded features. A plurality of losses corresponding to different tasks may be determined for the coding pipeline. Tasks may be performed based on an output of the coding pipe…
Who is the assignee on this patent?
Nokia Technologies Oy
What technology area does this patent fall under?
Primary CPC classification H04N19/146. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Dec 17 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).