Committed information rate variational autoencoders

US11636283B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11636283-B2
Application numberUS-202016889125-A
CountryUS
Kind codeB2
Filing dateJun 1, 2020
Priority dateSep 27, 2018
Publication dateApr 25, 2023
Grant dateApr 25, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A variational autoencoder (VAE) neural network system, comprising an encoder neural network to encode an input data item to define a posterior distribution for a set of latent variables, and a decoder neural network to generate an output data item representing values of a set of latent variables sampled from the posterior distribution. The system is configured for training with an objective function including a term dependent on a difference between the posterior distribution and a prior distribution. The prior and posterior distributions are arranged so that they cannot be matched to one another. The VAE system may be used for compressing and decompressing data.

First claim

Opening claim text (preview).

The invention claimed is: 1. A system comprising one or more computers and one or more storage devices storing instructions that when executed by the one or more computers cause the one or more computers to implement a variational autoencoder neural network system, the variational autoencoder neural network system comprising: a subsystem configured to determine respective values of a set of latent variables for each of a plurality of time steps; and a decoder neural network configured to receive the respective values of the set of latent variables for the time steps and to generate an output data item representing the respective values of the set of latent variables, wherein the output data item comprises a sequence of output data item values that each correspond to a respective one of the plurality of time steps, and wherein the decoder neural network is an autoregressive neural network that is configured to generate each output data item value in the sequence conditional upon (i) any previously generated output data item values in the sequence and (ii) only the values of the set of latent variables for time steps that correspond to output data item values that have yet to be generated. 2. The system of claim 1 , wherein the variational autoencoder neural network system further comprises: an encoder neural network configured to encode an input data item to determine a sequence comprising a respective set of parameters for each of a plurality of time steps that defines a respective posterior distribution for the set of latent variables for the time step, wherein determining respective values of a set of latent variables for each of the plurality of time steps comprises sampling from the respective posterior distributions for the sets of latent variables for the time steps. 3. The system of claim 2 , wherein the variational autoencoder neural network system is configured for training with an objective function which has a first term dependent upon a difference between the input data item and the output data item and a second term dependent upon a difference between the respective posterior distributions and a set of second, prior distributions of the set of latent variables, and wherein a structure of the prior distributions is different to a structure of the posterior distributions such that the posterior distributions cannot be matched to the prior distributions. 4. The system of claim 3 wherein each posterior distribution and each prior distribution each comprise a multivariate Gaussian distribution and wherein a variance of the posterior distributions is a factor of a different to a variance of the prior distributions, where α≠1. 5. The system of claim 3 wherein the prior distribution comprises an autoregressive distribution such that at each time step the prior distribution depends on the prior distribution at a previous time step. 6. The system of claim 5 wherein the respective prior distribution at each time step t is defined by a sum of a times the values of the set of latent variables at a previous time step t−1 and a noise component, where |α|<1. 7. The system as claimed in claim 1 wherein determining respective values of a set of latent variables for each of a plurality of time steps comprises, for each time step, sampling the values of the set of latent variables for the time step from a prior distribution for the time step. 8. The system of claim 1 , wherein determining respective values of a set of latent variables for each of a plurality of time steps comprises, for each time step, sampling the values of the set of latent variables for the time step from an auxiliary prior distribution for the time step generated by an auxiliary neural network. 9. The system of claim 8 , wherein the auxiliary neural network has been trained jointly with the decoder neural network and an encoder neural network. 10. One or more non-transitory computer-readable storage media storing instructions that when executed by one or more computers cause the one or more computers to implement a variational autoencoder neural network system, the variational autoencoder neural network system comprising: a subsystem configured to determine respective values of a set of latent variables for each of a plurality of time steps; and a decoder neural network configured to receive the respective values of the set of latent variables for the time steps and to generate an output data item representing the values of the set of latent variables, wherein the output data item comprises a sequence of output data item values that each correspond to a respective one of the plurality of time steps, and wherein the decoder neural network is an autoregressive neural network that is configured to generate each output data item value in the sequence conditional upon (i) any previously generated output data item values in the sequence and (ii) only the values of the set of latent variables for time steps that correspond to output data item values that have yet to be generated. 11. The non-transitory computer-readable storage media of claim 10 , wherein the variational autoencoder neural network system further comprises: an encoder neural network configured to encode an input data item to determine a sequence comprising a respective set of parameters for each of a plurality of time steps that defines a respective posterior distribution for the set of latent variables for the time step, wherein determining respective values of a set of latent variables for each of the plurality of time steps comprises sampling from the respective posterior distributions for the sets of latent variables for the time steps. 12. A method performed by one or more computers, the method comprising: determining respective values of a set of latent variables for each of a plurality of time steps; and processing the respective values of the set of latent variables for each of the plurality of time steps using a decoder neural network configured to receive the respective values of the set of latent variables for the time steps and to generate an output data item representing the values of the set of latent variables, wherein the output data item comprises a sequence of output data item values that each correspond to a respective one of the plurality of time steps, and wherein the decoder neural network is an autoregressive neural network that is configured to generate each output data item value in the sequence conditional upon (i) any previously generated output data item values in the sequence and (ii) only the values of the set of latent variables for time steps that correspond to output data item values that have yet to be generated. 13. The method of claim 12 , wherein determining respective values of a set of latent variables for each of the plurality of time steps comprises: processing an input data item using an encoder neural network configured to encode the input data item to determine a sequence comprising a respective set of parameters for each of a plurality of time steps that defines a respective posterior distribution for the set of latent variables for the time step, and sampling from the respective posterior distributions for the sets of latent variables for the time steps to determine the respective values of the set of latent variables for the time steps. 14. The method of claim 13 , further comprising training the encoder neural network and the decoder neural network to optimize an objective function which has a first term dependent upon a difference between the input data item and the output data item and a second term dependent upon a difference between the respective posterior

Assignees

Inventors

Classifications

  • Weakly supervised learning, e.g. semi-supervised or self-supervised learning · CPC title

  • characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU] · CPC title

  • Auto-encoder networks; Encoder-decoder networks · CPC title

  • Convolutional networks [CNN, ConvNet] · CPC title

  • Generative networks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11636283B2 cover?
A variational autoencoder (VAE) neural network system, comprising an encoder neural network to encode an input data item to define a posterior distribution for a set of latent variables, and a decoder neural network to generate an output data item representing values of a set of latent variables sampled from the posterior distribution. The system is configured for training with an objective fun…
Who is the assignee on this patent?
Deepmind Tech Ltd
What technology area does this patent fall under?
Primary CPC classification G06N3/084. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 25 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).