Generative neural networks

US10176424B2 · US · B2

Patent metadata
Field: Value
Publication number: US-10176424-B2
Application number: US-201715424708-A
Country: US
Kind code: B2
Filing date: Feb 3, 2017
Priority date: Feb 5, 2016
Publication date: Jan 8, 2019
Grant date: Jan 8, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for a neural network system. In one aspect, a neural network system includes a recurrent neural network that is configured to, for each time step of a predetermined number of time steps, receive a set of latent variables for the time step and process the latent variables to update a hidden state of the recurrent neural network; and a generative subsystem that is configured to, for each time step, generate the set of latent variables for the time step and provide the set of latent variables as input to the recurrent neural network; update a hidden canvas using the updated hidden state of the recurrent neural network; and, for a last time step, generate an output image using the updated hidden canvas for the last time step.
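The loop described in the abstract can be sketched in a few lines of plain Python. This is only an illustrative sketch under stated assumptions, not the patented implementation: the weights are random and untrained, the recurrent network is a plain tanh cell, the prior is a standard normal, the canvas update is additive, and the output is sampled from per-pixel Bernoulli distributions. All function and parameter names (`generate_image`, `w_latent`, `w_write`, and so on) are hypothetical.

```python
import math
import random


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def random_matrix(rows, cols, rng, scale=0.1):
    """Stand-in for learned weights: small Gaussian entries (assumption)."""
    return [[rng.gauss(0.0, scale) for _ in range(cols)] for _ in range(rows)]


def generate_image(num_steps=8, latent_dim=4, hidden_dim=16, num_pixels=64, seed=0):
    """Run the generative loop for a fixed number of time steps.

    Returns a flat list of binary pixel values.
    """
    rng = random.Random(seed)
    # Hypothetical, untrained parameters.
    w_latent = random_matrix(hidden_dim, latent_dim, rng)  # latents -> hidden
    w_hidden = random_matrix(hidden_dim, hidden_dim, rng)  # hidden -> hidden
    w_write = random_matrix(num_pixels, hidden_dim, rng)   # hidden -> write

    hidden = [0.0] * hidden_dim
    canvas = [0.0] * num_pixels

    for _ in range(num_steps):
        # Generative subsystem: sample the latent variables for this step
        # from a standard normal prior (assumed form of the prior).
        latents = [rng.gauss(0.0, 1.0) for _ in range(latent_dim)]

        # Recurrent network: process the latents to update the hidden state.
        pre = [a + b for a, b in zip(matvec(w_latent, latents),
                                     matvec(w_hidden, hidden))]
        hidden = [math.tanh(v) for v in pre]

        # Update the hidden canvas using the updated hidden state
        # (additive update, one of the options named in the claims).
        write = matvec(w_write, hidden)
        canvas = [c + w for c, w in zip(canvas, write)]

    # Last time step: map the canvas to Bernoulli means with a sigmoid
    # (an assumed observation function) and sample the output image.
    means = [1.0 / (1.0 + math.exp(-c)) for c in canvas]
    return [1 if rng.random() < m else 0 for m in means]
```

Calling `generate_image(num_steps=4, num_pixels=16)` returns a list of 16 zeros and ones. With untrained weights the result is noise; the point is only the order of operations within each time step.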

First claim

Opening claim text (preview).

What is claimed is:

1. A neural network system implemented by one or more computers, the neural network system comprising: a recurrent neural network that is configured to, for each time step of a predetermined number of time steps, receive a set of latent variables for the time step and process the set of latent variables to update a hidden state of the recurrent neural network; and a generative subsystem that is configured to: for each time step of the predetermined number of time steps: generate the set of latent variables for the time step and provide the set of latent variables as input to the recurrent neural network; update a hidden canvas using the updated hidden state of the recurrent neural network; and for a last time step of the predetermined number of time steps: generate an output image using the updated hidden canvas for the last time step.

2. The neural network system of claim 1, wherein the recurrent neural network is a long short-term memory (LSTM) neural network.

3. The neural network system of claim 1, wherein updating the hidden canvas using the updated hidden state of the recurrent neural network comprises: generating a write for the time step by processing the hidden state using a writing mechanism; and applying a canvas function to the write to combine the write with the hidden canvas to generate the updated hidden canvas for the time step.

4. The neural network system of claim 3, wherein the writing mechanism is an attention-based writing mechanism.

5. The neural network system of claim 4, wherein the attention-based writing mechanism uses a spatial transformers function.

6. The neural network system of claim 3, wherein the canvas function is an additive canvas function.

7. The neural network system of claim 3, wherein the canvas function is a gated recurrent canvas function.

8. The neural network system of claim 1, wherein generating an output image using the updated hidden canvas for the last time step comprises: mapping the updated hidden canvas for the last time step to sufficient statistics of a likelihood model using an observation function; and generating the output image by sampling from the likelihood model.

9. The neural network system of claim 1, wherein generating the set of latent variables for the time step comprises sampling the latent variables from a prior distribution.

10. The neural network system of claim 1, wherein the output image is a reconstruction of an input image, and wherein the neural network system further comprises: an inference subsystem, wherein the inference subsystem is configured to, for each of the predetermined number of time steps: generate a sprite representation of the input image by applying a reading function to the input image and the hidden state of the recurrent neural network; determine parameters of a distribution of latent variables using the sprite representation and the hidden state of the recurrent neural network; and sample the set of latent variables for the time step from the distribution in accordance with the determined parameters.

11. A method comprising: at a generative subsystem of a neural network system implemented by one or more computers, for each time step of a predetermined number of time steps: receiving, from a recurrent neural network of the neural network system, an updated hidden state of the recurrent neural network; generating a set of latent variables for the time step and providing the set of latent variables as input to the recurrent neural network; and updating a hidden canvas using the updated hidden state of the recurrent neural network; and for a last time step of the predetermined number of time steps, generating an output image using the updated hidden canvas for the last time step.

12. The method of claim 11, wherein the recurrent neural network is a long short-term memory (LSTM) neural network.

13. The method of claim 11, wherein updating the hidden canvas using the updated hidden state of the recurrent neural network comprises: generating a write for the time step by processing the hidden state using a writing mechanism; and applying a canvas function to the write to combine the write with the hidden canvas to generate the updated hidden canvas for the time step.

14. The method of claim 13, wherein the writing mechanism is an attention-based writing mechanism.

15. The method of claim 14, wherein the attention-based writing mechanism uses a spatial transformers function.

16. The method of claim 13, wherein the canvas function is an additive canvas function or is a gated recurrent canvas function.

17. The method of claim 11, wherein generating an output image using the updated hidden canvas for the last time step comprises: mapping the updated hidden canvas for the last time step to sufficient statistics of a likelihood model using an observation function; and generating the output image by sampling from the likelihood model.

18. The method of claim 11, wherein the output image is a reconstruction of an input image, and wherein the method further comprises, at an inference subsystem of the neural network system, generating a sprite representation of the input image by applying a reading function to the input image and the hidden state of the recurrent neural network; determining parameters of a distribution of latent variables using the sprite representation and the hidden state of the recurrent neural network; and sampling the set of latent variables for the time step from the distribution in accordance with the determined parameters.

19. A non-transitory computer storage medium encoded with instructions that, when executed by one or more computers, cause the one or more computers to perform operations comprising: at a generative subsystem of a neural network system implemented by one or more computers, for each time step of a predetermined number of time steps: receiving, from a recurrent neural network of the neural network system, an updated hidden state of the recurrent neural network; generating a set of latent variables for the time step and providing the set of latent variables as input to the recurrent neural network; and updating a hidden canvas using the updated hidden state of the recurrent neural network; and for a last time step of the predetermined number of time steps, generating an output image using the updated hidden canvas for the last time step.

20. The non-transitory computer storage medium of claim 19, wherein the output image is a reconstruction of an input image, and wherein the operations further comprise, at an inference subsystem of the neural network system, generating a sprite representation of the input image by applying a reading function to the input image and the hidden state of the recurrent neural network; determining parameters of a distribution of latent variables using the sprite representation and the hidden state of the recurrent neural network; and sampling the set of latent variables for the time step from the distribution in accordance with the determined parameters.
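Two parts of the dependent claims lend themselves to a short illustration: the canvas functions (additive versus gated) and the inference step that samples latent variables from a distribution conditioned on an input image. The sketch below is hypothetical throughout. The gated update is a simple sigmoid blend rather than the patent's exact gated recurrent formulation, the reading function is an average-pooling stand-in, the latent distribution is assumed to be a diagonal Gaussian, and every name (`read_sprite`, `inference_step`, `w_mean`, `w_log_std`) is invented for this example.

```python
import math
import random


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def additive_canvas(canvas, write):
    """Additive canvas function: add the write to the canvas element-wise."""
    return [c + w for c, w in zip(canvas, write)]


def gated_canvas(canvas, write, gate_logits):
    """Simplified gated update: a sigmoid gate blends the old canvas and the write."""
    gates = [1.0 / (1.0 + math.exp(-g)) for g in gate_logits]
    return [(1.0 - g) * c + g * w for c, w, g in zip(canvas, write, gates)]


def read_sprite(image, hidden, sprite_size):
    """Hypothetical reading function.

    Average-pools the flat image into `sprite_size` chunks and scales the
    result by a gain derived from the hidden state.
    """
    gain = 1.0 + math.tanh(sum(hidden))
    chunk = len(image) // sprite_size
    return [gain * sum(image[i * chunk:(i + 1) * chunk]) / chunk
            for i in range(sprite_size)]


def inference_step(image, hidden, w_mean, w_log_std, rng, sprite_size=4):
    """One inference step: sprite -> distribution parameters -> sampled latents."""
    sprite = read_sprite(image, hidden, sprite_size)
    features = sprite + hidden  # concatenate sprite and hidden state
    # Parameters of an assumed diagonal Gaussian over the latents.
    mean = matvec(w_mean, features)
    log_std = matvec(w_log_std, features)
    # Sample the latents in accordance with the determined parameters.
    latents = [m + math.exp(s) * rng.gauss(0.0, 1.0)
               for m, s in zip(mean, log_std)]
    return latents, mean, log_std
```

In a full system the sampled latents would be fed to the recurrent network in place of prior samples, so that the final canvas reconstructs the input image. The weight matrices here must have one column per element of the concatenated sprite and hidden state.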

Assignees

  • Deepmind Tech Ltd

Inventors

Classifications

  • G06N3/084 · Primary

    Backpropagation, e.g. using gradient descent · CPC title

  • Probabilistic or stochastic networks · CPC title

  • Recurrent networks, e.g. Hopfield networks · CPC title

  • characterised by the process organisation or structure, e.g. boosting cascade · CPC title

  • Combinations of networks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10176424B2 cover?
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for a neural network system. In one aspect, a neural network system includes a recurrent neural network that is configured to, for each time step of a predetermined number of time steps, receive a set of latent variables for the time step and process the latent variables to update a hidden state o…
Who is the assignee on this patent?
Deepmind Tech Ltd
What technology area does this patent fall under?
Primary CPC classification G06N3/084. Mapped technology areas include Physics.
When was this patent published?
Publication date: Jan 8, 2019 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 9 related publications on this page (citations in our corpus or others sharing the same primary CPC).