Recurrent neural networks for data item generation

US11080587B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11080587-B2
Application numberUS-201615016160-A
CountryUS
Kind codeB2
Filing dateFeb 4, 2016
Priority dateFeb 6, 2015
Publication dateAug 3, 2021
Grant dateAug 3, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods, and systems, including computer programs encoded on computer storage media for generating data items. A method includes reading a glimpse from a data item using a decoder hidden state vector of a decoder for a preceding time step, providing, as input to a encoder, the glimpse and decoder hidden state vector for the preceding time step for processing, receiving, as output from the encoder, a generated encoder hidden state vector for the time step, generating a decoder input from the generated encoder hidden state vector, providing the decoder input to the decoder for processing, receiving, as output from the decoder, a generated a decoder hidden state vector for the time step, generating a neural network output update from the decoder hidden state vector for the time step, and combining the neural network output update with a current neural network output to generate an updated neural network output.

First claim

Opening claim text (preview).

What is claimed is: 1. A neural network system implemented by one or more computers, the neural network system comprising: an encoder neural network, wherein the encoder neural network is a recurrent neural network that is configured to, for each input image processed by the encoder neural network and at each time step of a plurality steps: receive a glimpse captured by reading from the input image, receive a decoder hidden state vector of a decoder neural network for the preceding time step, and process the glimpse, the decoder hidden state vector, and an encoder hidden state vector of the encoder neural network from the preceding time step to generate an encoder hidden state vector for the time step; a decoder neural network, wherein the decoder neural network is a recurrent neural network that is configured to, for each of the plurality of time steps: receive a decoder input for the time step, and process the decoder hidden state vector for the preceding time step and the decoder input to generate a decoder hidden state vector for the time step; and a subsystem, wherein the subsystem is configured to generate a final output image by repeatedly updating a neural network output at each of a plurality of time steps based on an input image to generate a final neural network output and generating the final output image from the final neural network output, the updating comprising, for each of the time steps: reading the glimpse from the input image using the decoder hidden state vector for the preceding time step; providing the glimpse as input to the encoder neural network; generating the encoder hidden state vector at the time step as output from the encoder neural network; generating the decoder input for the decoder neural network from the encoder hidden state vector at the time step; providing the decoder input as input to the decoder neural network for the time step; generating a neural network output update for the time step from the decoder hidden state vector for the time step; and combining the neural network output update for the time step with a current neural network output to generate an updated neural network output. 2. The neural network system of claim 1 , wherein the encoder neural network and the decoder neural network are long short term memory neural networks. 3. The neural network system of claim 1 , wherein the subsystem is further configured to train the encoder neural network and the decoder neural network to autoencode input images. 4. The neural network system of claim 3 , wherein training the encoder neural network and the decoder neural network to autoencode input images comprises training the neural networks to generate a neural network output image that is a reconstruction of the input image. 5. The neural network system of claim 1 , wherein the subsystem is further configured to provide the encoder hidden state vectors from each of the time steps for a particular image as features of the particular image. 6. The neural network system of claim 5 , wherein the features of the particular image are provided for use in processing the particular image during a semi-supervising learning procedure. 7. The neural network system of claim 1 , wherein in the input data items are image frames from videos. 8. The neural network system of claim 1 , wherein the glimpse captured by reading from the input image is an image patch generated by applying an array of Gaussian filters to the image. 9. The neural network system of claim 8 , wherein the parameters for applying the array of Gaussian filters are generated by applying a linear transformation to the decoder hidden state vector for the preceding time step. 10. The neural network system of claim 1 , wherein generating the decoder input for the decoder neural network from the encoder hidden state vector at the time step comprises: using the encoder hidden state vector for the time step to parameterize a distribution of a latent vector; and sampling the decoder input from the distribution. 11. The neural network system of claim 1 , wherein the subsystem is further configured to cause the decoder neural network to process a sequence of inputs sampled from a prior distribution to generate a new output image. 12. A computer implemented method for processing an input image through a recurrent encoder neural network and a recurrent decoder neural network to generate a final output image by repeatedly updating a neural network output at each of a plurality of time steps based on an input image to generate a final neural network output and generating the final output image from the final neural network output, the updating comprising, for each of the time step: reading a glimpse from the input image using a decoder hidden state vector of the decoder neural network for the preceding time step; providing, as input to the encoder neural network, the (i) glimpse and (ii) decoder hidden state vector for the preceding time step for processing; receiving, as output from the encoder neural network, a generated encoder hidden state vector for the time step; generating a decoder input for the decoder neural network from the generated encoder hidden state vector at the time step; providing, as input to the decoder neural network, the decoder input for processing; receiving, as output from the decoder neural network, a generated a decoder hidden state vector for the time step; generating a neural network output update for the time step from the decoder hidden state vector for the time step; and combining the neural network output update for the time step with a current neural network output to generate an updated neural network output. 13. The method of claim 12 , further comprising providing the encoder hidden state vectors from each of the time steps for a particular image as features of the particular image. 14. The method of claim 12 , wherein generating the decoder input for the decoder neural network from the encoder hidden state vector at the time step comprises: using the encoder hidden state vector for the time step to parameterize a distribution of a latent vector; and sampling the decoder input from the distribution. 15. The method of claim 12 , further comprising causing the decoder neural network to process a sequence of inputs sampled from a prior distribution to generate a new output image. 16. A computer storage medium encoded with instructions that, when executed by one or more computers, cause one or more computers to implement a neural network system, the neural network system comprising: an encoder neural network, wherein the encoder neural network is a recurrent neural network that is configured to, for each input image processed by the encoder neural network and at each time step of a plurality steps: receive a glimpse captured by reading from the input image, receive a decoder hidden state vector of a decoder neural network for the preceding time step, and process the glimpse, the decoder hidden state vector, and an encoder hidden state vector of the encoder neural network from the preceding time step to generate an encoder hidden state vector for the time step; a decoder neural network, wherein the decoder neural network is a recurrent neural network that is configured to, for each of the plurality of time steps: receive a decoder input for the time step, and process the decoder hidden state vector for the preceding time step and the decoder input to generate a decoder hidden state vector for the time step; and a subsystem, wherein the subsystem is configured to generate a final outp

Assignees

Inventors

Classifications

  • G06N3/044Primary

    Recurrent networks, e.g. Hopfield networks · CPC title

  • Combinations of networks · CPC title

  • characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU] · CPC title

  • G06N3/0455Primary

    Auto-encoder networks; Encoder-decoder networks · CPC title

  • Weakly supervised learning, e.g. semi-supervised or self-supervised learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11080587B2 cover?
Methods, and systems, including computer programs encoded on computer storage media for generating data items. A method includes reading a glimpse from a data item using a decoder hidden state vector of a decoder for a preceding time step, providing, as input to a encoder, the glimpse and decoder hidden state vector for the preceding time step for processing, receiving, as output from the encod…
Who is the assignee on this patent?
Deepmind Tech Ltd
What technology area does this patent fall under?
Primary CPC classification G06N3/044. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 03 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).