Super resolution using a generative adversarial network

US11024009B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11024009-B2
Application numberUS-201715706428-A
CountryUS
Kind codeB2
Filing dateSep 15, 2017
Priority dateSep 15, 2016
Publication dateJun 1, 2021
Grant dateJun 1, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A neural network is trained to process received visual data to estimate a high-resolution version of the visual data using a training dataset and reference dataset. A set of training data is generated and a generator convolutional neural network parameterized by first weights and biases is trained by comparing characteristics of the training data to characteristics of the reference dataset. The first network is trained to generate super-resolved image data from low-resolution image data and the training includes modifying first weights and biases to optimize processed visual data based on the comparison between the characteristics of the training data and the characteristics of the reference dataset. A discriminator convolutional neural network parameterized by second weights and biases is trained by comparing characteristics of the generated super-resolved image data to characteristics of the reference dataset, and where the second network is trained to discriminate super-resolved image data from real image data.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for training a neural network to process at least a section of received low-resolution visual data to estimate a high-resolution version of the low-resolution visual data using a training dataset and reference dataset, the method comprising: (a) generating a set of training data; (b) training a generator convolutional neural network that is parameterized by first weights and biases by comparing one or more characteristics of the training data to one or more characteristics of at least a section of the reference dataset, wherein the generator convolutional neural network is trained to generate super-resolved image data from low-resolution image data and wherein the training includes modifying one or more of the first weights and biases of the generator convolutional neural network to optimize processed visual data based on the comparison between the one or more characteristics of the training data and the one or more characteristics of the reference dataset, wherein the modification is based on a perceptual loss function that includes a weighted combination of a content loss function, an adversarial loss function based on a discriminator network trained to differentiate between the super-resolved images and original photo-realistic images, and a regularization loss function that encourages spatially coherent solutions, wherein a first feature map of the neural network is generated by the neural network from the low-resolution image data and a second feature map of the neural network is generated from the super-resolved reference image data and wherein the content loss function is based on a Euclidean distance between the first and second feature maps, where similarities of the feature maps of the neural network are based on human notions of content similarity as determined based on object classification of training; and (c) training a discriminator convolutional neural network that is parameterized by second weights and biases by comparing one or more characteristics of the generated super-resolved image data to one or more characteristics of at least a section of the reference dataset, wherein the second network is trained to discriminate super-resolved image data from real image data. 2. The method of claim 1 , wherein the training dataset includes a plurality of visual images. 3. The method of claim 1 , wherein the reference dataset includes a plurality of visual images. 4. The method of claim 3 , wherein the plurality of visual images of the reference dataset are not increased quality versions of the visual data of the training dataset. 5. The method of claim 1 , further comprising: generating an estimated high-resolution version of an input image by using the trained convolutional neural network on the input image, wherein the trained convolutional neural network is configured to remove compression artifacts from the input image to generate the estimated high-resolution version of an input image. 6. The method of claim 1 , further comprising: generating an estimated high-resolution version of an input image by using the trained convolutional neural network on the input image, wherein the trained convolutional neural network is configured to perform image de-mosaicing on the input image to generate the estimated high-resolution version of an input image. 7. The method of claim 1 , further comprising: generating an estimated high-resolution version of an input image by using the trained convolutional neural network on the input image, wherein the trained convolutional neural network is configured to perform image de-noising on the input image to generate the estimated high-resolution version of an input image. 8. The method of claim 1 , wherein the generator convolutional neural network is hierarchical and includes a plurality of layers. 9. The method of claim 8 , wherein the layers are any of sequential, recurrent, recursive, branching, or merging. 10. The method of claim 1 , further comprising: iterating over (a), (b), and (c); and updating the training data during an iteration.

Assignees

Inventors

Classifications

  • G06N3/084Primary

    Backpropagation, e.g. using gradient descent · CPC title

  • Probabilistic or stochastic networks · CPC title

  • Combinations of networks · CPC title

  • Generative networks · CPC title

  • Adversarial learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11024009B2 cover?
A neural network is trained to process received visual data to estimate a high-resolution version of the visual data using a training dataset and reference dataset. A set of training data is generated and a generator convolutional neural network parameterized by first weights and biases is trained by comparing characteristics of the training data to characteristics of the reference dataset. The…
Who is the assignee on this patent?
Twitter Inc
What technology area does this patent fall under?
Primary CPC classification G06N3/084. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jun 01 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).