What technology area does this patent fall under?

Primary CPC classification G06T5/002. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Sep 26 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Image enhancement via iterative refinement based on machine learning models

US11769228B2 · US · B2

Patent metadata
Field	Value
Publication number	US-11769228-B2
Application number	US-202117391150-A
Country	US
Kind code	B2
Filing date	Aug 2, 2021
Priority date	Aug 2, 2021
Publication date	Sep 26, 2023
Grant date	Sep 26, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method includes receiving, by a computing device, training data comprising a plurality of pairs of images, wherein each pair comprises an image and at least one corresponding target version of the image. The method also includes training a neural network based on the training data to predict an enhanced version of an input image, wherein the training of the neural network comprises applying a forward Gaussian diffusion process that adds Gaussian noise to the at least one corresponding target version of each of the plurality of pairs of images to enable iterative denoising of the input image, wherein the iterative denoising is based on a reverse Markov chain associated with the forward Gaussian diffusion process. The method additionally includes outputting the trained neural network.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method, comprising: receiving, by a computing device, training data comprising a plurality of pairs of images, wherein each pair comprises an image and at least one corresponding target version of the image; training a neural network based on the training data to predict an enhanced version of an input image, wherein the training of the neural network comprises applying a forward Gaussian diffusion process that adds Gaussian noise to the at least one corresponding target version of each of the plurality of pairs of images to enable iterative denoising of the input image, wherein the iterative denoising is based on a reverse Markov chain associated with the forward Gaussian diffusion process; and outputting the trained neural network. 2. The computer-implemented method of claim 1 , wherein the applying of the forward Gaussian diffusion process comprises determining, for an iterative step, a scalar hyperparameter indicative of a variance of the Gaussian noise at the iterative step. 3. The computer-implemented method of claim 1 , wherein the iterative denoising of the input image comprises predicting a noise vector based on a variance of the Gaussian noise added during the forward Gaussian process. 4. The computer-implemented method of claim 1 , wherein the neural network is a convolutional neural network comprising a U-net architecture based on a denoising diffusion probabilistic (DDPM) model. 5. The computer-implemented method of claim 1 , wherein the input image is of a first resolution and the enhanced version of the input image is of a second resolution, and wherein the second resolution is greater than the first resolution. 6. The computer-implemented method of claim 1 , wherein the iterative denoising of the input image further comprises: upsampling the input image to an enhanced version by applying bicubic interpolation. 7. The computer-implemented method of claim 1 , wherein the neural network comprises a plurality of cascading models. 8. The computer-implemented method of claim 7 , wherein the plurality of cascading models are chained together. 9. The computer-implemented method of claim 1 , wherein the iterative denoising of the input image further comprises: a plurality of iterative refinement steps corresponding to different levels of image quality, and wherein each step is trained with a regression loss. 10. The computer-implemented method of claim 9 , wherein a number of the plurality of iterative refinement steps is greater for refinement steps corresponding to a lower image quality than for refinement steps corresponding to a higher image quality. 11. The computer-implemented method of claim 1 , wherein the neural network comprises a plurality of cascading models, and wherein the training of the neural network comprises training the plurality of cascading models in parallel. 12. The computer-implemented method of claim 1 , wherein the training of the neural network is performed at the computing device. 13. A computer-implemented method, comprising: receiving, by a computing device, an input image; applying a neural network to predict an enhanced version of the input image by iteratively denoising the input image, wherein the iterative denoising is based on a reverse Markov chain associated with a forward Gaussian diffusion process, the neural network having been trained by applying the forward Gaussian diffusion process to add Gaussian noise to at least one corresponding target version of each image of a plurality of pairs of images in training data; and outputting the predicted enhanced version of the input image. 14. The computer-implemented method of claim 13 , wherein the neural network is a convolutional neural network comprising a U-net architecture based on a denoising diffusion probabilistic (DDPM) model. 15. The computer-implemented method of claim 13 , wherein the iterative denoising of the input image further comprising: upsampling the input image to an enhanced version by applying bicubic interpolation. 16. The computer-implemented method of claim 13 , wherein the neural network comprises a plurality of cascading models. 17. The computer-implemented method of claim 16 , wherein the plurality of cascading models are chained together. 18. The computer-implemented method of claim 13 , wherein the outputting of the predicted enhanced version of the input image further comprising: obtaining a trained neural network at the computing device; and applying the trained neural network as obtained to the outputting of the predicted enhanced version of the input image. 19. The computer-implemented method of claim 13 , wherein the outputting of the predicted enhanced version of the input image further comprising: determining, by the computing device, a request to predict the enhanced version of the input image; sending the request from the computing device to a second computing device, the second computing device comprising a trained version of the neural network; and after sending the request, the computing device receiving, from the second computing device, the predicted enhanced version of the input image. 20. The computer-implemented method of claim 13 , wherein the input image is of a first image resolution and the enhanced version of the input image is of a second image resolution, and wherein the second image resolution is greater than the first image resolution. 21. The computer-implemented method of claim 13 , wherein the input image is of a first color version and the enhanced version of the input image is of a second color version, and wherein the second color version is of a higher quality than the first color version. 22. The computer-implemented method of claim 13 , wherein the input image is of a first light composition and the enhanced version of the input image is of a second light composition, and wherein the second light composition is of a higher quality than the first light composition. 23. The computer-implemented method of claim 13 , wherein the input image comprises video content. 24. The computer-implemented method of claim 13 , wherein the input image is a compressed image file, and wherein the predicted enhanced version is a decompressed version of the compressed image file. 25. A computing device, comprising: one or more processors; and data storage, wherein the data storage has stored thereon computer-executable instructions that, when executed by the one or more processors, cause the computing device to carry out operations comprising: receiving, by the computing device, an input image; applying a neural network to predict an enhanced version of the input image by iteratively denoising the input image, wherein the iterative denoising is based on a reverse Markov chain associated with a forward Gaussian diffusion process, the neural network having been trained by applying the forward Gaussian diffusion process to add Gaussian noise to at least one corresponding target version of each image of a plurality of pairs of images in training data; and outputting the predicted enhanced version of the input image.

Assignees

Google Llc

Inventors

Classifications

G06T5/002Primary
Physics · mapped topic
G06N3/045
Combinations of networks · CPC title
G06N3/08
Learning methods · CPC title
G06T3/4007
based on interpolation, e.g. bilinear interpolation (image demosaicing G06T3/4015; edge-driven or edge-based scaling G06T3/403) · CPC title
G06T5/50
using two or more images, e.g. averaging or subtraction · CPC title

Patent family

Related publications grouped by family.

View patent family 85285870

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11769228B2 cover?: A method includes receiving, by a computing device, training data comprising a plurality of pairs of images, wherein each pair comprises an image and at least one corresponding target version of the image. The method also includes training a neural network based on the training data to predict an enhanced version of an input image, wherein the training of the neural network comprises applying a…
Who is the assignee on this patent?: Google Llc
What technology area does this patent fall under?: Primary CPC classification G06T5/002. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Sep 26 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).