Methods, systems, and media for relighting images using predicted deep reflectance fields

US10997457B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10997457-B2
Application numberUS-201916616235-A
CountryUS
Kind codeB2
Filing dateOct 16, 2019
Priority dateMay 23, 2019
Publication dateMay 4, 2021
Grant dateMay 4, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods, systems, and media for relighting images using predicted deep reflectance fields are provided. In some embodiments, the method comprises: identifying a group of training samples, wherein each training sample includes (i) a group of one-light-at-a-time (OLAT) images that have each been captured when one light of a plurality of lights arranged on a lighting structure has been activated, (ii) a group of spherical color gradient images that have each been captured when the plurality of lights arranged on the lighting structure have been activated to each emit a particular color, and (iii) a lighting direction, wherein each image in the group of OLAT images and each of the spherical color gradient images are an image of a subject, and wherein the lighting direction indicates a relative orientation of a light to the subject; training a convolutional neural network using the group of training samples, wherein training the convolutional neural network comprises: for each training iteration in a series of training iterations and for each training sample in the group of training samples: generating an output predicted image, wherein the output predicted image is a representation of the subject associated with the training sample with lighting from the lighting direction associated with the training sample; identifying a ground-truth OLAT image included in the group of OLAT images for the training sample that corresponds to the lighting direction for the training sample; calculating a loss that indicates a perceptual difference between the output predicted image and the identified ground-truth OLAT image; and updating parameters of the convolutional neural network based on the calculated loss; identifying a test sample that includes a second group of spherical color gradient images and a second lighting direction; and generating a relit image of the subject included in each of the second group of spherical color gradient images with lighting from the second lighting direction using the trained convolutional neural network.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for relighting images using deep reflectance fields, comprising: identifying a group of training samples, wherein each training sample includes (i) a group of one-light-at-a-time (OLAT) images that have each been captured when one light of a plurality of lights arranged on a lighting structure has been activated, (ii) a group of spherical color gradient images that have each been captured when the plurality of lights arranged on the lighting structure have been activated to each emit a particular color, and (iii) a lighting direction, wherein each image in the group of OLAT images and each of the spherical color gradient images are an image of a subject, and wherein the lighting direction indicates a relative orientation of a light to the subject; training a convolutional neural network using the group of training samples, wherein training the convolutional neural network comprises: for each training iteration in a series of training iterations and for each training sample in the group of training samples: generating an output predicted image, wherein the output predicted image is a representation of the subject associated with the training sample with lighting from the lighting direction associated with the training sample; identifying a ground-truth OLAT image included in the group of OLAT images for the training sample that corresponds to the lighting direction for the training sample; calculating a loss that indicates a perceptual difference between the output predicted image and the identified ground-truth OLAT image; and updating parameters of the convolutional neural network based on the calculated loss; identifying a test sample that includes a second group of spherical color gradient images and a second lighting direction; and generating a relit image of the subject included in each of the second group of spherical color gradient images with lighting from the second lighting direction using the trained convolutional neural network. 2. The method of claim 1 , wherein the lighting structure is a spherical dome, and wherein the plurality of lights are arranged on a surface of the spherical dome. 3. The method of claim 1 , wherein the loss is calculated using a pre-trained neural network. 4. The method of claim 1 , wherein the loss includes a first loss component that indicates the perceptual difference between the output image and the identified OLAT image based on texture information in each image, and wherein the loss includes a second loss component that indicates the perceptual difference between the output image and the identified OLAT image based on specularity information in each image. 5. The method of claim 4 , wherein the second loss component is calculated using a trained neural network that has been trained to take, as an input, an OLAT image, and to generate, as an output, a light direction of a light used to generate the OLAT image. 6. The method of claim 1 , wherein the group of OLAT images and the group of spherical color gradient images for each of the training samples are captured from a first plurality of cameras, each having a viewpoint from a first plurality of viewpoints, and wherein the second group of spherical color gradient images corresponding to the test sample are captured from a camera having a viewpoint that is not included in the first plurality of viewpoints. 7. The method of claim 1 , further comprising generating an aligned ground-truth OLAT image prior to calculating the loss, wherein the loss is calculated using the aligned ground-truth image. 8. A system for relighting images using deep reflectance fields, the system comprising: a memory; and a hardware processor that, when executing computer-executable instructions stored in the memory, is configured to: identify a group of training samples, wherein each training sample includes (i) a group of one-light-at-a-time (OLAT) images that have each been captured when one light of a plurality of lights arranged on a lighting structure has been activated, (ii) a group of spherical color gradient images that have each been captured when the plurality of lights arranged on the lighting structure have been activated to each emit a particular color, and (iii) a lighting direction, wherein each image in the group of OLAT images and each of the spherical color gradient images are an image of a subject, and wherein the lighting direction indicates a relative orientation of a light to the subject; train a convolutional neural network using the group of training samples, wherein training the convolutional neural network comprises: for each training iteration in a series of training iterations and for each training sample in the group of training samples: generating an output predicted image, wherein the output predicted image is a representation of the subject associated with the training sample with lighting from the lighting direction associated with the training sample; identifying a ground-truth OLAT image included in the group of OLAT images for the training sample that corresponds to the lighting direction for the training sample; calculating a loss that indicates a perceptual difference between the output predicted image and the identified ground-truth OLAT image; and updating parameters of the convolutional neural network based on the calculated loss; identify a test sample that includes a second group of spherical color gradient images and a second lighting direction; and generate a relit image of the subject included in each of the second group of spherical color gradient images with lighting from the second lighting direction using the trained convolutional neural network. 9. The system of claim 8 , wherein the lighting structure is a spherical dome, and wherein the plurality of lights are arranged on a surface of the spherical dome. 10. The system of claim 8 , wherein the loss is calculated using a pre-trained neural network. 11. The system of claim 8 , wherein the loss includes a first loss component that indicates the perceptual difference between the output image and the identified OLAT image based on texture information in each image, and wherein the loss includes a second loss component that indicates the perceptual difference between the output image and the identified OLAT image based on specularity information in each image. 12. The system of claim 11 , wherein the second loss component is calculated using a trained neural network that has been trained to take, as an input, an OLAT image, and to generate, as an output, a light direction of a light used to generate the OLAT image. 13. The system of claim 8 , wherein the group of OLAT images and the group of spherical color gradient images for each of the training samples are captured from a first plurality of cameras, each having a viewpoint from a first plurality of viewpoints, and wherein the second group of spherical color gradient images corresponding to the test sample are captured from a camera having a viewpoint that is not included in the first plurality of viewpoints. 14. The system of claim 8 , wherein the hardware processor is further configured to generate an aligned ground-truth OLAT image prior to calculating the loss, wherein the loss is calculated using the aligned ground-truth image. 15. A non-transitory computer-readable medium containing computer executable instructions that, when executed by a processor, cause the processor to perform a method for relighting images using deep reflectance fields, the method comprising: identifying a group of training samples, wherein each training sample includes (i) a gro

Assignees

Inventors

Classifications

  • G06T15/506Primary

    Illumination models · CPC title

  • using neural networks · CPC title

  • using classification, e.g. of video objects · CPC title

  • relating to illumination properties, e.g. using a reflectance or lighting model · CPC title

  • Combinations of networks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10997457B2 cover?
Methods, systems, and media for relighting images using predicted deep reflectance fields are provided. In some embodiments, the method comprises: identifying a group of training samples, wherein each training sample includes (i) a group of one-light-at-a-time (OLAT) images that have each been captured when one light of a plurality of lights arranged on a lighting structure has been activated, …
Who is the assignee on this patent?
Google Llc
What technology area does this patent fall under?
Primary CPC classification G06T15/506. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 04 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).