Generative latent textured proxies for object category modeling

US11710287B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11710287-B2
Application numberUS-202017309817-A
CountryUS
Kind codeB2
Filing dateAug 4, 2020
Priority dateJun 30, 2020
Publication dateJul 25, 2023
Grant dateJul 25, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods are described for generating a plurality of three-dimensional (3D) proxy geometries of an object, generating, based on the plurality of 3D proxy geometries, a plurality of neural textures of the object, the neural textures defining a plurality of different shapes and appearances representing the object, providing the plurality of neural textures to a neural renderer, receiving, from the neural renderer and based on the plurality of neural textures, a color image and an alpha mask representing an opacity of at least a portion of the object, and generating a composite image based on the pose, the color image, and the alpha mask.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method utilizing at least one processing device to perform operations including: receiving a pose associated with an object in image content; generating a plurality of three-dimensional (3D) proxy geometries of the object, the plurality of 3D proxy geometries being based on a shape of the object; generating, based on the plurality of 3D proxy geometries, a plurality of neural textures of the object, each of the plurality of neural textures defining a plurality of different shapes and appearances representing the object, the plurality of neural textures being configured to reconstruct a hidden portion of the object captured in the image content; providing the plurality of neural textures to a neural renderer, the plurality of neural textures being provided in a stacked formation, the hidden portion being reconstructed based on the stacked formation of the plurality of neural textures; generating, by the neural renderer, transparent layers of the object and surfaces behind the transparent layers of the object based on the plurality of neural textures; receiving, from the neural renderer and based on the plurality of neural textures, a color image and an alpha mask representing an opacity of at least a portion of the object; and generating a composite image based on the pose, the color image, and the alpha mask. 2. The method of claim 1 , further comprising: rendering a latent texture onto a target viewpoint based at least in part on the pose associated with the object, wherein each of the plurality of 3D proxy geometries include a geometric approximation of at least a portion of the object and the latent texture of the object mapped to the geometric approximation. 3. The method of claim 1 , wherein each of the plurality of 3D proxy geometries encode surface light field associated with the object in the image content, the surface light field including specular reflections associated with the object. 4. The method of claim 1 , wherein the plurality of neural textures are based, at least in part, on the pose, each of the plurality of neural textures being generated by: identifying a category of the object; generating a feature map based on the identified category of the object; providing the feature map to a neural network; and generating each of the plurality of neural textures based on a latent code associated with each instance of the identified category and a view associated with the pose. 5. The method of claim 1 , wherein at least a portion of the object is a transparent material. 6. The method of claim 1 , wherein at least a portion of the object is a reflective material. 7. The method of claim 1 , wherein: the image content includes image data including at least a user; and the object includes a pair of eyeglasses. 8. A system comprising: at least one processing device; and a memory storing instructions that when executed cause the system to perform operations including: receiving a pose associated with an object in image content; generating a plurality of three-dimensional (3D) proxy geometries of the object, the plurality of 3D proxy geometries being based on a shape of the object; generating, based on the plurality of 3D proxy geometries, a plurality of neural textures of the object, each of the plurality of neural textures defining a plurality of different shapes and appearances representing the object, the plurality of neural textures being configured to reconstruct a hidden portion of the object captured in the image content; providing the plurality of neural textures to a neural renderer, the plurality of neural textures being provided in a stacked formation, the hidden portion being reconstructed based on the stacked formation of the plurality of neural textures; generating, by the neural renderer, transparent layers of the object and surfaces behind the transparent layers of the object based on the plurality of neural textures; receiving, from the neural renderer and based on the plurality of neural textures, a color image and an alpha mask representing an opacity of at least a portion of the object; and generating a composite image based on the color image and the alpha mask. 9. The system of claim 8 , further comprising: rendering a latent texture onto a target viewpoint based at least in part on the pose associated with the object, wherein each of the plurality of 3D proxy geometries include a geometric approximation of at least a portion of the object and the latent texture of the object mapped to the geometric approximation. 10. The system of claim 8 , wherein each of the plurality of 3D proxy geometries encode surface light field associated with the object in the image content, the surface light field including specular reflections associated with the object. 11. The system of claim 8 , wherein the plurality of neural textures are based, at least in part, on the pose, each of the plurality of neural textures being generated by: identifying a category of the object; generating a feature map based on the identified category of the object; providing the feature map to a neural network; and generating each of the plurality of neural textures based on a latent code associated with each instance of the identified category and a view associated with the pose. 12. The system of claim 11 , wherein the neural renderer uses a generative model to reconstruct unseen object instances within the identified category, the reconstruction based on less than four captured views of the object. 13. The system of claim 8 , wherein the plurality of 3D proxy geometries are based on geometry interpolation of shapes that construct the object in the image content. 14. A non-transitory, machine-readable medium having instructions stored thereon, the instructions, when executed by a processor, cause a computing device to: receiving a pose associated with an object in image content; generate a plurality of three-dimensional (3D) proxy geometries of the object, the plurality of 3D proxy geometries being based on a shape of the object; generate, based on the plurality of 3D proxy geometries, a plurality of neural textures of the object, each of the plurality of neural textures defining a plurality of different shapes and appearances representing the object, the plurality of neural textures being configured to reconstruct a hidden portion of the object captured in the image content; provide the plurality of neural textures to a neural renderer, the plurality of neural textures being provided in a stacked formation, the hidden portion being reconstructed based on the stacked formation of the plurality of neural textures; generate, by the neural renderer, transparent layers of the object and surfaces behind the transparent layers of the object based on the plurality of neural textures; receive, from the neural renderer and based on the plurality of neural textures, a color image and an alpha mask representing an opacity of at least a portion of the object; and generate a composite image based on the color image and the alpha mask. 15. The machine-readable medium of claim 14 , further comprising: rendering a latent texture onto a target viewpoint based at least in part on the pose associated with the object, wherein each of the plurality of 3D texture proxy geometries include a geometric approximation of at least a portion of the object and the latent texture of the object mapped to the geometric approximation. 16. The machine-readable medium of claim 14 , wherein the plurality of neural textures are based, at least in p

Assignees

Inventors

Classifications

  • Artificial neural networks [ANN] · CPC title

  • Shape modification · CPC title

  • Colour editing, changing, or manipulating; Use of colour codes · CPC title

  • structured as a network, e.g. client-server architectures · CPC title

  • G06T19/20Primary

    Editing of three-dimensional [3D] images, e.g. changing shapes or colours, aligning objects or positioning parts · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11710287B2 cover?
Systems and methods are described for generating a plurality of three-dimensional (3D) proxy geometries of an object, generating, based on the plurality of 3D proxy geometries, a plurality of neural textures of the object, the neural textures defining a plurality of different shapes and appearances representing the object, providing the plurality of neural textures to a neural renderer, receivi…
Who is the assignee on this patent?
Google Llc
What technology area does this patent fall under?
Primary CPC classification G06T19/20. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 25 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).