Synthesizing sequences of images for movement-based performance

US12243140B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12243140-B2
Application numberUS-202117526647-A
CountryUS
Kind codeB2
Filing dateNov 15, 2021
Priority dateNov 15, 2021
Publication dateMar 4, 2025
Grant dateMar 4, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A technique for rendering an input geometry includes generating a first segmentation mask for a first input geometry and a first set of texture maps associated with one or more portions of the first input geometry. The technique also includes generating, via one or more neural networks, a first set of neural textures for the one or more portions of the first input geometry. The technique further includes rendering a first image corresponding to the first input geometry based on the first segmentation mask, the first set of texture maps, and the first set of neural textures.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method for rendering an input geometry, the computer-implemented method comprising: generating a first segmentation mask for a first input three-dimensional (3D) geometry and a first plurality of texture maps associated with a plurality of portions of the first input 3D geometry; generating, via execution of a plurality of generator blocks included in one or more neural networks, a first plurality of neural textures for the plurality of portions of the first input 3D geometry, wherein each generator block included in the plurality of generator blocks generates a neural texture for a different portion included in the plurality of portions of the first input 3D geometry, wherein portions of the first input 3D geometry corresponding to different generator blocks included in the plurality of generator blocks are non-overlapping; and rendering a first image corresponding to the first input 3D geometry based on the first segmentation mask, the first plurality of texture maps, and the first plurality of neural textures. 2. The computer-implemented method of claim 1 , further comprising training the one or more neural networks based on a training dataset that includes a second plurality of texture maps and a plurality of segmentation masks for a plurality of synthetic geometries. 3. The computer-implemented method of claim 1 , further comprising training the one or more neural networks based on one or more predictions generated by a discriminator neural network from one or more images produced by the one or more neural networks. 4. The computer-implemented method of claim 1 , wherein generating the first segmentation mask and the first plurality of texture maps comprises: deforming a template mesh to match the first input 3D geometry; and generating the first segmentation mask and the first plurality of texture maps based on a pose associated with the first input 3D geometry. 5. The computer-implemented method of claim 1 , further comprising rendering a second image corresponding to a second input geometry based on a second segmentation mask for the second input geometry, a second plurality of texture maps associated with a plurality of portions of the second input geometry, and the first plurality of neural textures. 6. The computer-implemented method of claim 1 , wherein generating the first plurality of neural textures comprises, for each portion included in the plurality of portions of the first input 3D geometry: generating an input vector based on sampling a distribution of latent variables associated with the portion; and inputting the input vector into the generator block corresponding to the portion. 7. The computer-implemented method of claim 1 , wherein rendering the first image comprises: sampling the first plurality of neural textures based on the first plurality of texture maps to generate a plurality of screen-space neural features; generating a composited set of screen-space neural features based on the first segmentation mask and the plurality of screen-space neural features; and applying one or more convolutional layers to the composited set of screen-space neural features to render the first image. 8. The computer-implemented method of claim 1 , wherein the one or more neural networks comprise a generative neural network. 9. The computer-implemented method of claim 1 , wherein the first input 3D geometry comprises a face. 10. The computer-implemented method of claim 1 , wherein the plurality of portions of the first input 3D geometry comprises at least one of a skin, a hair, one or more eyes, a mouth, or a background. 11. One or more non-transitory computer readable media storing instructions that, when executed by one or more processors, cause the one or more processors to perform the steps of: generating one or more maps associated with a plurality of portions of a first input three-dimensional (3D) geometry; generating, via execution of a plurality of generator blocks included in one or more neural networks, a first plurality of neural textures for the plurality of portions of the first input 3D geometry, wherein each generator block included in the plurality of generator blocks generates a neural texture for a different portion included in the plurality of portions of the first input 3D geometry, wherein portions of the first input 3D geometry corresponding to different generator blocks included in the plurality of generator blocks are non-overlapping; and rendering a first image corresponding to the first input 3D geometry based on the one or more maps and the first plurality of neural textures. 12. The one or more non-transitory computer readable media of claim 11 , wherein the instructions further cause the one or more processors to perform the step of training the one or more neural networks based on a training dataset that includes a plurality of maps associated with a plurality of synthetic geometries. 13. The one or more non-transitory computer readable media of claim 12 , wherein training the one or more neural networks comprises updating parameters of the one or more neural networks based on one or more predictions generated by a discriminator neural network from one or more images produced by the one or more neural networks. 14. The one or more non-transitory computer readable media of claim 11 , wherein generating the one or more maps comprises: deforming a template mesh to match the first input 3D geometry; and generating a segmentation mask and a set of texture maps based on a pose associated with the first input 3D geometry. 15. The one or more non-transitory computer readable media of claim 11 , wherein the instructions further cause the one or more processors to perform the step of rendering a second image corresponding to the first input 3D geometry based on the one or more maps and a second set of neural textures for the plurality of portions of the first input 3D geometry. 16. The one or more non-transitory computer readable media of claim 11 , wherein generating the first plurality of neural textures comprises: generating one or more input vector based on sampling one or more distributions of latent variables associated with the plurality of portions; executing a mapping network included in the one or more neural networks to convert one or more sampled vectors into one or more input vectors; and executing the plurality of generator blocks to convert the one or more input vectors into the first plurality of neural textures. 17. The one or more non-transitory computer readable media of claim 11 , wherein rendering the first image comprises: sampling the first plurality of neural textures based on a first plurality of texture maps included in the one or more maps to generate a plurality of screen-space neural features; generating a composited set of screen-space neural features based on a first segmentation mask included in the one or more maps and the plurality of screen-space neural features; and applying one or more convolutional layers to the composited set of screen-space neural features to render the first image. 18. The one or more non-transitory computer readable media of claim 11 , wherein the first input 3D geometry comprises a face. 19. The one or more non-transitory computer readable media of claim 11 , wherein the one or more maps comprises at least one of a skin texture map, a hair texture map, an eye texture map, a mouth texture map, or a background texture map. 20. A system, comprising: one or more memo

Assignees

Inventors

Classifications

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12243140B2 cover?
A technique for rendering an input geometry includes generating a first segmentation mask for a first input geometry and a first set of texture maps associated with one or more portions of the first input geometry. The technique also includes generating, via one or more neural networks, a first set of neural textures for the one or more portions of the first input geometry. The technique furthe…
Who is the assignee on this patent?
Disney Entpr Inc, Eth Zuerich Eidgenoessische Technische Hochschule Zuerich
What technology area does this patent fall under?
Primary CPC classification G06T13/40. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Mar 04 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).