Who is the assignee on this patent?

Disney Entpr Inc, Eth Zuerich Eidgenoessische Technische Hochschule Zuerich

What technology area does this patent fall under?

Primary CPC classification G06T17/20. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Mar 04 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Face reconstruction using a mesh convolution network

US12243349B2 · US · B2

Patent metadata
Field	Value
Publication number	US-12243349-B2
Application number	US-202217697774-A
Country	US
Kind code	B2
Filing date	Mar 17, 2022
Priority date	Mar 17, 2021
Publication date	Mar 4, 2025
Grant date	Mar 4, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Embodiment of the present invention sets forth techniques for performing face reconstruction. The techniques include generating an identity mesh based on an identity encoding that represents an identity associated with a face in one or more images. The techniques also include generating an expression mesh based on an expression encoding that represents an expression associated with the face in the one or more images. The techniques also include generating, by a machine learning model, an output mesh of the face based on the identity mesh and the expression mesh.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method for performing reconstruction of a face, the computer-implemented method comprising: generating an identity mesh based on an identity encoding that represents an identity associated with a face in one or more images; generating an expression mesh based on an expression encoding that represents an expression associated with the face in the one or more images; and generating, by a machine learning model, an output mesh of the face via an upsampling operation associated with at least one of the identity mesh or the expression mesh. 2. The computer-implemented method of claim 1 , wherein the expression mesh associates one or more expression features with one or more locations of a mesh topology, and the identity mesh associates one or more identity features with one or more locations of the mesh topology. 3. The computer-implemented method of claim 1 , further comprising generating, based on the one or more images of the face, one or more camera parameters associated with the one or more images. 4. The computer-implemented method of claim 3 , further comprising adjusting one or both of the identity mesh or the expression mesh based on a feature selection, the feature selection being based on the one or more camera parameters. 5. The computer-implemented method of claim 3 , further comprising training the machine learning model based on one or more losses, the one or more losses including one or more of, an identity loss based on the generated identity mesh and a ground truth identity mesh of the face, an expression loss based on the generated expression mesh and a ground truth expression mesh of the face, an output mesh loss based on the generated output mesh and a ground truth mesh, or a camera parameter loss based on the one or more camera parameters and one or more ground truth camera parameters. 6. The computer-implemented method of claim 1 , wherein each of the expression mesh and the identity mesh includes one or both of a set of vertex coordinates or a set of vertex displacement vectors. 7. The computer-implemented method of claim 1 , wherein a resolution of the output mesh is higher than a resolution of one or both of the identity mesh or the expression mesh. 8. The computer-implemented method of claim 1 , further comprising training the machine learning model based on an identity consistency loss, wherein the identity consistency loss is based on identity encodings associated with each of the one or more images. 9. The computer-implemented method of claim 1 , further comprising normalizing a set of vertices in one or both of the identity mesh or the expression mesh, wherein the normalizing is based on a difference between the one or both of the identity mesh or the expression mesh and an average mesh. 10. One or more non-transitory computer readable media storing instructions that, when executed by one or more processors, cause the one or more processors to perform the steps of: generating an identity mesh based on an identity encoding that represents an identity associated with a face in one or more images; generating an expression mesh based on an expression encoding that represents an expression associated with the face in the one or more images; and generating, by a machine learning model, an output mesh of the face via an upsampling operation associated with at least one of the identity mesh or the expression mesh. 11. The one or more non-transitory computer readable media of claim 10 , wherein the expression mesh associates one or more expression features with one or more locations of a mesh topology, and the identity mesh associates one or more identity features with one or more locations of the mesh topology. 12. The one or more non-transitory computer readable media of claim 10 , further comprising generating, based on the one or more images of the face, one or more camera parameters associated with the one or more images. 13. The one or more non-transitory computer readable media of claim 12 , further comprising adjusting one or both of the identity mesh or the expression mesh based on a feature selection, the feature selection being based on the one or more camera parameters. 14. The one or more non-transitory computer readable media of claim 12 , wherein the instructions further cause the one or more processors to perform the step of training the machine learning model based on one or more losses, the one or more losses including one or more of, an identity loss based on the generated identity mesh and a ground truth identity mesh of the face, an expression loss based on the generated expression mesh and a ground truth expression mesh of the face, an output mesh loss based on the generated output mesh and a ground truth mesh, or a camera parameter loss based on the one or more camera parameters and one or more ground truth camera parameters. 15. The one or more non-transitory computer readable media of claim 10 , wherein each of the expression mesh and the identity mesh includes one or both of a set of vertex coordinates or a set of vertex displacement vectors. 16. The one or more non-transitory computer readable media of claim 10 , wherein a resolution of the output mesh is higher than a resolution of one or both of the identity mesh or the expression mesh. 17. The one or more non-transitory computer readable media of claim 10 , wherein the instructions further cause the one or more processors to perform the step of training the machine learning model based on an identity consistency loss, wherein the identity consistency loss is based on identity encodings associated with each of the one or more images. 18. The one or more non-transitory computer readable media of claim 10 , wherein the instructions further cause the one or more processors to perform the step of normalizing a set of vertices in one or both of the identity mesh or the expression mesh, wherein the normalizing is based on a difference between the one or both of the identity mesh or the expression mesh and an average mesh. 19. A system, comprising: one or more memories that store instructions, and one or more processors that are coupled to the one or more memories and, when executing the instructions, are configured to: generate an identity mesh based on an identity encoding that represents an identity of a face in one or more images; generate an expression mesh based on an expression encoding that represents an expression of the face in the one or more images of the face; and generate, by a machine learning model, an output mesh of the face via an upsampling operation associated with at least one of the identity mesh or the expression mesh. 20. The system of claim 19 , wherein the one or more processors, when executing the instructions, are configured to train the machine learning model based on one or more losses, the one or more losses including one or more of, an identity loss based on the generated identity mesh and a ground truth identity mesh of the face, an expression loss based on the generated expression mesh and a ground truth expression mesh of the face, an output mesh loss based on the generated output mesh and a ground truth mesh, or a camera parameter loss based on one or more camera parameters and one or more ground truth camera parameters.

Assignees

Inventors

Classifications

G06V40/172
Classification, e.g. identification · CPC title
G06T17/20Primary
Finite element generation, e.g. wire-frame surface description, {tesselation} · CPC title
G06V10/454
Integrating the filters into a hierarchical structure, e.g. convolutional neural networks [CNN] · CPC title
G06V40/166
using acquisition arrangements · CPC title
G06V40/161
Detection; Localisation; Normalisation · CPC title

Patent family

Related publications grouped by family.

View patent family 83283912

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12243349B2 cover?: Embodiment of the present invention sets forth techniques for performing face reconstruction. The techniques include generating an identity mesh based on an identity encoding that represents an identity associated with a face in one or more images. The techniques also include generating an expression mesh based on an expression encoding that represents an expression associated with the face in …
Who is the assignee on this patent?: Disney Entpr Inc, Eth Zuerich Eidgenoessische Technische Hochschule Zuerich
What technology area does this patent fall under?: Primary CPC classification G06T17/20. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Mar 04 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).

How to read this patent

Abstract

First claim

Assignees

Inventors

Classifications

Patent family

External sources

Related patents

Avatar modeling and generation

Semantic deep face models

Neural face editing with intrinsic image disentangling

Face recognition method and apparatus

Frequently asked questions