Mixture of volumetric primitives for efficient neural rendering

US2024303951A1 · US · A1

Patent metadata
Publication number: US-2024303951-A1
Application number: US-202418636687-A
Country: US
Kind code: A1
Filing date: Apr 16, 2024
Priority date: Jan 26, 2021
Publication date: Sep 12, 2024
Grant date: (not listed)

Abstract

Official abstract text for this publication.

A method for training a real-time model for animating an avatar of a subject is provided. The method includes collecting multiple images of a subject. The method also includes selecting a plurality of vertex positions in a guide mesh, indicative of a volumetric primitive enveloping the subject; determining a geometric attribute for the volumetric primitive, including a position, a rotation, and a scale factor of the volumetric primitive; determining a payload attribute for each volumetric primitive, the payload attribute including a color value and an opacity value for each voxel in a voxel grid defining the volumetric primitive; determining a loss factor for each point in the volumetric primitive based on the geometric attribute, the payload attribute, and a ground truth value; and updating a three-dimensional model for the subject. A system and a non-transitory, computer-readable medium storing instructions to perform the above method are also provided.
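The abstract describes each volumetric primitive by a geometric attribute (position, rotation, scale factor) and a payload attribute (a color and an opacity value per voxel). A minimal Python sketch of that data layout follows. The class name `VolumetricPrimitive`, its field names, the cubic grid, and the nearest-voxel lookup are all illustrative assumptions and are not taken from the patent text.

```python
from dataclasses import dataclass
from typing import List, Tuple

Vec3 = Tuple[float, float, float]
RGBA = Tuple[float, float, float, float]


@dataclass
class VolumetricPrimitive:
    """Hypothetical box-shaped primitive: geometric attribute plus voxel payload."""
    position: Vec3                   # centre of the box in world space
    rotation: List[List[float]]      # 3x3 rotation matrix (local -> world)
    scale: Vec3                      # half-extent of the box along each local axis
    payload: List[List[List[RGBA]]]  # payload[z][y][x] = (r, g, b, opacity)

    def to_local(self, p: Vec3) -> Vec3:
        """Map a world point into the primitive's local [-1, 1]^3 frame."""
        d = [p[i] - self.position[i] for i in range(3)]
        # The inverse of a rotation matrix is its transpose.
        local = [sum(self.rotation[r][c] * d[r] for r in range(3)) for c in range(3)]
        return tuple(local[i] / self.scale[i] for i in range(3))

    def sample(self, p: Vec3) -> RGBA:
        """Nearest-voxel lookup; points outside the box are empty space."""
        u = self.to_local(p)
        if any(abs(c) > 1.0 for c in u):
            return (0.0, 0.0, 0.0, 0.0)
        n = len(self.payload)  # assumes a cubic n x n x n grid
        ix, iy, iz = (min(n - 1, int((c + 1.0) * 0.5 * n)) for c in u)
        return self.payload[iz][iy][ix]
```

In this sketch, animating the model amounts to changing `position`, `rotation`, and `scale` per frame while the voxel payload supplies appearance. A trainable version would store these fields as tensors in an automatic-differentiation framework so that the loss factors can update them.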

First claim

Opening claim text (preview).

1. (canceled)

2. A computer-implemented method, comprising: collecting multiple images including one or more different angles of view of a subject; selecting a plurality of vertex positions in a guide mesh, indicative of multiple vertices of a one or more volumetric primitives enveloping the subject; determining a geometric attribute for each of the one or more volumetric primitives; determining a payload attribute for each of the one or more volumetric primitives; determining loss factors associated with the one or more volumetric primitives based on the geometric attribute, the payload attribute, and a ground truth value; and updating a three-dimensional model for the subject according to the loss factors, the three-dimensional model including the one or more volumetric primitives.

3. The computer-implemented method of claim 1, wherein selecting the plurality of vertex positions in the guide mesh comprises selecting a constraining factor so that a volume of the one or more volumetric primitives is greater than a selected threshold.

4. The computer-implemented method of claim 1, wherein selecting the plurality of vertex positions in the guide mesh comprises selecting a minimum volume value of the one or more volumetric primitives so that each point in the images of the subject is within the one or more volumetric primitives.

5. The computer-implemented method of claim 1, wherein the one or more volumetric primitives are minimally overlapping and dynamically moving, and determining the geometric attribute for each of the one or more volumetric primitives comprises allowing a change in a position, a rotation, and a scale factor of the one or more volumetric primitives to reduce the loss factors.

6. The computer-implemented method of claim 1, further comprising determining a color value and an opacity value for each voxel in a voxel grid defining the one or more volumetric primitives by tracing a ray of points for each of the volumetric primitives and accumulating three projected color values and a projected opacity value from the images of the subject along a selected point of view.

7. The computer-implemented method of claim 1, wherein determining the payload attribute further comprises determining an opacity fade factor to avoid opacity artifacts in overlapping volume primitives close to a boundary of the one or more volumetric primitives.

8. The computer-implemented method of claim 1, wherein determining the loss factors comprises determining a mesh reconstruction loss based on the vertex positions in the guide mesh and a ground truth position on a tracked mesh.

9. The computer-implemented method of claim 1, further comprising selecting a number of volumetric primitives and a number of voxels in a voxel grid per volumetric primitive based on the loss factors.

10. The computer-implemented method of claim 1, further comprising interpolating the three-dimensional model between two key frames in a sequence of images of the subject.

11. The computer-implemented method of claim 1, further comprising: forming a background model with the multiple images excluding the subject; and updating the three-dimensional model for the subject comprises combining the one or more volumetric primitives with the background model.

12. The computer-implemented method of claim 1, further comprising: rendering a real-time representation of the subject based at least in part on the updated three-dimensional model.

13. A system, comprising: a memory storing multiple instructions; and one or more processors configured to execute the instructions to cause the system to: collect multiple images including one or more different angles of view of a subject; select a plurality of vertex positions in a guide mesh, indicative of multiple vertices of a one or more volumetric primitives enveloping the subject; determine a geometric attribute for each of the one or more volumetric primitives; determine a payload attribute for each of the one or more volumetric primitives; determine loss factors associated with the one or more volumetric primitives based on the geometric attribute, the payload attribute, and a ground truth value; and update a three-dimensional model for the subject according to the loss factors, the three-dimensional model including the one or more volumetric primitives.

14. The system of claim 13, wherein to select the plurality of vertex positions in the guide mesh, the one or more processors execute instructions to select a minimum volume value of the one or more volumetric primitives so that each point in the images of the subject is within the one or more volumetric primitives.

15. The system of claim 13, wherein the one or more volumetric primitives are minimally overlapping and dynamically moving, and to determine the geometric attribute for each of the one or more volumetric primitives, the one or more processors execute instructions to allow a change in a position, a rotation, and a scale factor of the one or more volumetric primitives to reduce the loss factors.

16. The system of claim 13, wherein to determine the payload attribute the one or more processors execute instructions to determine an opacity fade factor to avoid opacity artifacts in overlapping volume primitives close to a boundary of the one or more volumetric primitives.

17. A computer-implemented method, comprising: collecting a binocular image of a subject; generating a three-dimensional model of the subject including a patch of minimally overlapping volumetric primitives based on two or more different views of the subject from the binocular image; determining a payload attribute for one or more volumetric primitives of the three-dimensional model; determining loss factors associated with the one or more volumetric primitives based on a geometric attribute, the payload attribute, and a ground truth value; updating the three-dimensional model for the subject based on the loss factors, the three-dimensional model including the one or more volumetric primitives; and embedding the updated three-dimensional model of the subject in an immersive reality environment, for a real-time application.

18. The computer-implemented method of claim 17, further comprising adjusting a voxel count for each of the patch of minimally overlapping volumetric primitives based on a latency threshold for the real-time application.

19. The computer-implemented method of claim 17, wherein embedding the three-dimensional model of the subject in the immersive reality environment comprises animating the three-dimensional model by allowing a change in the geometric attribute in the patch of minimally overlapping volumetric primitives, according to the loss factors, wherein the geometric attribute includes a position, a rotation, and a scale factor of the one or more volumetric primitives.

20. The computer-implemented method of claim 17, wherein embedding the three-dimensional model of the subject in the immersive reality environment comprises convolving a translation, rotation and scale deviation of the patch of minimally overlapping volumetric primitives with a guide mesh selected from a sequence of binocular images of the subject.

21. The computer-implemented method of claim 17, wherein embedding the three-dimensional model of the subject in the immersive reality environment comprises interpolating the three-dimensional model between two key frames in a sequence of images of the subject.
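Claims 6 to 8 mention accumulating color and opacity along a traced ray, an opacity fade factor near primitive boundaries, and a mesh reconstruction loss. The Python sketch below illustrates one common way such pieces can be written. It uses standard front-to-back alpha compositing and an exponential falloff as stand-ins; the function names, the functional form of the fade, and the constants `sharpness` and `power` are assumptions for illustration, not the formulas disclosed in the patent.

```python
import math
from typing import Iterable, Sequence, Tuple

RGBA = Tuple[float, float, float, float]
Vec3 = Tuple[float, float, float]


def opacity_fade(u: Sequence[float], sharpness: float = 8.0, power: int = 8) -> float:
    """Weight near 1 at the centre of a primitive, near 0 at its faces.

    `u` is a point in the primitive's local [-1, 1]^3 frame. The exponential
    form and both constants are illustrative assumptions.
    """
    return math.exp(-sharpness * sum(c ** power for c in u))


def composite(samples: Iterable[RGBA]) -> RGBA:
    """Front-to-back alpha compositing of (r, g, b, opacity) samples on one ray."""
    r = g = b = 0.0
    transmittance = 1.0  # fraction of light not yet absorbed
    for sr, sg, sb, a in samples:
        w = transmittance * a
        r, g, b = r + w * sr, g + w * sg, b + w * sb
        transmittance *= 1.0 - a
        if transmittance < 1e-4:  # ray is effectively opaque, stop early
            break
    return (r, g, b, 1.0 - transmittance)


def mesh_reconstruction_loss(guide: Sequence[Vec3], tracked: Sequence[Vec3]) -> float:
    """Mean squared distance between guide-mesh and tracked-mesh vertices."""
    total = sum(
        sum((gc - tc) ** 2 for gc, tc in zip(gv, tv))
        for gv, tv in zip(guide, tracked)
    )
    return total / len(guide)
```

In a renderer built this way, each sample's opacity would be multiplied by `opacity_fade` at that sample's local coordinates before calling `composite`, so that overlapping primitives blend smoothly at their edges instead of showing hard box boundaries.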

Assignees

Meta Platforms Tech Llc

Inventors

Classifications

  • Colour editing, changing, or manipulating; Use of colour codes

  • Finite element generation, e.g. wire-frame surface description, {tesselation}

  • Ray-tracing

  • G06T19/20 (Primary): Editing of three-dimensional [3D] images, e.g. changing shapes or colours, aligning objects or positioning parts

  • G06T17/00 (Primary): Three-dimensional [3D] modelling for computer graphics

Frequently asked questions

Answers are generated from the same data shown on this page.

Who is the assignee on this patent?
Meta Platforms Tech Llc
What technology area does this patent fall under?
Primary CPC classification G06T19/20. Mapped technology areas include Physics.
When was this patent published?
Publication date: Sep 12, 2024 (kind code A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).