Creating simulation-ready clothed 3d human avatars from text descriptions

US2025363703A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2025363703-A1
Application numberUS-202418979205-A
CountryUS
Kind codeA1
Filing dateDec 12, 2024
Priority dateMay 21, 2024
Publication dateNov 27, 2025
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Apparatuses, systems, and techniques for generating a clothed three-dimensional (3D) avatar character from a text prompt and enabling smooth animation through physics or neural simulators. In at least one embodiment, a clothed 3D avatar is generated through body layer modeling and garment layer modeling based on text descriptions. The outputs from the body layer and garment layer modeling are combined to generate an animation-ready, clothed 3D avatar.

First claim

Opening claim text (preview).

What is claimed is: 1 . A computer-implemented method for generating a clothed three-dimensional (3D) avatar, comprising: receiving one or more text prompts; modeling, based on the text prompts, a body layer for the clothed 3D avatar; modeling, based on the text prompts, a garment layer for the clothed 3D avatar; combining the body layer and the garment layer for the clothed 3D avatar; and outputting the clothed 3D avatar. 2 . The method of claim 1 , wherein modeling the body layer for the clothed 3D avatar comprises: determining, based on the one or more text prompts, a body mesh for the clothed 3D avatar; modeling a first set of Gaussians corresponding to the body mesh; and outputting a body model of the clothed 3D avatar. 3 . The method of claim 1 , wherein modeling the garment layer for the clothed 3D avatar comprises: determining, based on the one or more text prompts, a latent code corresponding to a garment template; generating, based on the latent code and using a decoder, an unsigned distance field (UDF) corresponding to the garment layer of the clothed 3D avatar; modeling a second set of Gaussians corresponding to the UDF; and outputting a garment model of the clothed 3D avatar. 4 . The method of claim 3 , wherein the latent code among a plurality of latent codes, wherein the plurality of latent codes correspond to a plurality of garment templates, and wherein the plurality of latent codes and the plurality of garment templates are learned from a garment dataset. 5 . The method of claim 3 , wherein the UDF comprises a plurality of unsigned distances, the method further comprises: determining, for each unsigned distance of the plurality of unsigned distances, an opacity value for the respective unsigned distance. 6 . The method of claim 3 , further comprising: deriving a garment mesh based on the UDF; generating, based on the garment mesh, a sequence of garment meshes corresponding to a sequence of poses associated with the clothed 3D avatar; and generating a third set of Gaussians corresponding to the garment mesh by transforming the second set of Gaussians corresponding to the UDF. 7 . The method of claim 3 , further comprising: training the generation of latent code and 3D Gaussians, wherein the training comprises: sampling a fourth set of 3D Gaussians from a pre-defined garment template mesh to optimize the fourth set of 3D Gaussians and a predicted latent code, wherein the predicted latent code corresponds to the pre-defined garment template mesh; extracting a predicted garment mesh from the optimized latent code; and sampling a fifth set of 3D Gaussians from the predicted garment mesh to optimize the attributes of the fifth set of 3D Gaussians. 8 . The method of claim 1 , further comprising: modeling lighting effects on the garment layer for the clothed 3D avatar. 9 . The method of claim 1 , wherein modeling the body layer and modeling the garment layer are performed in parallel, in sequence, or in a hybrid manner. 10 . The method of claim 1 , wherein the clothed 3D avatar is a clothed 3D human avatar. 11 . A system for generating a clothed three-dimensional (3D) avatar comprising: one or more processors configured to: receive one or more text prompts; model, based on the text prompts, a body layer for the clothed 3D avatar; model, based on the text prompts, a garment layer for the clothed 3D avatar; combine the body layer and the garment layer for the clothed 3D avatar; and output the clothed 3D avatar. 12 . The system of claim 11 , wherein modeling the body layer for the clothed 3D avatar comprises: determining, based on the one or more text prompts, a body mesh for the clothed 3D avatar; modeling a first set of Gaussians corresponding to the body mesh; and outputting a body model of the clothed 3D avatar. 13 . The system of claim 11 , wherein modeling the garment layer for the clothed 3D avatar comprises: determining, based on the one or more text prompts, a latent code corresponding to a garment template; generating, based on the latent code and using a decoder, an unsigned distance field (UDF) corresponding to the garment layer of the clothed 3D avatar; modeling a second set of Gaussians corresponding to the UDF; and outputting a garment model of the clothed 3D avatar. 14 . The system of claim 13 , wherein the latent code among a plurality of latent codes, wherein the plurality of latent codes correspond to a plurality of garment templates, and wherein the plurality of latent codes and the plurality of garment templates are learned from a garment dataset. 15 . The system of claim 13 , wherein the UDF comprises a plurality of unsigned distances, and wherein the one or more processors are further configured to: determine, for each unsigned distance of the plurality of unsigned distances, an opacity value for the respective unsigned distance. 16 . The system of claim 13 , wherein the one or more processors are further configured to: derive a garment mesh based on the UDF; generate, based on the garment mesh, a sequence of garment meshes corresponding to a sequence of poses associated with the clothed 3D avatar; and generate a third set of Gaussians corresponding to the garment mesh by transforming the second set of Gaussians corresponding to the UDF. 17 . The system of claim 13 , wherein the one or more processors are further configured to: train the generation of latent code and 3D Gaussians, wherein the training comprises: sampling a fourth set of 3D Gaussians from a pre-defined garment template mesh to optimize the fourth set of 3D Gaussians and a predicted latent code, wherein the predicted latent code corresponds to the pre-defined garment template mesh; extracting a predicted garment mesh from the optimized latent code; and sampling a fifth set of 3D Gaussians from the predicted garment mesh to optimize the attributes of the fifth set of 3D Gaussians. 18 . The system of claim 11 , wherein the one or more processors are further configured to: model lighting effects on the garment layer for the clothed 3D avatar. 19 . The system of claim 11 , wherein modeling the body layer and modeling the garment layer are performed in parallel, in sequence, or in a hybrid manner. 20 . The system of claim 11 , wherein the clothed 3D avatar is a clothed 3D human avatar. 21 . A machine-readable medium for generating a clothed three-dimensional (3D) avatar, having stored thereon a set of instructions, which if performed by one or more processors, cause the one or more processors to: receive one or more text prompts; model, based on the text prompts, a body layer for an avatar corresponding to the clothed 3D avatar; model, based on the text prompts, a garment layer for the clothed 3D avatar; combine the body layer and the garment layer for the clothed 3D avatar; and output the clothed 3D avatar. 22 . The machine-readable medium of claim 19 , wherein modeling the body layer for the clothed 3D avatar comprises: determining, based on the one or more text prompts, a body mesh for the clothed 3D avatar; modeling a first set of Gaussians corresponding to the body mesh; and outputting a body model of the clothed 3D avatar, and wherein modeling the garment layer for the clothed 3D avatar comprises: determining, based on the one or more text prompts, a latent code corresponding to a garment template; generating, based on the latent code and

Assignees

Inventors

Classifications

  • Particle system, point based geometry or rendering · CPC title

  • G06T13/40Primary

    of characters, e.g. humans, animals or virtual beings · CPC title

  • Handling natural language data (speech analysis or synthesis, speech recognition G10L) · CPC title

  • Cloth · CPC title

  • G06T17/00Primary

    Three-dimensional [3D] modelling for computer graphics · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2025363703A1 cover?
Apparatuses, systems, and techniques for generating a clothed three-dimensional (3D) avatar character from a text prompt and enabling smooth animation through physics or neural simulators. In at least one embodiment, a clothed 3D avatar is generated through body layer modeling and garment layer modeling based on text descriptions. The outputs from the body layer and garment layer modeling are c…
Who is the assignee on this patent?
Nvidia Corp
What technology area does this patent fall under?
Primary CPC classification G06T13/40. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Nov 27 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).