Methods of 3d clothed human reconstruction and animation from monocular image

US2023281921A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2023281921-A1
Application numberUS-202217982945-A
CountryUS
Kind codeA1
Filing dateNov 8, 2022
Priority dateMar 1, 2022
Publication dateSep 7, 2023
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for 3D human model reconstruction and animation includes receiving a two-dimensional (2D) image of a human, segmenting the 2D image into a foreground with the human and a background without the human in the 2D image, generating a parametric model comprising a pose, a shape, and one or more rigging parameters based on the human in the foreground, predicting a textured three-dimensional (3D) model using implicit surface reconstruction of the human in the foreground, aligning the parametric model and the textured 3D model using a 3D registration, and generating a textured 3D clothed human model based on the aligned parametric model and the predicted textured 3D model.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method executed by at least one processor, the method comprising: receiving, by a processor, a two-dimensional (2D) image of a human; segmenting the 2D image into a foreground with the human and a background without the human in the 2D image; generating a parametric model comprising a pose, a shape, and one or more rigging parameters based on the human in the foreground; predicting a textured three-dimensional (3D) model using implicit surface reconstruction of the human in the foreground; aligning the parametric model and the textured 3D model using a 3D registration; and generating a textured 3D clothed human model based on the aligned parametric model and the predicted textured 3D model, wherein the 3D clothed human model comprises at least a 3D shape of the human in the 2D image with reconstructed surface textures in a reconstructed 3D space. 2 . The method of claim 1 , wherein the aligning comprises: transferring the texture from the textured 3D model to the parametric model; and transferring the pose, the shape, and the one or more rigging parameters from the parametric model to the textured 3D model. 3 . The method of claim 1 , wherein the parametric model is generated to fit the 2D image input. 4 . The method of claim 1 , wherein the parametric model and textured 3D model are aligned by minimizing a difference of one or more distance fields in 3D space. 5 . The method of claim 1 , wherein the textured 3D clothed human model is animated by a motion capture dataset. 6 . The method of claim 1 , wherein the parametric model is based on one of the following statistical human models: SMPL, SMPL-X, or STAR. 7 . The method of claim 1 , wherein the generating the textured 3D model using implicit surface reconstruction is based on a PIFu scheme from the 2D image. 8 . An apparatus comprising: at least one memory configured to store program code; and at least one processor configured to read the program code and operate as instructed by the program code, the program code including: receiving code configured to cause the at least one processor to receive, a two-dimensional (2D) image of a human; segmenting code configured to cause the at least one processor to segment the 2D image into a foreground with the human and a background without the human in the 2D image; first generating code configured to cause the at least one processor to generate a parametric model comprising a pose, a shape, and one or more rigging parameters based on the human in the foreground; second generating code configured to cause the at least one processor to predict a textured three-dimensional (3D) model using implicit surface reconstruction of the human in the foreground; aligning code configured to cause the at least one processor to align the parametric model and the textured 3D model using a 3D registration; and third generating code configured to cause the at least one processor to generate a textured 3D clothed human model based on the aligned parametric model and the predicted textured 3D model, wherein the 3D clothed human model comprises at least a 3D shape of the human in the 2D image with reconstructed surface textures in a reconstructed 3D space. 9 . The apparatus of claim 8 , wherein the aligning code further causes the processor to: transfer the texture from the textured 3D model to the parametric model; and transfer the pose, shape, and the one or more rigging parameters from the parametric model to the textured 3D model. 10 . The apparatus of claim 8 , wherein the parametric model is generated to fit the 2D image input. 11 . The apparatus of claim 8 , wherein the parametric model and textured 3D model are aligned by minimizing a difference of one or more distance fields in 3D space. 12 . The apparatus of claim 8 , wherein the textured 3D clothed human model is animated by a motion capture dataset. 13 . The apparatus of claim 8 , wherein the parametric model is based on any one of the following statistical human models: SMPL, SMPL-X, or STAR. 14 . The apparatus of claim 8 , wherein the textured 3D model using implicit surface reconstruction is based on a PIFu scheme from the 2D image. 15 . A non-transitory computer readable medium having stored thereon computer code which, when executed by at least one processor, causes the at least one processor to at least: receive a two-dimensional (2D) image of a human; segment the 2D image into a foreground with the human and a background without the human in the 2D image; generate a parametric model comprising a pose, a shape, and one or more rigging parameters based on the human in the foreground; predict a textured three-dimensional (3D) model using implicit surface reconstruction of the human in the foreground; align the parametric model and the textured 3D model using a 3D registration; and generate a textured 3D clothed human model based on the aligned parametric model and the predicted textured 3D model, wherein the 3D clothed human model comprises at least a 3D shape of the human in the 2D image with reconstructed surface textures in a reconstructed 3D space. 16 . The non-transitory computer readable medium according to claim 15 , wherein the instructions to cause the at least one processor to align the parametric model and the textured 3D model further cause the processor to: transfer the texture from the textured 3D model to the parametric model; and transfer the pose, the shape, and the one or more rigging parameters from the parametric model to the textured 3D model. 17 . The non-transitory computer readable medium according to claim 15 , wherein the parametric model is generated to fit the 2D image input. 18 . The non-transitory computer readable medium according to claim 15 , wherein the parametric model and textured 3D model are aligned by minimizing a difference of one or more distance fields in 3D space. 19 . The non-transitory computer readable medium according to claim 15 , wherein the textured 3D clothed human model is animated by a motion capture dataset. 20 . The non-transitory computer readable medium according to claim 15 , wherein the parametric model is based on any one of the following statistical human models: SMPL, SMPL-X, or STAR.

Assignees

Inventors

Classifications

  • G06V40/103Primary

    Static body considered as a whole, e.g. static pedestrian or occupant recognition · CPC title

  • Human being; Person · CPC title

  • involving models · CPC title

  • Texture mapping · CPC title

  • of characters, e.g. humans, animals or virtual beings · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2023281921A1 cover?
A method for 3D human model reconstruction and animation includes receiving a two-dimensional (2D) image of a human, segmenting the 2D image into a foreground with the human and a background without the human in the 2D image, generating a parametric model comprising a pose, a shape, and one or more rigging parameters based on the human in the foreground, predicting a textured three-dimensional …
Who is the assignee on this patent?
Tencent America LLC
What technology area does this patent fall under?
Primary CPC classification G06V40/103. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Sep 07 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).