Flow-guided motion retargeting

US12056792B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12056792-B2
Application numberUS-202117557834-A
CountryUS
Kind codeB2
Filing dateDec 21, 2021
Priority dateDec 30, 2020
Publication dateAug 6, 2024
Grant dateAug 6, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods herein describe a motion retargeting system. The motion retargeting system accesses a plurality of two-dimensional images comprising a person performing a plurality of body poses, extracts a plurality of implicit volumetric representations from the plurality of body poses, generates a three-dimensional warping field, the three-dimensional warping field configured to warp the plurality of implicit volumetric representations from a canonical pose to a target pose, and based on the three-dimensional warping field, generates a two-dimensional image of an artificial person performing the target pose.

First claim

Opening claim text (preview).

What is claimed is: 1. A method comprising: accessing a plurality of two-dimensional images comprising a person performing a plurality of body poses; extracting, using a first neural network, a plurality of implicit volumetric representations from the plurality of body poses; generating a three-dimensional warping field, the three-dimensional warping field configured to warp the plurality of implicit volumetric representations from a canonical pose to a target pose; and based on the three-dimensional warping field, generating a two-dimensional image of an artificial person performing the target pose. 2. The method of claim 1 , wherein each implicit volumetric representation of the plurality of volumetric representations is in a unique resolution. 3. The method of claim 1 wherein accessing the plurality of two-dimensional images further comprises: for each image of the plurality of two-dimensional images: extracting a foreground region using a pre-trained segmentation model. 4. The method of claim 1 , wherein the first neural network comprises a two-dimensional encoding network and a three-dimensional encoding network. 5. The method of claim 1 , wherein the generated two-dimensional image is generated by a second neural network. 6. The method of claim 5 , wherein the second neural network comprises a two-dimensional decoding network and a three-dimensional decoding network. 7. The method of claim 1 , wherein generating the two-dimensional image further comprises: applying an image transformation to the two-dimensional image. 8. The method of claim 7 , wherein the image transformation is a rotation. 9. A system comprising: a processor; and a memory storing instructions that, when executed by the processor, configure the system to perform operations comprising: accessing a plurality of two-dimensional images comprising a person performing a plurality of body poses; extracting, using a first neural network, a plurality of implicit volumetric representations from the plurality of body poses; generating a three-dimensional warping field, the three-dimensional warping field configured to warp the plurality of implicit volumetric representations from a canonical pose to a target pose; and based on the three-dimensional warping field, generating a two-dimensional image of an artificial person performing the target pose. 10. The system of claim 9 , wherein each implicit volumetric representation of the plurality of volumetric representations is in a unique resolution. 11. The system of claim 9 wherein accessing the plurality of two-dimensional images further comprises: for each image of the plurality of two-dimensional images: extracting a foreground region using a pre-trained segmentation model. 12. The system of claim 9 , wherein the first neural network comprises a two-dimensional encoding network and a three-dimensional encoding network. 13. The system of claim 9 , wherein the generated two-dimensional image is generated by a second neural network. 14. The system of claim 13 , wherein the second neural network comprises a two-dimensional decoding network and a three-dimensional decoding network. 15. The system of claim 9 , wherein generating the two-dimensional image further comprises: applying an image transformation to the two-dimensional image. 16. The system of claim 15 , wherein the image transformation is a rotation. 17. A non-transitory computer-readable storage medium, the computer-readable storage medium including instructions that when executed by a computer, cause the computer to perform operations comprising: accessing a plurality of two-dimensional images comprising a person performing a plurality of body poses; extracting, using a first neural network, a plurality of implicit volumetric representations from the plurality of body poses; generating a three-dimensional warping field, the three-dimensional warping field configured to warp the plurality of implicit volumetric representations from a canonical pose to a target pose; and based on the three-dimensional warping field, generating a two-dimensional image of an artificial person performing the target pose. 18. The computer-readable storage medium of claim 17 , wherein each implicit volumetric representation of the plurality of volumetric representations is in a unique resolution. 19. The computer-readable storage medium of claim 17 wherein accessing the plurality of two-dimensional images further comprises: for each image of the plurality of two-dimensional images: extracting a foreground region using a pre-trained segmentation model. 20. The computer-readable storage medium of claim 17 , wherein the first neural network comprises a two-dimensional encoding network and a three-dimensional encoding network.

Assignees

Inventors

Classifications

  • Adversarial learning · CPC title

  • Auto-encoder networks; Encoder-decoder networks · CPC title

  • Supervised learning · CPC title

  • Weakly supervised learning, e.g. semi-supervised or self-supervised learning · CPC title

  • Convolutional networks [CNN, ConvNet] · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12056792B2 cover?
Systems and methods herein describe a motion retargeting system. The motion retargeting system accesses a plurality of two-dimensional images comprising a person performing a plurality of body poses, extracts a plurality of implicit volumetric representations from the plurality of body poses, generates a three-dimensional warping field, the three-dimensional warping field configured to warp the…
Who is the assignee on this patent?
Snap Inc
What technology area does this patent fall under?
Primary CPC classification G06T11/00. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 06 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).