System and method for generating character poses using deep learning

US12254550B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12254550-B2
Application numberUS-202318201041-A
CountryUS
Kind codeB2
Filing dateMay 23, 2023
Priority dateSep 4, 2020
Publication dateMar 18, 2025
Grant dateMar 18, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method of generating or modifying poses in an animation of a character are disclosed. Variable numbers and types of supplied inputs are combined into a single input. The variable numbers and types of supplied inputs correspond to one or more effector constraints for one or more joints of the character. The single input is transformed into a pose embedding. The pose embedding includes a machine-learned representation of the single input. The pose embedding is expanded into a pose representation output. The pose representation output includes local rotation data and global position data for the one or more joints of the character.

First claim

Opening claim text (preview).

We claim: 1. A system comprising: one or more computer processors; one or more computer memories; a set of instructions stored in the one or more computer memories, the set of instructions configuring the one or more computer processors to perform operations, the operations comprising: combining a set of zero or more inputs including one or more different effector types for each of one or more joints of a character into a single input; transforming the single input into a pose embedding, the pose embedding including a machine-learned representation of the single input; and expanding the pose embedding into a pose representation output, the pose representation output including rotation data and position data for the one or more joints of the character. 2. The system of claim 1 , wherein the expanding of the pose embedding into the pose representation output includes translating the pose embedding into predictions of 3D joint positions for the one or more joints, the translating using a global position decoder machine-learned model, and generating the rotation data for the one or more joints, the generating using an inverse kinematic decoder machine-learned model. 3. The system of claim 2 , wherein one or more of the global position decoder machine-learned model and the inverse kinematic decoder machine-learned model are trained using a variable number and type of input effector constraints. 4. The system of claim 1 , the operations further comprising: transforming the rotation data by applying a forward kinematics pass to the pose representation output, the transforming including using one or more skeleton kinematic equations. 5. The system of claim 4 , wherein the transforming includes using one or more neural networks that are programmable for use with each of variable numbers and types of the set of zero or more inputs without requiring retraining of the one or more neural networks. 6. The system of claim 1 , wherein the transforming of the single input is performed by a pose encoder machine-learned model, wherein the pose encoder machine-learned model is trained using a variable number and type of input effector constraints. 7. The system of claim 1 , wherein the transforming includes applying a machine-learned model that, during training, uses a combination loss function, the combination loss function including rotation and position error terms via randomized weights based on randomly-generated effector tolerance levels. 8. The system of claim 1 , wherein a first joint of the one or more joints is constrained by 3D position coordinates and global rotation and a second joint of the one or more joints is constrained with a look-at effector. 9. A non-transitory computer-readable storage medium storing a set of instructions that, when executed by one or more computer processors, causes the one or more computer processors to perform operations, the operations comprising: combining a set of zero or more inputs including one or more different effector types for each of one or more joints of a character into a single input; transforming the single input into a pose embedding, the pose embedding including a machine-learned representation of the single input; and expanding the pose embedding into a pose representation output, the pose representation output including rotation data and position data for the one or more joints of the character. 10. The non-transitory computer-readable storage medium of claim 9 , wherein the expanding of the pose embedding into the pose representation output includes translating the pose embedding into predictions of 3D joint positions for the one or more joints, the translating using a global position decoder machine-learned model, and generating the rotation data for the one or more joints, the generating using an inverse kinematic decoder machine-learned model. 11. The non-transitory computer-readable storage medium of claim 10 , wherein one or more of the global position decoder machine-learned model and the inverse kinematic decoder machine-learned model are trained using a variable number and type of input effector constraints. 12. The non-transitory computer-readable storage medium of claim 9 , the operations further comprising: transforming the rotation data by applying a forward kinematics pass to the pose representation output, the transforming including using one or more skeleton kinematic equations. 13. The non-transitory computer-readable storage medium of claim 9 , wherein the transforming includes using one or more neural networks that are programmable for use with each of variable numbers and types of the set of zero or more inputs without requiring retraining of the one or more neural networks. 14. The non-transitory computer-readable storage medium of claim 9 , wherein the transforming of the single input is performed by a pose encoder machine-learned model, wherein the pose encoder machine-learned model is trained using a variable number and type of input effector constraints. 15. The non-transitory computer-readable storage medium of claim 9 , wherein the transforming includes applying a machine-learned model that, during training, uses a combination loss function, the combination loss function including rotation and position error terms via randomized weights based on randomly-generated effector tolerance levels. 16. A method comprising: combining a set of zero or more inputs including one or more different effector types for each of one or more joints of a character into a single input; transforming the single input into a pose embedding, the pose embedding including a machine-learned representation of the single input; and expanding the pose embedding into a pose representation output, the pose representation output including rotation data and position data for the one or more joints of the character. 17. The method of claim 16 , wherein the expanding of the pose embedding into the pose representation output includes translating the pose embedding into predictions of 3D joint positions for the one or more joints, the translating using a global position decoder machine-learned model, and generating the rotation data for the one or more joints, the generating using an inverse kinematic decoder machine-learned model. 18. The method of claim 17 , wherein one or more of the global position decoder machine-learned model and the inverse kinematic decoder machine-learned model are trained using a variable number and type of input effector constraints. 19. The method of claim 16 , further comprising: transforming the rotation data by applying a forward kinematics pass to the pose representation output, the transforming including using one or more skeleton kinematic equations. 20. The method of claim 19 , wherein the transforming includes using one or more neural networks that are programmable for use with each of variable numbers and types of the set of zero or more inputs without requiring retraining of the one or more neural networks.

Assignees

Inventors

Classifications

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12254550B2 cover?
A method of generating or modifying poses in an animation of a character are disclosed. Variable numbers and types of supplied inputs are combined into a single input. The variable numbers and types of supplied inputs correspond to one or more effector constraints for one or more joints of the character. The single input is transformed into a pose embedding. The pose embedding includes a machin…
Who is the assignee on this patent?
Unity IPR ApS
What technology area does this patent fall under?
Primary CPC classification G06T13/40. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Mar 18 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).