Text-driven motion recommendation and neural mesh stylization system and a method for producing human mesh animation using the same

US12561878B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12561878-B2
Application numberUS-202418440889-A
CountryUS
Kind codeB2
Filing dateFeb 13, 2024
Priority dateFeb 13, 2023
Publication dateFeb 24, 2026
Grant dateFeb 24, 2026

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present disclosure provides a text-driven motion recommendation and neural mesh stylization system and a method producing human mesh animation using the same. The system comprises at least one instruction stored in a memory, and a processor that executes the at least one instruction, wherein the at least one instruction, when executed by the processor, causes the processor to find raw action labels matching a query given as a text prompt in a human motion dataset stored in a database, encode the raw action labels and the query for vectorizing the raw action labels and the query, and measure similarity between the raw action labels and the query based on the vectorized vectors.

First claim

Opening claim text (preview).

What is claimed is: 1 . A text-driven motion recommendation and neural mesh stylization system comprising: at least one instruction stored in a memory; and a processor that executes the at least one instruction, wherein the at least one instruction, when executed by the processor, causes the processor to: find raw action labels matching a query given as a text prompt in a human motion dataset stored in a database; encode the raw action labels and the query for vectorizing the raw action labels and the query; measure similarity between the raw action labels and the query based on vectorized vectors to obtain content meshes; obtain style attributes comprising color and displacement from a decoupled neural style field (DNSF) network that takes a template human mesh and learn text-driven style attributes; and apply the style attributes to the content meshes to obtain a human mesh sequence in motion. 2 . The system of claim 1 , wherein the at least one instruction, when executed by the processor, further causes the processor to: select a plurality of indices of the raw action labels based on the measured similarity; and retrieve top-k action labels corresponding the plurality of indices by a top-k filter from encoded motion datasets with the raw action labels. 3 . The system of claim 2 , wherein the at least one instruction, when executed by the processor, further causes the processor to: vectorize the query and the top-k action labels; and retrieve a highest-scored raw action label as a final matched result for the input text prompt. 4 . The system of claim 3 , wherein the at least one instruction, when executed by the processor, further causes the processor to: find a best semantically matched motion sequence from a motion database based on the highest-scored raw action label; and sample the content meshes in multi-modal context corresponding to the best semantically matched motion sequence. 5 . The system of claim 1 , wherein the at least one instruction, when executed by the processor, further causes the processor to: map the style attributes from the template human mesh and merge the style attributes mapped from the template human mesh with the content meshes by the DNSF network. 6 . The system of claim 5 , wherein the at least one instruction, when executed by the processor, further causes the processor to: achieve a same mesh stylization as a basic neural style field while decoupling a style from a content mesh. 7 . The system of claim 1 , wherein the at least one instruction, when executed by the processor, further causes the processor to: detailize and texturize the human mesh sequence by optimizing the DNSF network in a temporally-consistent and pose-agnostic manner. 8 . The system of claim 7 , wherein the at least one instruction, when executed by the processor, further causes the processor to: compute a semantic loss between the text prompt and a text obtained by encoding the detailized and texturized human mesh sequence for optimizing the DNSF network. 9 . A method for producing human mesh animation performed by a processor, the method comprising: finding raw action labels matching a query given as a text prompt in a human motion dataset stored in a database; encoding the raw action labels and the query for vectorizing the raw action labels and the query; measuring similarity between the raw action labels and the query based on vectorized vectors to obtain content meshes; obtaining style attributes comprising color and displacement from a decoupled neural style field (DNSF) network that takes a template human mesh and learn text-driven style attributes; and applying the style attributes to the content meshes to obtain a human mesh sequence in motion. 10 . The method of claim 9 , further comprising: selecting a plurality of indices of the raw action labels based on the measured similarity; and retrieving top-k action labels corresponding the plurality of indices by a top-k filter from encoded motion datasets with the raw action labels. 11 . The method of claim 10 , further comprising: vectorizing the query and the top-k action labels; and retrieving a highest-scored raw action label as a final matched result for the text prompt. 12 . The method of claim 11 , further comprising: finding a best semantically matched motion sequence from a motion database based on the highest-scored raw action label; and sampling the content meshes in multi-modal context corresponding to the best semantically matched motion sequence. 13 . The method of claim 9 , further comprising: mapping the style attributes from the template human mesh and merge the style attributes mapped from the template human mesh with the content meshes by the DNSF network. 14 . The method of claim 13 , further comprising: achieving a same mesh stylization as a basic neural style field while decoupling a style from a content mesh. 15 . The method of claim 9 , further comprising: detailizing and texturizing the human mesh sequence by optimizing the DNSF network in a temporally-consistent and pose-agnostic manner. 16 . The method of claim 15 , further comprising: computing a semantic loss between the text prompt and a text obtained by encoding the detailized and texturized human mesh sequence for optimizing the DNSF network.

Assignees

Inventors

Classifications

  • Texture mapping · CPC title

  • Style variation · CPC title

  • G06F40/30Primary

    Semantic analysis · CPC title

  • Colour editing, changing, or manipulating; Use of colour codes · CPC title

  • Finite element generation, e.g. wire-frame surface description, {tesselation} · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12561878B2 cover?
The present disclosure provides a text-driven motion recommendation and neural mesh stylization system and a method producing human mesh animation using the same. The system comprises at least one instruction stored in a memory, and a processor that executes the at least one instruction, wherein the at least one instruction, when executed by the processor, causes the processor to find raw actio…
Who is the assignee on this patent?
Postech Res & Business Dev Found
What technology area does this patent fall under?
Primary CPC classification G06F40/30. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 24 2026 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).