What technology area does this patent fall under?

Primary CPC classification G06T17/00. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Feb 10 2026 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Method and apparatus for processing action of virtual object, and storage medium

US12548244B2 · US · B2

Patent metadata
Field	Value
Publication number	US-12548244-B2
Application number	US-202318139929-A
Country	US
Kind code	B2
Filing date	Apr 26, 2023
Priority date	Jul 7, 2021
Publication date	Feb 10, 2026
Grant date	Feb 10, 2026

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method and apparatus for processing an action of a virtual object, and a storage medium are provided. The method specifically includes: receiving an action instruction, the action instruction including: an action identifier and time-dependent information of performing an action associated with the action identifier; determining an action video frame sequence corresponding to the action identifier; determining, from the action video frame sequence, an action state image corresponding to a preset state image of the virtual object at a target time, the target time being determined according to the time-dependent information; generating a connection video frame sequence according to the action state image, the connection video frame sequence connecting the preset state image with the action video frame sequence; and splicing the connection video frame sequence with the action video frame sequence, to obtain an action video. Embodiments of this application can improve action processing efficiency of a virtual object.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method for processing an action of a virtual object performed by a computer device, the method comprising: receiving an action instruction, the action instruction comprising: an action identifier and time-dependent information of performing an action associated with the action identifier; determining, among a plurality of pre-generated action video frame sequences, each action video frame sequence corresponding to an action performed by the virtual object and having a corresponding action identifier, an action video frame sequence corresponding to the action identifier; determining, from the action video frame sequence, an action state image corresponding to a preset state image of the virtual object at a target time, the target time being determined according to the time-dependent information, further including: comparing a visual feature corresponding to the preset state image with a visual feature corresponding to a candidate action state image in the action video frame sequence to obtain a match value between the preset state image and the candidate action state image; and choosing the candidate action state image having a maximum match value with the preset state image as the action state image corresponding to the preset state image of the virtual object at the target time; obtaining a pair of images including an aligned preset state image and an aligned action state image by performing pose information alignment on the preset state image and the action state image that further improves a matching degree between the virtual object in the preset state image and the virtual object in the action state image; generating a connection video frame sequence according to the aligned preset state image and the aligned action state image, the connection video frame sequence representing a transition from a preset state corresponding to the preset state image to an action state corresponding to the action state image and connecting the preset state image with the action video frame sequence; and splicing the connection video frame sequence with the action video frame sequence, to obtain an action video. 2 . The method according to claim 1 , wherein the generating a connection video frame sequence according to the action state image comprises: determining optical flow features separately corresponding to the action state image; and generating the connection video frame sequence according to the optical flow features. 3 . The method according to claim 1 , wherein the generating a connection video frame sequence according to the action state image comprises: determining optical flow features and texture features and/or deep features separately corresponding to the action state image; and generating the connection video frame sequence according to the optical flow features and the texture features and/or the deep features. 4 . The method according to claim 1 , wherein the pose information includes position information and posture information of the virtual object in the preset state image and the action state image. 5 . The method according to claim 1 , further comprising: extracting a part preset state image from the preset state image, and determining, on the basis of three-dimensional reconstruction, a third visual feature corresponding to the part preset state image; extracting a part action state image from the action state image, and determining, on the basis of three-dimensional reconstruction, a fourth visual feature corresponding to the part action state image; generating a part connection video frame sequence according to the third visual feature and the fourth visual feature; and adding the part connection video frame sequence to the connection video frame sequence. 6 . The method according to claim 1 , wherein the time-dependent information comprises: text information corresponding to the action identifier. 7 . A computer device, comprising a processor and a memory, the memory storing a program, the program, when executed by the processor, causing the computer device to perform a method for processing an action of a virtual object including: receiving an action instruction, the action instruction comprising: an action identifier and time-dependent information of performing an action associated with the action identifier; determining, among a plurality of pre-generated action video frame sequences, each action video frame sequence corresponding to an action performed by the virtual object and having a corresponding action identifier, an action video frame sequence corresponding to the action identifier; determining, from the action video frame sequence, an action state image corresponding to a preset state image of the virtual object at a target time, the target time being determined according to the time-dependent information, further including: comparing a visual feature corresponding to the preset state image with a visual feature corresponding to a candidate action state image in the action video frame sequence to obtain a match value between the preset state image and the candidate action state image; and choosing the candidate action state image having a maximum match value with the preset state image as the action state image corresponding to the preset state image of the virtual object at the target time; obtaining a pair of images including an aligned preset state image and an aligned action state image by performing pose information alignment on the preset state image and the action state image that further improves a matching degree between the virtual object in the preset state image and the virtual object in the action state image; generating a connection video frame sequence according to the aligned preset state image and the aligned action state image, the connection video frame sequence representing a transition from a preset state corresponding to the preset state image to an action state corresponding to the action state image and connecting the preset state image with the action video frame sequence; and splicing the connection video frame sequence with the action video frame sequence, to obtain an action video. 8 . The computer device according to claim 7 , wherein the generating a connection video frame sequence according to the action state image comprises: determining optical flow features separately corresponding to the action state image; and generating the connection video frame sequence according to the optical flow features. 9 . The computer device according to claim 7 , wherein the generating a connection video frame sequence according to the action state image comprises: determining optical flow features and texture features and/or deep features separately corresponding to the action state image; and generating the connection video frame sequence according to the optical flow features and the texture features and/or the deep features. 10 . The computer device according to claim 7 , wherein the pose information includes position information and posture information of the virtual object in the preset state image and the action state image. 11 . The computer device according to claim 7 , wherein the method further comprises: extracting a part preset state image from the preset state image, and determining, on the basis of three-dimensional reconstruction, a third visual feature corresponding to the part preset state image; extracting a part action state image from the action state image, and determining, on the basis of three-dimensional reconstruction, a fourth visual feature corresponding to the part action state image; generating a part connection video frame sequence according to th

Assignees

Beijing Sogou Tech Dev Co

Inventors

Classifications

H04N5/265
Mixing · CPC title
G06V10/44
Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components · CPC title
G06V10/24
Aligning, centring, orientation detection or correction of the image · CPC title
G06V10/761
Proximity, similarity or dissimilarity measures · CPC title
G06V10/54
relating to texture · CPC title

Patent family

Related publications grouped by family.

View patent family 78416891

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12548244B2 cover?: A method and apparatus for processing an action of a virtual object, and a storage medium are provided. The method specifically includes: receiving an action instruction, the action instruction including: an action identifier and time-dependent information of performing an action associated with the action identifier; determining an action video frame sequence corresponding to the action identi…
Who is the assignee on this patent?: Beijing Sogou Tech Dev Co
What technology area does this patent fall under?: Primary CPC classification G06T17/00. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Feb 10 2026 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).

How to read this patent

Abstract

First claim

Assignees

Inventors

Classifications

Patent family

External sources

Related patents

Apparatus and method for recognizing whether action objective is achieved

Methods and apparatus to generate temporal representations for action recognition systems

System and method for vision-based joint action and pose motion forecasting

Image processing method and apparatus, storage medium, and computer device

Dynamic current results for second device

Method for generating image and electronic device thereof

Frequently asked questions