Action recognition and pose estimation method and apparatus

US11478169B2 · US · B2

Patent metadata
Field: Value
Publication number: US-11478169-B2
Application number: US-202016846890-A
Country: US
Kind code: B2
Filing date: Apr 13, 2020
Priority date: Oct 13, 2017
Publication date: Oct 25, 2022
Grant date: Oct 25, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Action recognition methods are disclosed. An embodiment of the methods includes: identifying a video that comprises images of a human body to be processed; identifying at least one image to be processed, wherein the at least one image is at least one of an optical flow image generated based on a plurality of frames of images in the video, or a composite image of one or more frames of images in the video; performing convolution on the at least one image to obtain a plurality of eigenvectors, wherein the plurality of eigenvectors indicate a plurality of features of different locations in the at least one image; determining a weight coefficient set of each of a plurality of human joints of the human body based on the plurality of eigenvectors, wherein the weight coefficient set comprises a weight coefficient of each of the plurality of eigenvectors for the human joint; weighting the plurality of eigenvectors based on the weight coefficient set to obtain an action feature of each of the plurality of human joints; determining an action feature of the human body based on the action feature of each of the human joints; and determining an action type of the human body based on the action feature of the human body.
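
The weighting steps in the abstract can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the patented implementation. It assumes the convolution has already produced one feature vector (the abstract's "eigenvector") per image location, scores each location against a hypothetical per-joint query vector, and normalizes the scores with softmax to form the weight coefficient set. The abstract does not specify the scoring function, the normalization, or the classifier, so `joint_queries`, `class_weights`, the softmax step, and the use of concatenation for "combining" are all assumptions made for illustration.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def softmax(scores):
    # Assumed normalization; the abstract only requires one coefficient per feature vector.
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def joint_weight_sets(features, joint_queries):
    """One weight coefficient set per joint, with one coefficient per feature vector."""
    return [softmax([dot(f, q) for f in features]) for q in joint_queries]

def joint_action_features(features, weight_sets):
    """Weight the feature vectors with each joint's coefficients and sum them."""
    dim = len(features[0])
    return [
        [sum(w * f[d] for w, f in zip(weights, features)) for d in range(dim)]
        for weights in weight_sets
    ]

def body_action_feature(joint_features):
    # "Combining" is taken here to mean concatenation (an assumption).
    return [value for joint in joint_features for value in joint]

def classify(body_feature, class_weights):
    # Hypothetical linear classifier: pick the class with the highest score.
    scores = [dot(body_feature, w) for w in class_weights]
    return max(range(len(scores)), key=scores.__getitem__)

# Toy input: 4 image locations, 2 channels each, as if produced by a convolution.
features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.0, 0.0]]
joint_queries = [[2.0, 0.0], [0.0, 2.0]]   # hypothetical: one query per joint
class_weights = [[1.0, 0.0, 0.0, 0.0],     # hypothetical: one row per action type
                 [0.0, 1.0, 0.0, 0.0]]

weight_sets = joint_weight_sets(features, joint_queries)
joint_feats = joint_action_features(features, weight_sets)
body_feat = body_action_feature(joint_feats)
action = classify(body_feat, class_weights)
print(weight_sets, body_feat, action)
```

Each weight coefficient set sums to 1 in this sketch only because softmax was chosen; the abstract itself does not require normalized coefficients.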

First claim

Opening claim text (preview).

What is claimed is:

1. An action recognition method, comprising: identifying a video to be processed that comprises images of a human body; identifying at least one image to be processed, wherein the at least one image is at least one of an optical flow image generated based on a plurality of frames of images in the video, or a composite image of one or more frames of images in the video; performing convolution on the at least one image to obtain a plurality of eigenvectors, wherein the plurality of eigenvectors indicate a plurality of features of different locations in the at least one image; determining a weight coefficient set of each of a plurality of human joints of the human body based on the plurality of eigenvectors, wherein the weight coefficient set comprises a weight coefficient of each of the plurality of eigenvectors for the human joint a distance between a corresponding location of the first-type eigenvector in the at least one image and a corresponding location of the human joint in the at least one image is a first distance, a distance between a corresponding location of the second-type eigenvector in the at least one image and the corresponding location of the human joint in the at least one image is a second distance, the first distance is less than or equal to the second distance; weighting the plurality of eigenvectors based on the weight coefficient set to obtain an action feature of each of a plurality of human joints; determining an action feature of the human body based on the action feature of each of the human joints; and determining an action type of the human body based on the action feature of the human body.

2. The method according to claim 1, wherein the weight coefficient set of each of the human joints is further determined based on cumulative action information of the human body, wherein the cumulative action information of the human body indicates an action feature of the human body accumulated before a current time point.

3. The method according to claim 1, wherein the action feature of the human body is determined by weighting or combining the action feature of the plurality of human joints.

4. The method according to claim 1, wherein the action feature of the human body is determined by weighting or combining action feature of each of a plurality of human parts of the human body, and wherein the action feature of each of the plurality of human parts is determined by weighting or combining the action feature of each of the plurality of human joints.

5. The method according to claim 1, wherein the at least one image comprises the composite image and the optical flow image, and the performing convolution on the at least one image to obtain a plurality of eigenvectors comprises: performing convolution on the composite image to obtain a plurality of eigenvectors of the composite image; and performing convolution on the optical flow image to obtain a plurality of eigenvectors of the optical flow image; the determining the weight coefficient set comprises: determining a first-type weight coefficient set of each of the plurality of human joints based on the plurality of eigenvectors of the composite image; and determining a second-type weight coefficient set of each of the plurality of human joints based on the plurality of eigenvectors of the optical flow image; the weighting the plurality of eigenvectors based on the weight coefficient set comprises: weighting the plurality of eigenvectors of the composite image based on the first-type weight coefficient set to obtain a first-type action feature of each of the plurality of human joints; and weighting the plurality of eigenvectors of the optical flow image based on the second-type weight coefficient set to obtain a second-type action feature of each of the plurality of human joints; and wherein the action feature of the human body is determined by weighting or combining first-type action features of the plurality of human joints and second-type action features of the plurality of human joints.

6. A pose estimation method, comprising: identifying a video to be processed that comprises images of a human body; identifying at least one image to be processed, wherein the at least one image is at least one of an optical flow image generated based on a plurality of frames of images in the video, or a composite image of one or more frames of images in the video; performing convolution on the at least one image to obtain a plurality of eigenvectors, wherein the plurality of eigenvectors indicate a plurality of features of different locations in the at least one image; determining a weight coefficient set of each of a plurality of human joints of the human body based on the plurality of eigenvectors, wherein the weight coefficient set comprises a weight coefficient of each of the plurality of eigenvectors for the human joint, a weight coefficient of a first-type eigenvector is greater than or equal to a weight coefficient of a second-type eigenvector, a distance between a corresponding location of the first-type eigenvector in the at least one image and a corresponding location of the human joint in the at least one image is a first distance, a distance between a corresponding location of the second-type eigenvector in the at least one image and the corresponding location of the human joint in the at least one image is a second distance, the first distance is less than or equal to the second distance; determining a corresponding region of a first eigenvector of the plurality of eigenvectors in the at least one image as a location of the human joint, wherein a weight coefficient of the first eigenvector is a first weight coefficient, and the first weight coefficient is a weight coefficient greater than a preset threshold in the weight coefficient set of the human joint; and determining a pose of the human body based on the location of the human joint.

7. The method according to claim 6, wherein the weight coefficient set of each of the human joints is further determined based on cumulative action information of the human body, wherein the cumulative action information of the human body indicates an action feature of the human body accumulated before a current time point.

8. The method according to claim 6, wherein the at least one image comprises the composite image and the optical flow image, and the performing convolution on the at least one image to obtain a plurality of eigenvectors comprises: performing convolution on the composite image to obtain a plurality of eigenvectors of the composite image; and performing convolution on the optical flow image to obtain a plurality of eigenvectors of the optical flow image; the determining the weight coefficient set comprises: determining a first-type weight coefficient set of each of the plurality of human joints based on the plurality of eigenvectors of the composite image; and determining a second-type weight coefficient set of each of the plurality of human joints based on the plurality of eigenvectors of the optical flow image; and determining the weight coefficient set of each human joint based on the first-type weight coefficient set and the second-type weight coefficient set.

9. The method according to claim 8, wherein the weight coefficient set of each human joint is determined based on weighting the first-type weight coefficient set and the second-type weight coefficient set.

10. An action recognition device, comprising at least one processor and a non-transitory medium storing program instructions, wherein the at least one processor, by executing the program instructions, causes the action recognition device to: identifying a video to be processed
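
The pose estimation steps in claims 6, 8, and 9 can be sketched as follows. This is a hedged illustration, not the patented implementation. It assumes the weight coefficients of a joint are laid out row-major over a small feature grid, fuses the composite-image and optical-flow weight sets with a hypothetical mixing factor `alpha` (claim 9 only says the two sets are weighted), and reports every grid cell whose fused coefficient exceeds a preset threshold as a location of that joint. The joint names, grid size, and all numbers are made up for the example.

```python
def fuse_weight_sets(composite_weights, flow_weights, alpha=0.5):
    """Weight the first-type and second-type coefficient sets (alpha is an assumption)."""
    return [alpha * c + (1.0 - alpha) * f
            for c, f in zip(composite_weights, flow_weights)]

def locate_joint(weights, grid_width, threshold):
    """Return (row, col) cells whose weight coefficient is greater than the threshold.

    Assumes coefficients are stored row-major over the feature grid.
    """
    return [divmod(index, grid_width)
            for index, weight in enumerate(weights)
            if weight > threshold]

def estimate_pose(composite_sets, flow_sets, grid_width, threshold, alpha=0.5):
    """Map each joint name to the grid cells selected as its location."""
    pose = {}
    for joint, composite_weights in composite_sets.items():
        fused = fuse_weight_sets(composite_weights, flow_sets[joint], alpha)
        pose[joint] = locate_joint(fused, grid_width, threshold)
    return pose

# Toy 2x2 feature grid with hypothetical joints and coefficients.
composite_sets = {
    "left_wrist": [0.1, 0.7, 0.1, 0.1],
    "right_knee": [0.1, 0.1, 0.1, 0.7],
}
flow_sets = {
    "left_wrist": [0.2, 0.6, 0.1, 0.1],
    "right_knee": [0.1, 0.1, 0.2, 0.6],
}

pose = estimate_pose(composite_sets, flow_sets, grid_width=2, threshold=0.5)
print(pose)
```

Returning a list of cells rather than a single cell is a design choice in this sketch: more than one coefficient can exceed the threshold, and the claim text does not say how such ties would be resolved.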

Assignees

Huawei Tech Co Ltd

Inventors

Classifications

  • G06V40/20 · Primary

    Movements or behaviour, e.g. gesture recognition (recognition of facial expressions G06V40/16) · CPC title

  • Feature extraction, e.g. by transforming the feature space, e.g. multi-dimensional scaling [MDS]; Mappings, e.g. subspace methods · CPC title

  • A61B5/1128 · Primary

    using image analysis (A61B5/1127 takes precedence) · CPC title

  • Learning methods · CPC title

  • Determining posture transitions · CPC title

Patent family

Related publications grouped by family.

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11478169B2 cover?
Action recognition methods are disclosed. An embodiment of the methods includes: identifying a video that comprises images of a human body to be processed; identifying at least one image to be processed, wherein the at least one image is at least one of an optical flow image generated based on a plurality of frames of images in the video, or a composite image of one or more frames of images in …
Who is the assignee on this patent?
Huawei Tech Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06V40/20. Mapped technology areas include Physics.
When was this patent published?
Publication date Oct 25, 2022 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).