Method for detecting hand key points, method for recognizing gesture, and related devices

US12299920B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12299920-B2
Application numberUS-202017780694-A
CountryUS
Kind codeB2
Filing dateAug 7, 2020
Priority dateNov 29, 2019
Publication dateMay 13, 2025
Grant dateMay 13, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Provided is a method for detecting hand key points. The method includes: acquiring a hand image to be detected; acquiring heat maps of the hand key points by inputting the hand image into a pre-trained heat map model, wherein the heat maps include two-dimensional coordinates of the hand key points; acquiring hand structured connection information by inputting the heat maps and the hand image into a pre-trained three-dimensional information prediction model; and determining, based on the hand structured connection information and the two-dimensional coordinates in the heat maps, three-dimensional coordinates of the hand key points in a world coordinate system.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for detecting hand key points, comprising: acquiring a hand image to be detected; acquiring heat maps of the hand key points by inputting the hand image into a pre-trained heat map model, wherein the heat maps comprise two-dimensional coordinates of the hand key points; acquiring hand structured connection information by inputting the heat maps and the hand image into a pre-trained three-dimensional information prediction model; determining, based on the hand structured connection information and the two-dimensional coordinates in the heat maps, three-dimensional coordinates of the hand key points in a world coordinate system; and recognizing, based on three-dimensional coordinates of the hand key points in the world coordinate system, a gesture expressed by a hand in the hand image to detect the hand key points in real time through a mobile terminal; wherein the hand structured connection information comprises joint bending angles formed by the hand key points and Euler angles of the hand; and determining, based on the hand structured connection information and the two-dimensional coordinates in the heat maps, the three-dimensional coordinates of the hand key points in the world coordinate system comprises: calculating, based on the joint bending angles, a first direction vector of a vector formed by two hand key points in a hand coordinate system of the hand; converting the first direction vector into a second direction vector in the world coordinate system based on the Euler angles; calculating a vector length of the vector based on the two-dimensional coordinates in the heat maps; acquiring the vector by calculating a product of the vector length and the second direction vector; and calculating the three-dimensional coordinates of the two hand key points forming the vector in the world coordinate system based on the vector. 2. The method according to claim 1 , wherein acquiring the hand image to be detected comprises: acquiring an original image; detecting a hand from the original image; and capturing an image having a preset size and containing the hand as the hand image to be detected. 3. The method according to claim 1 , wherein acquiring the heat maps of the hand key points by inputting the hand image into the pre-trained heat map model comprises: acquiring the heat map of each hand key point by inputting the hand image into the pre-trained heat map model, wherein a size of the heat map of each hand key point is equal to a size of the hand image. 4. The method according to claim 1 , wherein the heat maps comprise heat maps of all hand key points, and acquiring the hand structured connection information by inputting the heat maps and the hand image into the pre-trained three-dimensional information prediction model comprises: acquiring joint bending angles formed by the hand key points and Euler angles of the hand by inputting the heat maps of all hand key points and the hand image into the pre-trained three-dimensional information prediction model. 5. The method according to claim 1 , wherein calculating, based on the joint bending angles, the first direction vector of the vector formed by the two hand key points in the hand coordinate system of the hand comprises: determining, based on a pre-established hand model, a first direction vector of a vector from a wrist key point to a proximal phalanx key point of a middle finger; calculating a first direction vector of a vector from the wrist key point to a proximal phalanx key point of each finger based on the joint bending angles and the first direction vector of the vector from the wrist key point to the proximal phalanx key point of the middle finger; and calculating the first direction vector of the vector between the two key points connected by the phalanxes of each finger based on the joint bending angles and the first direction vector of the vector from the wrist key point to the proximal phalanx key point of each finger. 6. The method according to claim 1 , wherein converting the first direction vector into the second direction vector in the world coordinate system based on the Euler angles comprises: calculating an Euler rotation matrix based on the Euler angles; and acquiring the second direction vector of the first direction vector in the world coordinate system by calculating a product of the first direction vector and the Euler rotation matrix. 7. The method according to claim 1 , wherein calculating the vector length of the vector based on the two-dimensional coordinates in the heat maps comprises: determining the two hand key points forming the vector; determining, based on the heat maps of the two hand key points, two-dimensional coordinates of the two hand key points; and calculating the vector length of the vector based on the two-dimensional coordinates of the two hand key points. 8. The method according to claim 7 , wherein each pixel point on the heat maps is associated with a probability value, wherein the probability value represents a probability of the hand key point at each pixel point; and determining, based on the heat maps of the two hand key points, the two-dimensional coordinates of the two hand key points comprises: determining a pixel point with a greatest probability value from the heat map of each hand key point; acquiring local two-dimensional coordinates by acquiring coordinates of the pixel point with the greatest probability value in the heat map; and acquiring the two-dimensional coordinates of each hand key point by converting the local two-dimensional coordinates to coordinates in the hand image. 9. The method according to claim 1 , wherein calculating the three-dimensional coordinates of the two hand key points forming the vector in the world coordinate system based on the vector comprises: acquiring the three-dimensional coordinates of a wrist key point in the hand key points in the world coordinate system; and calculating the three-dimensional coordinates of the two hand key points forming the vector in the world coordinate system based on the three-dimensional coordinates of the wrist key point in the world coordinate system and the vector. 10. A method for recognizing a gesture, comprising: acquiring a hand image to be recognized; detecting key points in the hand image; and recognizing, based on the key points, the gesture expressed by a hand in the hand image; wherein detecting the key points in the hand image comprises: detecting the key points in the hand image according to a method for detecting hand key points, comprising: acquiring a hand image to be detected; acquiring heat maps of the hand key points by inputting the hand image into a pre-trained heat map model, wherein the heat maps comprise two-dimensional coordinates of the hand key points; acquiring hand structured connection information by inputting the heat maps and the hand image into a pre-trained three-dimensional information prediction model; determining, based on the hand structured connection information and the two-dimensional coordinates in the heat maps, three-dimensional coordinates of the hand key points in a world coordinate system; and recognizing, based on three-dimensional coordinates of the hand key points in the world coordinate system, the gesture expressed by the hand in the hand image to detect the hand key points in real time through a mobile terminal; wherein the hand structured connection information comprises joint bending angles formed by the hand key points and Euler angles of the hand; and determining, based on the hand structured connection information and the two-dimensional coordinates in the heat maps, the three-dimensional coordinat

Assignees

Inventors

Classifications

  • Human being; Person · CPC title

  • Probabilistic image processing · CPC title

  • Movements or behaviour, e.g. gesture recognition (recognition of facial expressions G06V40/16) · CPC title

  • Hand-related biometrics; Hand pose recognition · CPC title

  • Generating training patterns; Bootstrap methods, e.g. bagging or boosting · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12299920B2 cover?
Provided is a method for detecting hand key points. The method includes: acquiring a hand image to be detected; acquiring heat maps of the hand key points by inputting the hand image into a pre-trained heat map model, wherein the heat maps include two-dimensional coordinates of the hand key points; acquiring hand structured connection information by inputting the heat maps and the hand image in…
Who is the assignee on this patent?
Bigo Tech Pte Ltd
What technology area does this patent fall under?
Primary CPC classification G06T7/73. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 13 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 7 related publications on this page (citations in our corpus or others sharing the same primary CPC).