Hand gesture recognition for cursor control

US2017192515A1 · US · A1

Patent metadata
Field: Value
Publication number: US-2017192515-A1
Application number: US-201615197288-A
Country: US
Kind code: A1
Filing date: Jun 29, 2016
Priority date: Jan 5, 2016
Publication date: Jul 6, 2017
Grant date:

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system for hand gesture recognition is described herein. The system includes a display, camera, memory, and processor. The memory that is to store instructions and is communicatively coupled to the camera and the display. The processor is communicatively coupled to the camera, the display, and the memory. When the processor is to execute the instructions, the processor is to estimate one or more motion vectors of an object using a pair of consecutive frames and estimate an average motion vector of the object. The processor is also to obtain a descriptor based on histogram values from a histogram of optical flow (HOOF) of the one or more motion vectors and the average motion vector and classify the descriptor as a gesture.
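
The pipeline in the abstract (motion vectors, an average motion vector, a HOOF-based descriptor, then classification) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration rather than the method disclosed in the patent: the motion vectors are taken as already computed `(dx, dy)` pairs, the histogram is a simple direction-binned, magnitude-weighted variant of HOOF, and the nearest-template classifier stands in for the unspecified classification step. The names `hoof`, `average_vector`, `descriptor`, and `classify` are hypothetical.

```python
import math

def hoof(vectors, bins=8):
    """Direction histogram of flow vectors, weighted by magnitude.

    Assumed variant: `bins` equal angular sectors over [0, 2*pi),
    normalized so the histogram sums to 1 (all zeros if no motion).
    """
    hist = [0.0] * bins
    for dx, dy in vectors:
        mag = math.hypot(dx, dy)
        if mag == 0.0:
            continue  # a zero vector has no direction
        angle = math.atan2(dy, dx) % (2 * math.pi)
        idx = min(int(angle / (2 * math.pi) * bins), bins - 1)
        hist[idx] += mag
    total = sum(hist)
    return [h / total for h in hist] if total > 0 else hist

def average_vector(vectors):
    """Mean (dx, dy) of the given motion vectors."""
    n = len(vectors)
    if n == 0:
        return (0.0, 0.0)
    return (sum(v[0] for v in vectors) / n, sum(v[1] for v in vectors) / n)

def descriptor(frames, bins=8):
    """Concatenate histogram values and the average vector for each frame."""
    out = []
    for vectors in frames:
        out.extend(hoof(vectors, bins))
        out.extend(average_vector(vectors))
    return out

def classify(desc, templates):
    """Stand-in classifier: label of the nearest reference descriptor."""
    return min(templates, key=lambda label: math.dist(desc, templates[label]))

# Two frames of hypothetical motion vectors for one tracked object.
frames = [[(1.0, 1.0), (1.0, 1.0)], [(-1.0, 1.0)]]
desc = descriptor(frames, bins=4)
templates = {"swipe": list(desc), "still": [0.0] * len(desc)}
label = classify(desc, templates)
```

With `bins=4` and two frames, the descriptor has 2 × (4 + 2) = 12 values. A real system would obtain the vectors from an optical flow algorithm and would likely use a trained classifier.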

First claim

Opening claim text (preview).

What is claimed is:

1. A system for hand gesture recognition, comprising: a display; a camera; a memory that is to store instructions and that is communicatively coupled to the camera and the display; and a processor communicatively coupled to the camera, the display, and the memory, wherein when the processor is to execute the instructions, the processor is to: estimate one or more motion vectors of an object using a pair of consecutive frames; estimate an average motion vector of the object; obtain a descriptor based on histogram values from a histogram of optical flow (HOOF) of the one or more motion vectors and the average motion vector; and classify the descriptor as a gesture.
2. The system of claim 1, comprising: extracting a mask of the object using a depth image; and applying the optical flow to the mask of the object to obtain the one or more motion vectors.
3. The system of claim 1, wherein the histogram of the optical flow (HOOF) of the one or more motion vectors is built using a Lucas-Kanade optical flow algorithm.
4. The system of claim 1, wherein the histogram values and the average motion vectors are concatenated for a plurality of frames to obtain the descriptor.
5. The system of claim 1, wherein each descriptor includes histogram values and the average motion vector for K frames, wherein the value of K is dependent on the type of gesture.
6. The system of claim 1, wherein a machine learning algorithm is used to classify the descriptor as a gesture.
7. The system of claim 1, wherein the one or more motion vectors is calculated via an optical flow algorithm of a mask applied an infrared image, a red, green, blue (RGB) image, or a grayscale image.
8. The system of claim 1, wherein the average motion vector is estimated using good feature points are determined by corner tracking.
9. The system of claim 1, wherein the object is a hand.
10. The system of claim 1, wherein the gesture is used to control a cursor.
11. A method, comprising: extracting a hand mask using depth data; estimating a plurality of motion vectors based via an optical flow applied to the hand mask; estimating an average motion vector from the optical flow; generating a descriptor based on a histogram of optical flow applied to the hand mask and the average motion vector; and classifying the descriptor as a gesture.
12. The method of claim 11, wherein the histogram of optical flow is calculated for a plurality of cells of a frame, and a value is calculated for each bin of the histogram of optical flow by average values of the bin.
13. The method of claim 12, wherein the average bin values and the average motion vector are concatenated for a plurality of frames to generate the descriptor.
14. The method of claim 11, comprising: obtaining the plurality of motion vectors for a plurality of good feature points in a sequence of frames; selecting the motion vectors with a lowest length; and averaging the motion vectors with the lowest length to obtain the average motion vector.
15. The method of claim 11, wherein the descriptor is invariant to rotation and hand type.
16. The method of claim 11, wherein a movement corresponding to the gesture is applied to a cursor rendered on a display.
17. An apparatus for hand gesture recognition, comprising: an image capture mechanism to obtain a sequence of frames; an estimator to estimate a plurality of motion vectors and an average motion vector for each frame of the sequence of frames; a generator to generate a descriptor based on a histogram of the plurality of motion vectors and the average motion vector for the sequence of frames; and a classifier to classify the descriptor as a gesture.
18. The apparatus of claim 17, wherein the histogram is a histogram of optical flow, and the one or more motion vectors is projected onto the histogram of optical flow.
19. The apparatus of claim 17, wherein a frame is divided into a plurality of cells, and a histogram of optical flow is calculated for each cell, and each bin the histogram of optical flow is averaged to obtain the single frame feature vector that comprises histogram values.
20. The apparatus of claim 17, wherein histogram values and the average motion vector for each frame of the sequence of frames are concatenated to generate the descriptor.
21. The apparatus of claim 17, wherein each descriptor includes a set of histogram values and the average motion vector for K frames, wherein the value of K is dependent on the type of gesture.
22. A tangible, non-transitory, computer-readable medium comprising instructions that, when executed by a processor, direct the processor to: extract a hand mask using depth data; estimate a plurality of motion vectors based via an optical flow applied to the hand mask; estimate an average motion vector from the optical flow; generate a descriptor based on a histogram of optical flow applied to the hand mask and the average motion vector; and classify the descriptor as a gesture.
23. The computer readable medium of claim 22, wherein each descriptor includes a concatenated plurality of histogram values and the average motion vector for K frames, wherein the value of K is dependent on a type of the gesture.
24. The computer readable medium of claim 22, wherein a machine learning algorithm is used to classify the descriptor as a gesture.
25. The computer readable medium of claim 22, wherein the hand mask is applied to an infrared image, a red, green, blue (RGB) image, or a grayscale image to estimate the plurality of motion vectors.
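
Two of the dependent claims describe small averaging steps that a short sketch may make easier to picture: averaging each histogram bin across the cells of a frame (claims 12 and 19), and averaging only the shortest motion vectors to get the average motion vector (claim 14). The code below is one possible reading under stated assumptions, not the patent's implementation. The function names are hypothetical, and the `keep` parameter (how many of the shortest vectors to average) is an assumption, since the claims do not say how many vectors count as having "a lowest length".

```python
import math

def average_bins(cell_histograms):
    """Average each bin across per-cell histograms of equal length.

    Returns a single-frame feature vector with one value per bin.
    """
    if not cell_histograms:
        return []
    n = len(cell_histograms)
    return [sum(column) / n for column in zip(*cell_histograms)]

def average_of_shortest(vectors, keep=3):
    """Average the `keep` shortest (dx, dy) vectors.

    `keep` is a hypothetical parameter chosen for illustration.
    """
    shortest = sorted(vectors, key=lambda v: math.hypot(v[0], v[1]))[:keep]
    n = len(shortest)
    if n == 0:
        return (0.0, 0.0)
    return (sum(v[0] for v in shortest) / n, sum(v[1] for v in shortest) / n)

# Two cells with two-bin histograms, and four hypothetical feature-point vectors.
frame_bins = average_bins([[1.0, 0.0], [0.0, 1.0]])
avg = average_of_shortest([(10.0, 0.0), (1.0, 0.0), (0.0, 1.0), (2.0, 2.0)], keep=2)
```

Here the two unit-length vectors are selected and averaged, so the long outlier `(10.0, 0.0)` does not affect the result.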

Assignees

Intel Corp

Inventors

Classifications

  • Physics · mapped topic

  • Human being; Person · CPC title

  • Physics · mapped topic

  • Graph-based image processing · CPC title

  • G06F3/017 · Primary

    Gesture based interaction, e.g. based on a set of recognized hand gestures (interaction based on gestures traced on a digitiser G06F3/04883) · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2017192515A1 cover?
A system for hand gesture recognition is described herein. The system includes a display, camera, memory, and processor. The memory that is to store instructions and is communicatively coupled to the camera and the display. The processor is communicatively coupled to the camera, the display, and the memory. When the processor is to execute the instructions, the processor is to estimate one or m…
Who is the assignee on this patent?
Intel Corp
What technology area does this patent fall under?
Primary CPC classification G06F3/017. Mapped technology areas include Physics.
When was this patent published?
Publication date Jul 6, 2017 (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).