Machine-learning based gesture recognition using multiple sensors

US11449802B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11449802-B2
Application numberUS-202016937481-A
CountryUS
Kind codeB2
Filing dateJul 23, 2020
Priority dateNov 8, 2019
Publication dateSep 20, 2022
Grant dateSep 20, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A device implementing a system for machine-learning based gesture recognition includes at least one processor configured to, receive, from a first sensor of the device, first sensor output of a first type, and receive, from a second sensor of the device, second sensor output of a second type that differs from the first type. The at least one processor is further configured to provide the first sensor output and the second sensor output as inputs to a machine learning model, the machine learning model having been trained to output a predicted gesture based on sensor output of the first type and sensor output of the second type. The at least one processor is further configured to determine the predicted gesture based on an output from the machine learning model, and to perform, in response to determining the predicted gesture, a predetermined action on the device.

First claim

Opening claim text (preview).

What is claimed is: 1. A method, comprising: receiving sensor data from a sensor of a device; providing the sensor data as input to a machine learning model, the machine learning model having been trained to output, while a gesture is being performed by a user of the device and prior to completion of the gesture, a predicted gesture, a predicted start time of the gesture, and a predicted end time of the gesture, based on the sensor data; determining the predicted gesture based on an output from the machine learning model; and performing, in response to determining the predicted gesture, a predetermined action on the device. 2. The method of claim 1 , wherein receiving the sensor data comprises receiving the sensor data during a first window of time that at least partially overlaps a gesture time of the gesture, the method further comprising: receiving additional sensor data from the sensor of the device during a second window of time that at least partially overlaps the gesture time of the gesture; providing the additional sensor data as input to the machine learning model; and determining the predicted gesture based on the output from the machine learning model that is based on the sensor data from the first window of time and based on an additional output of the machine learning model that is based on the additional sensor data from the second window of time. 3. The method of claim 2 , wherein determining the predicted gesture based on the output from the machine learning model that is based on the sensor data from the first window of time and based on the additional output of the machine learning model that is based on the additional sensor data from the second window of time comprises: aggregating a first predicted start time from the machine learning model that is based on the sensor data from the first window of time and a second predicted start time from the machine learning model that is based on the additional sensor data from the second window of time to determine a final predicted start time for the gesture; and aggregating a first predicted end time from the machine learning model that is based on the sensor data from the first window of time and a second predicted end time from the machine learning model that is based on the additional sensor data from the second window of time to determine a final predicted end time for the gesture. 4. The method of claim 3 , further comprising adjusting a size of an input buffer for the machine learning model based on the final predicted start time and the final predicted end time. 5. The method of claim 4 , wherein determining the predicted gesture comprises determining the predicted gesture at a time after the final predicted end time using sensor data in the input buffer having the adjusted size. 6. The method of claim 1 , the machine learning model having been trained to output the predicted start time of the gesture and the predicted end time of the gesture at least in part by generating a gesture label and a no-gesture label for each of multiple windows of time. 7. The method of claim 6 , wherein the gesture label and the no-gesture label each have a value that is smoothly continuous between a maximum value and a minimum value. 8. The method of claim 1 , wherein the predetermined action corresponds to changing a user interface on the device. 9. The method of claim 1 , wherein the predetermined action comprises sending gesture information or an instruction from the device to a companion device. 10. A device, comprising: at least one processor; and a memory including instructions that, when executed by the at least one processor, cause the at least one processor to: receive sensor data from a sensor of the device; provide the sensor data as input to a machine learning model, the machine learning model having been trained to output, while a gesture is being performed by a user of the device and prior to completion of the gesture, a predicted gesture, a predicted start time of the gesture, and a predicted end time of the gesture, based on the sensor data; determine the predicted gesture based on an output from the machine learning model; and perform, in response to determining the predicted gesture, a predetermined action on the device. 11. The device of claim 10 , wherein the one or more processors are configured to receive the sensor data during a first window of time that at least partially overlaps a gesture time of the gesture, and are further configured to: receive additional sensor data from the sensor of the device during a second window of time that at least partially overlaps the gesture time of the gesture; provide the additional sensor data as input to the machine learning model; and determine the predicted gesture based on the output from the machine learning model that is based on the sensor data from the first window of time and based on an additional output of the machine learning model that is based on the additional sensor data from the second window of time. 12. The device of claim 11 , wherein the one or more processors are configured to determine the predicted gesture based on the output from the machine learning model that is based on the sensor data from the first window of time and based on the additional output of the machine learning model that is based on the additional sensor data from the second window of time by: aggregating a first predicted start time from the machine learning model that is based on the sensor data from the first window of time and a second predicted start time from the machine learning model that is based on the additional sensor data from the second window of time to determine a final predicted start time for the gesture; and aggregating a first predicted end time from the machine learning model that is based on the sensor data from the first window of time and a second predicted end time from the machine learning model that is based on the additional sensor data from the second window of time to determine a final predicted end time for the gesture. 13. The device of claim 12 , wherein the one or more processors are further configured to adjust a size of an input buffer for the machine learning model based on the final predicted start time and the final predicted end time. 14. The device of claim 13 , wherein the one or more processors are configured to determine the predicted gesture at a time after the final predicted end time, using sensor data in the input buffer having the adjusted size. 15. The device of claim 10 , the machine learning model having been trained to output the predicted start time of the gesture and the predicted end time of the gesture at least in part by generating a gesture label and a no-gesture label for each of multiple windows of time. 16. The device of claim 15 , wherein the gesture label and the no-gesture label each have a value that is smoothly continuous between a maximum value and a minimum value. 17. The device of claim 10 , further comprising the sensor. 18. The device of claim 17 , wherein the device comprises a smartwatch. 19. The device of claim 18 , wherein the at least one processor is configured to receive the sensor data from the sensor of the device while the smartwatch is worn on a wrist of the user, and wherein the gesture corresponds to a single-handed gesture performed by a hand that is coupled to the wrist on which the smartwatch is worn. 20. A non-transitory computer-readable medium storing instructions which, when executed by one or more processors,

Assignees

Inventors

Classifications

  • G06F3/017Primary

    Gesture based interaction, e.g. based on a set of recognized hand gestures (interaction based on gestures traced on a digitiser G06F3/04883) · CPC title

  • G06N20/00Primary

    Machine learning · CPC title

  • characterised by the incorporation of unlabelled data, e.g. multiple instance learning [MIL], semi-supervised techniques using expectation-maximisation [EM] or naïve labelling · CPC title

  • Classification; Matching · CPC title

  • Combinations of networks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11449802B2 cover?
A device implementing a system for machine-learning based gesture recognition includes at least one processor configured to, receive, from a first sensor of the device, first sensor output of a first type, and receive, from a second sensor of the device, second sensor output of a second type that differs from the first type. The at least one processor is further configured to provide the first …
Who is the assignee on this patent?
Apple Inc
What technology area does this patent fall under?
Primary CPC classification G06F3/017. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Sep 20 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 9 related publications on this page (citations in our corpus or others sharing the same primary CPC).