Multi-task machine learning for predicted touch interpretations

US10261685B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10261685-B2
Application numberUS-201615393611-A
CountryUS
Kind codeB2
Filing dateDec 29, 2016
Priority dateDec 29, 2016
Publication dateApr 16, 2019
Grant dateApr 16, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present disclosure provides systems and methods that leverage machine learning to predict multiple touch interpretations. In particular, the systems and methods of the present disclosure can include and use a machine-learned touch interpretation prediction model that has been trained to receive touch sensor data indicative of one or more locations of one or more user input objects relative to a touch sensor at one or more times and, in response to receipt of the touch sensor data, provide one or more predicted touch interpretation outputs. Each predicted touch interpretation output corresponds to a different type of predicted touch interpretation based at least in part on the touch sensor data. Predicted touch interpretations can include a set of touch point interpretations, a gesture interpretation, and/or a touch prediction vector for one or more future times.

First claim

Opening claim text (preview).

What is claimed is: 1. A computing device that determines touch interpretation from user input objects, comprising: at least one processor; a machine-learned touch interpretation prediction model, wherein the touch interpretation prediction model has been trained to directly receive raw touch sensor data indicative of measured touch sensor readings across a grid of points generated in response to one or more locations of one or more user input objects relative to a touch sensor at one or more times and, in response to receipt of the raw touch sensor data, output simultaneous multiple predicted touch interpretations, each predicted touch interpretation corresponding to a different type of predicted touch interpretation based at least in part on the touch sensor data, wherein the simultaneous multiple predicted touch interpretations comprise at least first and second predicted touch interpretations determined from a group comprising a set of touch point interpretations that respectively describe one or more intended touch points, a gesture interpretation that characterizes the set of touch point interpretations as a gesture determined from a predefined gesture class, and a touch prediction vector that describes one or more predicted future locations of the one or more user input objects respectively for one or more future times; and at least one tangible, non-transitory computer-readable medium that stores instructions that, when executed by the at least one processor, cause the at least one processor to perform operations, the operations comprising: obtaining a first set of raw touch sensor data indicative of one or more user input object locations relative to a touch sensor over time; inputting the first set of raw touch sensor data into the machine-learned touch interpretation prediction model; receiving, as an output of the touch interpretation prediction model, simultaneous multiple predicted touch interpretations that describe predicted intentions of the one or more user input objects, each predicted touch interpretation describing a different predicted aspect of the one or more user input objects; and performing one or more actions associated with the multiple predicted touch interpretations. 2. The computing device of claim 1 , wherein the one or more predicted touch interpretations comprise at least a first predicted touch interpretation comprising a set of touch point interpretations that respectively describe one or more intended touch points and a second predicted touch interpretation comprising a gesture interpretation that characterizes the set of touch point interpretations as a gesture determined from a predefined gesture class. 3. The computing device of claim 1 , wherein the machine-learned touch interpretation prediction model comprises a deep recurrent neural network with a plurality of output layers, each output layer corresponding to a different type of touch interpretation describing one or more predicted intentions of the one or more user input objects. 4. The computing device of claim 1 , wherein obtaining the first set of raw touch sensor data comprises obtaining the first set of raw touch sensor data associated with one or more fingers or hand portions of a user or a stylus operated by the user, the first set of raw touch sensor data descriptive of a location of the one or more fingers, hand portions or stylus relative to a touch-sensitive screen. 5. The computing device of claim 1 , wherein obtaining the first set of raw touch sensor data comprises: obtaining the first set of raw time sensor data that provides at least one value describing a change in the location of the one or more user input objects in an x dimension, at least one value describing a change in the location of the one or more user input objects in a y dimension, and at least one value describing a change in time; or obtaining the first set of raw time sensor data that provides at least two values describing at least two locations of the one or more user input objects in the x dimension, at least two values describing at least two locations of the one or more user input objects in the y dimension, and at least two values describing at least two times. 6. The computing device of claim 1 , wherein the machine-learned touch interpretation prediction model has been trained based on a first set of training data that includes a first portion of data corresponding to recorded touch sensor data indicative of one or more user input object locations relative to a touch sensor and a second portion of data corresponding to labels of determined touch interpretations applied to recorded screen content, wherein the first portion of data and the screen content are recorded at the same time. 7. The computing device of claim 1 , wherein the machine-learned touch interpretation prediction model has been trained based on a second set of training data that includes a first portion of data corresponding to an initial sequence of touch sensor data observations and a second portion of data corresponding to a subsequent sequence of touch sensor data observations. 8. The computing device of claim 1 , wherein the one or more predicted touch interpretations comprise a set of touch point interpretations that respectively describe zero, one or more touch points. 9. The computing device of claim 8 , wherein the set of touch point interpretations further comprises one or more of a touch type describing a predicted type of user input object associated with each touch point and an estimated pressure of the touch at each touch point. 10. The computing device of claim 1 , wherein the one or more predicted touch interpretations comprise a gesture interpretation that characterizes at least a portion of the first set of raw touch sensor data as a gesture determined from a predefined gesture class. 11. The computing device of claim 1 , wherein the one or more predicted touch interpretations comprise a touch prediction vector that describes one or more predicted future locations of the one or more user input objects respectively for one or more future times. 12. The computing device of claim 1 , wherein: the machine-learned touch interpretation prediction model comprises at least one shared layer and multiple different and distinct output layers positioned structurally after the at least one shared layer; and receiving, as an output of the touch interpretation prediction model, one or more predicted touch interpretations that describe predicted intentions of the one or more user input objects comprises receiving the one or more predicted touch interpretations from the multiple different and distinct output layers of the machine-learned touch interpretation prediction model. 13. One or more tangible, non-transitory computer-readable media storing computer-readable instructions that when executed by one or more processors cause the one or more processors to perform operations, the operations comprising: obtaining data descriptive of a machine-learned touch interpretation prediction model, wherein the touch interpretation prediction model has been trained to receive touch sensor data indicative of one or more locations of one or more user input objects relative to a touch sensor at one or more times and, in response to receipt of the touch sensor data, provide simultaneous multiple predicted touch interpretation, each predicted touch interpretation corresponding to a different type of predicted touch interpretation based at least in part on the touch sensor data; obtaining a first set of touch sensor data indicative of one or more user input object locations relative to a touch sensor over time; inputting the

Assignees

Inventors

Classifications

  • Recurrent networks, e.g. Hopfield networks · CPC title

  • Combinations of networks · CPC title

  • using resistive elements, e.g. a single continuous surface or two parallel surfaces put in contact · CPC title

  • Machine learning · CPC title

  • Backpropagation, e.g. using gradient descent · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10261685B2 cover?
The present disclosure provides systems and methods that leverage machine learning to predict multiple touch interpretations. In particular, the systems and methods of the present disclosure can include and use a machine-learned touch interpretation prediction model that has been trained to receive touch sensor data indicative of one or more locations of one or more user input objects relative …
Who is the assignee on this patent?
Google Inc, Google Llc
What technology area does this patent fall under?
Primary CPC classification G06F3/04883. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 16 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).