Multi-sensor based user interface
US-2017060254-A1 · Mar 2, 2017 · US
US9953216B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9953216-B2 |
| Application number | US-201514596168-A |
| Country | US |
| Kind code | B2 |
| Filing date | Jan 13, 2015 |
| Priority date | Jan 13, 2015 |
| Publication date | Apr 24, 2018 |
| Grant date | Apr 24, 2018 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Systems, methods, and computer-readable media are provided for performing actions in response to gestures made by a user in captured images. In accordance with one implementation, a computer-implemented system is provided that includes an image capture device that captures at least one image, a memory device that stores instructions, and at least one processor that executes the instructions stored in the memory device. In some implementations, the processor receives, from the image capture device, at least one image including a gesture made by a user and analyzes the at least one image to identify the gesture made by the user. In some implementations, the processor also determines, based on the identified gesture, one or more actions to perform on the at least one image.
Opening claim text (preview).
What is claimed is: 1. A computer-implemented system comprising: an image capture device that captures images; a memory device that stores instructions; and at least one processor that executes the instructions to perform operations comprising: receiving, from the image capture device, at least one image including a gesture made by a user; analyzing the at least one image to identify the gesture made by the user in the at least one image; determining, based on the identified gesture, a first action to perform on the at least one image; determining a selection area for the gesture; identifying an area of interest in the at least one image based on the determined selection area of the gesture, wherein the area of interest includes non-textual content; performing the first action on the identified area of interest, wherein the first action comprises: classifying the non-textual content included in the area of interest into at least one of a plurality of different types of non-textual content into which the non-textual content is classifiable by the computer-implemented system, wherein the computer-implemented system is capable of recognizing each of a face, an object, and a landscape; and generating a first result that indicates the at least one type of non-textual content into which the non-textual content included in the area of interest was classified; determining a second action to be performed on the identified area of interest based at least in part on the at least one type of non-textual content into which the non-textual content included in the area of interest was classified; and performing the second action on the identified area of interest. 2. The computer-implemented system according to claim 1 , wherein the selection area for the gesture is determined by a shape and a location of the gesture in the at least one image. 3. The computer-implemented system according to claim 1 , wherein: determining the second action to be performed on the identified area of interest comprises determining, based on the gesture and the first result, the second action to be performed on the identified area of interest. 4. A non-transitory, computer-readable medium storing instructions, the instructions configured to cause at least one processor to perform operations comprising: receiving at least one image including a gesture made by a user; analyzing the at least one image to identify the gesture made by the user in the at least one image; determining, based on the gesture, a first action to perform on the at least one image; determining a selection area for the gesture; identifying an area of interest in the at least one image based on the determined selection area of the gesture, wherein the area of interest includes non-textual content; and performing the first action on the identified area of interest, wherein the first action comprises: classifying the non-textual content included in the area of interest into at least one of a plurality of different types of non-textual content into which the non-textual content is classifiable by the at least one processor, wherein the instructions are configured to cause the at least one processor to be capable of recognizing each of a face, an object, and a landscape; and generating a first result that indicates the type of non-textual content into which the content included in the area of interest was classified; determining a second action to be performed on the identified area of interest based at least in part on the at least one type of non-textual content into which the non-textual content included in the area of interest was classified; and performing the second action on the identified area of interest. 5. The computer-readable medium according to claim 4 , wherein the selection area for the gesture is determined by a shape and a location of the gesture in the at least one image. 6. The computer-readable medium according to claim 4 , wherein: determining the second action comprises determining, based on the gesture and the first result, the second action to be performed on the identified area of interest. 7. A method comprising the following operations performed by one or more processors: receiving at least one image including a single gesture made by a user; analyzing the at least one image to identify the single gesture made by the user in the at least one image; determining, based on the single gesture, a first action to perform on the at least one image; determining a selection area indicated by the single gesture, such that both of the first action and the selection area are determined based on the single gesture made by the user in the at least one image; identifying an area of interest in the at least one image based on the determined selection area indicated by the gesture; and performing the first action on the identified area of interest, wherein performing the first action comprises recognizing, by the one or more processors that are capable of recognizing each of a face, an object, or a landscape, at least one of the face, the object, or the landscape within the area of interest. 8. The method according to claim 7 , wherein the selection area for the single gesture is determined by a shape and a location of the single gesture in the at least one image. 9. The method according to claim 7 , wherein the first action results in a first result and the method further comprises: determining, based on the single gesture and the first result, a second action; and performing the second action. 10. The computer-implemented system according to claim 1 , wherein performing the second action comprises at least one of performing a facial recognition on a first face, identifying a first object, or determining a location. 11. The computer-implemented system according to claim 1 , wherein: classifying content included in the area of interest into at least one of the plurality of different types of content comprises classifying the content included in the area of interest as a first face; and performing the second action comprises automatically performing facial recognition on the first face. 12. The computer-implemented system according to claim 1 , wherein: determining the selection area for the gesture comprises determining the selection area based on a first image that includes the gesture made by the user; and identifying the area of interest in the at least one image comprises: obtaining a second image that depicts a same scene as the first image that includes the gesture made by the user, wherein the second image does not include the gesture made by the user; and identifying the area of interest in the second image based at least in part on the selection area determined from the first image. 13. The computer-implemented system according to claim 1 , wherein analyzing the at least one image to identify the gesture made by the user in the at least one image comprises: obtaining depth of field measurements for the at least one image; and detecting the gesture in a foreground of the at least one image based at least in part on the depth of field measurements. 14. The computer-implemented system according to claim 1 , wherein analyzing the at least one image to identify the gesture made by the user in the at least one image comprises: providing the at least one image to a deep neural network; and receiving an identifier for the gesture from the deep neural network. 15. The computer-implemented system according to claim 1 , wherein analyzing the at least one image to identify the gesture made by the user in the at lea
Detection arrangements using opto-electronic means (constructional details of pointing devices not related to the detection arrangement using opto-electronic means G06F3/033; optical digitisers G06F3/042) · CPC title
Recognition of hand or arm movements, e.g. recognition of deaf sign language (static hand signs G06V40/113) · CPC title
based on user input or interaction · CPC title
Physics · mapped topic
Gesture based interaction, e.g. based on a set of recognized hand gestures (interaction based on gestures traced on a digitiser G06F3/04883) · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.