Systems and methods for performing actions in response to user gestures in captured images

US9953216B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9953216-B2
Application numberUS-201514596168-A
CountryUS
Kind codeB2
Filing dateJan 13, 2015
Priority dateJan 13, 2015
Publication dateApr 24, 2018
Grant dateApr 24, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems, methods, and computer-readable media are provided for performing actions in response to gestures made by a user in captured images. In accordance with one implementation, a computer-implemented system is provided that includes an image capture device that captures at least one image, a memory device that stores instructions, and at least one processor that executes the instructions stored in the memory device. In some implementations, the processor receives, from the image capture device, at least one image including a gesture made by a user and analyzes the at least one image to identify the gesture made by the user. In some implementations, the processor also determines, based on the identified gesture, one or more actions to perform on the at least one image.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented system comprising: an image capture device that captures images; a memory device that stores instructions; and at least one processor that executes the instructions to perform operations comprising: receiving, from the image capture device, at least one image including a gesture made by a user; analyzing the at least one image to identify the gesture made by the user in the at least one image; determining, based on the identified gesture, a first action to perform on the at least one image; determining a selection area for the gesture; identifying an area of interest in the at least one image based on the determined selection area of the gesture, wherein the area of interest includes non-textual content; performing the first action on the identified area of interest, wherein the first action comprises: classifying the non-textual content included in the area of interest into at least one of a plurality of different types of non-textual content into which the non-textual content is classifiable by the computer-implemented system, wherein the computer-implemented system is capable of recognizing each of a face, an object, and a landscape; and generating a first result that indicates the at least one type of non-textual content into which the non-textual content included in the area of interest was classified; determining a second action to be performed on the identified area of interest based at least in part on the at least one type of non-textual content into which the non-textual content included in the area of interest was classified; and performing the second action on the identified area of interest. 2. The computer-implemented system according to claim 1 , wherein the selection area for the gesture is determined by a shape and a location of the gesture in the at least one image. 3. The computer-implemented system according to claim 1 , wherein: determining the second action to be performed on the identified area of interest comprises determining, based on the gesture and the first result, the second action to be performed on the identified area of interest. 4. A non-transitory, computer-readable medium storing instructions, the instructions configured to cause at least one processor to perform operations comprising: receiving at least one image including a gesture made by a user; analyzing the at least one image to identify the gesture made by the user in the at least one image; determining, based on the gesture, a first action to perform on the at least one image; determining a selection area for the gesture; identifying an area of interest in the at least one image based on the determined selection area of the gesture, wherein the area of interest includes non-textual content; and performing the first action on the identified area of interest, wherein the first action comprises: classifying the non-textual content included in the area of interest into at least one of a plurality of different types of non-textual content into which the non-textual content is classifiable by the at least one processor, wherein the instructions are configured to cause the at least one processor to be capable of recognizing each of a face, an object, and a landscape; and generating a first result that indicates the type of non-textual content into which the content included in the area of interest was classified; determining a second action to be performed on the identified area of interest based at least in part on the at least one type of non-textual content into which the non-textual content included in the area of interest was classified; and performing the second action on the identified area of interest. 5. The computer-readable medium according to claim 4 , wherein the selection area for the gesture is determined by a shape and a location of the gesture in the at least one image. 6. The computer-readable medium according to claim 4 , wherein: determining the second action comprises determining, based on the gesture and the first result, the second action to be performed on the identified area of interest. 7. A method comprising the following operations performed by one or more processors: receiving at least one image including a single gesture made by a user; analyzing the at least one image to identify the single gesture made by the user in the at least one image; determining, based on the single gesture, a first action to perform on the at least one image; determining a selection area indicated by the single gesture, such that both of the first action and the selection area are determined based on the single gesture made by the user in the at least one image; identifying an area of interest in the at least one image based on the determined selection area indicated by the gesture; and performing the first action on the identified area of interest, wherein performing the first action comprises recognizing, by the one or more processors that are capable of recognizing each of a face, an object, or a landscape, at least one of the face, the object, or the landscape within the area of interest. 8. The method according to claim 7 , wherein the selection area for the single gesture is determined by a shape and a location of the single gesture in the at least one image. 9. The method according to claim 7 , wherein the first action results in a first result and the method further comprises: determining, based on the single gesture and the first result, a second action; and performing the second action. 10. The computer-implemented system according to claim 1 , wherein performing the second action comprises at least one of performing a facial recognition on a first face, identifying a first object, or determining a location. 11. The computer-implemented system according to claim 1 , wherein: classifying content included in the area of interest into at least one of the plurality of different types of content comprises classifying the content included in the area of interest as a first face; and performing the second action comprises automatically performing facial recognition on the first face. 12. The computer-implemented system according to claim 1 , wherein: determining the selection area for the gesture comprises determining the selection area based on a first image that includes the gesture made by the user; and identifying the area of interest in the at least one image comprises: obtaining a second image that depicts a same scene as the first image that includes the gesture made by the user, wherein the second image does not include the gesture made by the user; and identifying the area of interest in the second image based at least in part on the selection area determined from the first image. 13. The computer-implemented system according to claim 1 , wherein analyzing the at least one image to identify the gesture made by the user in the at least one image comprises: obtaining depth of field measurements for the at least one image; and detecting the gesture in a foreground of the at least one image based at least in part on the depth of field measurements. 14. The computer-implemented system according to claim 1 , wherein analyzing the at least one image to identify the gesture made by the user in the at least one image comprises: providing the at least one image to a deep neural network; and receiving an identifier for the gesture from the deep neural network. 15. The computer-implemented system according to claim 1 , wherein analyzing the at least one image to identify the gesture made by the user in the at lea

Assignees

Inventors

Classifications

  • G06F3/0304Primary

    Detection arrangements using opto-electronic means (constructional details of pointing devices not related to the detection arrangement using opto-electronic means G06F3/033; optical digitisers G06F3/042) · CPC title

  • Recognition of hand or arm movements, e.g. recognition of deaf sign language (static hand signs G06V40/113) · CPC title

  • based on user input or interaction · CPC title

  • Physics · mapped topic

  • Gesture based interaction, e.g. based on a set of recognized hand gestures (interaction based on gestures traced on a digitiser G06F3/04883) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9953216B2 cover?
Systems, methods, and computer-readable media are provided for performing actions in response to gestures made by a user in captured images. In accordance with one implementation, a computer-implemented system is provided that includes an image capture device that captures at least one image, a memory device that stores instructions, and at least one processor that executes the instructions sto…
Who is the assignee on this patent?
Google Inc, Google Llc
What technology area does this patent fall under?
Primary CPC classification G06F3/0304. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 24 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).