Affecting the behavior of a user device based on a user's gaze

US9526127B1 · US · B1

Patent metadata
FieldValue
Publication numberUS-9526127-B1
Application numberUS-201213676517-A
CountryUS
Kind codeB1
Filing dateNov 14, 2012
Priority dateNov 18, 2011
Publication dateDec 20, 2016
Grant dateDec 20, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A device may determine whether a user is facing a display screen associated with the device; and present feedback to the user. When presenting the feedback, the device may present visual information, that is based on the feedback, when determining that the user is facing the display screen associated with the device, and present audio information, that is based on the feedback, when determining that the user is not facing the display screen associated with the device. At least a portion of the audio information might not be presented when the visual information is presented when the user is facing the display screen associated with the device.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method comprising: receiving, by a user device, audio data corresponding to an utterance of a user; determining that a first portion of a transcription of the utterance includes a keyword that is associated with a voice command; after determining that the first portion of the transcription of the utterance includes the keyword that is associated with the voice command, determining, by the user device, that an image of the user does not include one or more features that are characteristic of the user facing a display of the user device; and in response to determining that the image of the user does not include one or more features that are characteristic of the user facing the display of the user device, determining, by the user device, to prevent a second portion of the transcription of the utterance from being processed as a voice command. 2. The method of claim 1 , comprising: discarding the second portion of the transcription without inputting the second portion of the transcription to a dialog engine based on determining to prevent the second portion of the transcription of the utterance from being processed as a voice command. 3. The method of claim 1 , comprising: generating the image of the user by a camera on the user device after determining that the first portion of the transcription of the utterance includes the keyword that is associated with the voice command. 4. The method of claim 1 , wherein the voice command is a command for the user to take an action other than taking a picture. 5. The method of claim 1 , wherein determining the image does not include one or more features that are characteristic of the user facing a display of the user device comprises: determining a direction of a gaze of the user; and classifying the gaze of the user as a gaze that is not directed toward the display of the user device. 6. The method of claim 1 , wherein determining that the image does not include one or more features that are characteristic of the user facing a display of the user device comprises: determining that one or more particular facial features are not visible in the image. 7. The method of claim 1 , wherein determining that the image does not include one or more features that are characteristic of the user facing a display of the user device comprises: determining an orientation of two or more particular facial features of the user with respect to each other. 8. The method of claim 1 , wherein determining that the image includes one or more features that are characteristic of the user facing a display of the user device comprises: comparing the image to a different image in which the user is labeled as facing the display of the mobile device. 9. A system comprising: one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising: receiving, by a user device, audio data corresponding to an utterance of a user; determining that a first portion of a transcription of the utterance includes a keyword that is associated with a voice command; after determining that the first portion of the transcription of the utterance includes the keyword that is associated with the voice command, determining, by the user device, that an image of the user does not include one or more features that are characteristic of the user facing a display of the user device; and in response to determining that the image of the user does not include one or more features that are characteristic of the user facing the display of the user device, determining, by the user device, to prevent a second portion of the transcription of the utterance from being processed as a voice command. 10. The system of claim 9 , wherein the operations further comprise: discarding the second portion of the transcription without inputting the second portion of the transcription to a dialog engine based on determining to prevent the second portion of the transcription of the utterance from being processed as a voice command. 11. The system of claim 9 , wherein the operations further comprise: generating the image of the user by a camera on the user device after determining that the first portion of the transcription of the utterance includes the keyword that is associated with the voice command. 12. The system of claim 9 , wherein the voice command is a command for the user to take an action other than taking a picture. 13. The system of claim 9 , wherein determining the image does not include one or more features that are characteristic of the user facing a display of the user device comprises: determining a direction of a gaze of the user; and classifying the gaze of the user as a gaze that is not directed toward the display of the user device. 14. The system of claim 9 , wherein determining that the image does not include one or more features that are characteristic of the user facing a display of the user device comprises: determining that one or more particular facial features are not visible in the image. 15. The system of claim 9 , wherein determining that the image does not include one or more features that are characteristic of the user facing a display of the user device comprises: determining an orientation of two or more particular facial features of the user with respect to each other. 16. The system of claim 9 , wherein determining that the image includes one or more features that are characteristic of the user facing a display of the user device comprises: comparing the image to a different image in which the user is labeled as facing the display of the mobile device. 17. A non-transitory computer-readable medium storing software comprising instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising: receiving, by a user device, audio data corresponding to an utterance of a user; determining that a first portion of a transcription of the utterance includes a keyword that is associated with a voice command; after determining that the first portion of the transcription of the utterance includes the keyword that is associated with the voice command, determining, by the user device, that an image of the user does not include one or more features that are characteristic of the user facing a display of the user device; and in response to determining that the image of the user does not include one or more features that are characteristic of the user facing the display of the user device, determining, by the user device, to prevent a second portion of the transcription of the utterance from being processed as a voice command. 18. The medium of claim 17 , wherein the operations further comprise: discarding the second portion of the transcription without inputting the second portion of the transcription to a dialog engine based on determining to prevent the second portion of the transcription of the utterance from being processed as a voice command. 19. The medium of claim 17 , wherein the operations further comprise: generating the image of the user by a camera on the user device after determining that the first portion of the transcription of the utterance includes the keyword that is associated with the voice command. 20. The medium of claim 17 , wherein the voice command is a command for the user to take an action other than taking a picture

Assignees

Inventors

Classifications

  • G06F3/167Primary

    Audio in a user interface, e.g. using voice commands for navigating, audio feedback · CPC title

  • using visible light · CPC title

  • using position of the lips, movement of the lips or face analysis · CPC title

  • H04W88/02Primary

    Terminal devices · CPC title

  • Detection of presence or absence of voice signals (switching of direction of transmission by voice frequency in two-way loud-speaking telephone systems H04M9/10) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9526127B1 cover?
A device may determine whether a user is facing a display screen associated with the device; and present feedback to the user. When presenting the feedback, the device may present visual information, that is based on the feedback, when determining that the user is facing the display screen associated with the device, and present audio information, that is based on the feedback, when determining…
Who is the assignee on this patent?
Google Inc
What technology area does this patent fall under?
Primary CPC classification G06F3/167. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 20 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).