Promoting voice actions to hotwords

US9263035B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9263035-B2
Application numberUS-201414221520-A
CountryUS
Kind codeB2
Filing dateMar 21, 2014
Priority dateDec 5, 2013
Publication dateFeb 16, 2016
Grant dateFeb 16, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for designating certain voice commands as hotwords. The methods, systems, and apparatus include actions of receiving a hotword followed by a voice command. Additional actions include determining that the voice command satisfies one or more predetermined criteria associated with designating the voice command as a hotword, where a voice command that is designated as a hotword is treated as a voice input regardless of whether the voice command is preceded by another hotword. Further actions include, in response to determining that the voice command satisfies one or more predetermined criteria associated with designating the voice command as a hotword, designating the voice command as a hotword.

First claim

Opening claim text (preview).

The invention claimed is: 1. A computer-implemented method comprising: determining, by a device, that a voice command which includes a first term followed by a second term has been newly designated as a hotword; determining, by the device, that an existing hotword detector is configured to detect utterances that include the first term followed by a third term that is different from the second term, wherein a hotword detector comprises a neural network that is configured to estimate a likelihood of a match between (i) acoustic feature associated with an utterance and (ii) corresponding acoustic features associated with representative utterances of a particular term, without transcribing the utterance; and enabling, by the device, a set of hotword detectors including a first, a second, and a third hotword detector configured to detect utterances that include the first term, the second term, and the third term, respectively. 2. The method of claim 1 , wherein a voice command that is designated as a hotword is treated as a voice input regardless of whether the voice command is preceded by another hotword. 3. The method of claim 1 , comprising: disabling the existing hotword detector that is configured to detect utterances that include the first term followed by the different, third term. 4. The method of claim 1 , comprising: receiving an utterance including the first term followed by another term; receiving an output, from the first hotword detector, that the utterance includes the first term; and in response to receiving the output, determining whether the other term is the second term or the third term using the second hotword detector and the third hotword detector, respectively. 5. The method of claim 1 , comprising: in response to determining whether the other term is the second term or the third term using the second hotword detector and the third hotword detector, respectively, determining that the other term is the second term; and in response determining that the other term is the second term, determining that the utterance is a voice input. 6. The method of claim 1 , comprising: in response to determining whether the other term is the second term or the third term using the second hotword detector and the third hotword detector, respectively, determining that the other term is the third term; and in response determining that the other term is the third term, determining that the utterance is a voice input. 7. The method of claim 1 , comprising: in response to determining whether the other term is the second term or the third term using the second hotword detector and the third hotword detector, respectively, determining that the other term is not the second term or the third term; and in response determining that the other term is not the second term or the third term, determining that the utterance is not a voice input. 8. The method of claim 1 , comprising: determining that a second voice command which includes the first term followed by a different, fourth term has been newly designated as a hotword; in response to determining that the voice command which includes the first term followed by the second term has been designated as a hotword, determining that an existing hotword detector is configured to detect utterances that include the first term; and in response to determining that an existing hotword detector is configured to detect utterances that include the first term, enabling a fourth hotword detector configured to detect utterances that include the different, fourth term. 9. The method of claim 1 , wherein a hotword comprises (i) a term that wakes the device up from a sleep state or hibernation state or (ii) a term that triggers semantic interpretation on the term or on one or more terms that follow the term. 10. A system comprising: one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising: determining, by a device, that a voice command which includes a first term followed by a second term has been newly designated as a hotword; determining, by the device, that an existing hotword detector is configured to detect utterances that include the first term followed by a third term that is different from the second term wherein a hotword detector comprises a neural network that is configured to estimate a likelihood of a match between (i) acoustic feature associated with an utterance and (ii) corresponding acoustic features associated with representative utterances of a particular term, without transcribing the utterance; and enabling, by the device, a set of hotword detectors including a first, a second, and a third hotword detector configured to detect utterances that include the first term, the second term, and the third term, respectively. 11. The system of claim 10 , wherein a voice command that is designated as a hotword is treated as a voice input regardless of whether the voice command is preceded by another hotword. 12. The system of claim 10 , the operations comprising: disabling the existing hotword detector that is configured to detect utterances that include the first term followed by the different, third term. 13. The system of claim 10 , the operations comprising: receiving an utterance including the first term followed by another term; receiving an output, from the first hotword detector, that the utterance includes the first term; and in response to receiving the output, determining whether the other term is the second term or the third term using the second hotword detector and the third hotword detector, respectively. 14. The system of claim 10 , the operations comprising: in response to determining whether the other term is the second term or the third term using the second hotword detector and the third hotword detector, respectively, determining that the other term is the second term; and in response determining that the other term is the second term, determining that the utterance is a voice input. 15. The system of claim 10 , the operations comprising: in response to determining whether the other term is the second term or the third term using the second hotword detector and the third hotword detector, respectively, determining that the other term is the third term; and in response determining that the other term is the third term, determining that the utterance is a voice input. 16. The system of claim 10 , the operations comprising: in response to determining whether the other term is the second term or the third term using the second hotword detector and the third hotword detector, respectively, determining that the other term is not the second term or the third term; and in response determining that the other term is not the second term or the third term, determining that the utterance is not a voice input. 17. The system of claim 10 , the operations comprising: determining that a second voice command which includes the first term followed by a different, fourth term has been newly designated as a hotword; in response to determining that the voice command which includes the first term followed by the second term has been designated as a hotword, determining that an existing hotword detector is configured to detect utterances that include the first term; and in response to determining that an existing hotword detector is configured to detect utterances that include the first term, enabling a fourth hotword detector configured to detect utteranc

Assignees

Inventors

Classifications

  • G10L15/22Primary

    Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • Execution procedure of a spoken command · CPC title

  • Memory allocation or algorithm optimisation to reduce hardware requirements · CPC title

  • G10L15/08Primary

    Speech classification or search · CPC title

  • Audio in a user interface, e.g. using voice commands for navigating, audio feedback · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9263035B2 cover?
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for designating certain voice commands as hotwords. The methods, systems, and apparatus include actions of receiving a hotword followed by a voice command. Additional actions include determining that the voice command satisfies one or more predetermined criteria associated with designating the voi…
Who is the assignee on this patent?
Google Inc
What technology area does this patent fall under?
Primary CPC classification G10L15/22. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 16 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).