Contextual hotwords

US10839803B2 · US · B2

Patent metadata
Field               Value
Publication number  US-10839803-B2
Application number  US-201916362831-A
Country             US
Kind code           B2
Filing date         Mar 25, 2019
Priority date       Dec 27, 2016
Publication date    Nov 17, 2020
Grant date          Nov 17, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for contextual hotwords are disclosed. In one aspect, a method, during a boot process of a computing device, includes the actions of determining, by a computing device, a context associated with the computing device. The actions further include, based on the context associated with the computing device, determining a hotword. The actions further include, after determining the hotword, receiving audio data that corresponds to an utterance. The actions further include determining that the audio data includes the hotword. The actions further include, in response to determining that the audio data includes the hotword, performing an operation associated with the hotword.
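
The sequence of actions in the abstract (determine a context, derive a hotword from it, receive audio, check for the hotword, perform the associated operation) can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration and does not come from the patent: the `HotwordBinding` type, the `BINDINGS` table, the context names, and above all the use of a plain string as a stand-in for audio data.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional


@dataclass(frozen=True)
class HotwordBinding:
    """Hypothetical pairing of a hotword with the operation it triggers."""
    hotword: str
    operation: Callable[[], str]


# Hypothetical context table: each device context enables exactly one hotword.
BINDINGS: Dict[str, HotwordBinding] = {
    "home_screen": HotwordBinding("open calendar", lambda: "calendar opened"),
    "music_playing": HotwordBinding("next", lambda: "skipped to next track"),
}


def determine_hotword(context: str) -> Optional[HotwordBinding]:
    """Pick the hotword for the current context, if the context has one."""
    return BINDINGS.get(context)


def contains_hotword(audio: str, hotword: str) -> bool:
    """Stand-in detector: a real device would score acoustic features,
    not compare text. The string only simulates audio data here."""
    return hotword in audio.lower()


def handle_utterance(context: str, audio: str) -> Optional[str]:
    """Run the abstract's steps in order; return the operation's result,
    or None when no hotword is active or the hotword is not detected."""
    binding = determine_hotword(context)
    if binding is None:
        return None
    if contains_hotword(audio, binding.hotword):
        return binding.operation()
    return None
```

In this sketch, saying "next" only has an effect in the `music_playing` context; in any other context the same utterance is ignored because that hotword is not active.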

First claim

Opening claim text (preview).

What is claimed is:

1. A computer-implemented method comprising: determining, by a computing device, a current context associated with the computing device; based on the current context associated with the computing device, determining, by the computing device and from among multiple different commands that the computing device is configured to execute, a command that is currently available for execution by a user to open an application that is not currently running on the computing device, wherein the multiple different commands other than the command are not currently available for execution by the user; determining, by the computing device, a hotword that, when spoken by the user and detected by the computing device, instructs the computing device to execute the command, wherein each of the multiple different commands other than the command corresponds to a respective different hotword for triggering performance of a different respective operation other than opening the application that is not currently running on the computing device; selecting, by the computing device and from among multiple hotword models, a hotword model that is configured to recognize audio of the hotword, wherein each of the multiple hotword models are configured to recognize audio of the respective hotword; receiving, by the computing device, audio data of an utterance; providing, by the computing device, the audio data of the utterance as an input to the hotword model without providing the audio data of the utterance as an input to the multiple hotword models other than the hotword model; based on providing the audio data of the utterance as the input to the hotword model, determining, by the computing device, that the utterance includes the hotword; and based on determining that the utterance includes the hotword, performing, by the computing device, the command by opening the application to commence running on the computing device.

2. The method of claim 1, wherein determining that the utterance includes the hotword comprises determining that the utterance includes the hotword without performing speech recognition on the audio data.

3. The method of claim 1, wherein the audio data only includes the hotword.

4. The method of claim 1, further comprising: receiving, by the computing device, additional audio data of an additional utterance that includes an additional hotword; providing, by the computing device, the additional audio data of the additional utterance as an input to the hotword model without providing the additional audio data of the additional utterance as an input to the multiple hotword models other than the hotword model; based on providing the additional audio data of the additional utterance as the input to the hotword model, determining, by the computing device, that the additional utterance does not include the hotword; and bypassing, by the computing device, performing an additional command associated with the additional hotword.

5. The method of claim 1, wherein each of the multiple hotword models are generated based on audio data of previous utterances that included a corresponding hotword.

6. The method of claim 1, further comprising: determining, by the computing device, that the command is no longer available for execution by the user; determining, by the computing device and from among the multiple different commands that the computing device is configured to execute, that an additional command is available for execution by the user; determining, by the computing device, an additional hotword that, when spoken by the user and detected by the computing device, instructs the computing device to execute the additional command; selecting, by the computing device and from among the multiple hotword models, an additional hotword model that is configured to recognize audio of the additional hotword; receiving, by the computing device, additional audio data of an additional utterance, wherein the additional utterance includes the hotword; providing, by the computing device, the additional audio data of the additional utterance as an input to the additional hotword model without providing the audio data of the utterance as an input to the multiple hotword models other than the additional hotword model; based on providing the additional audio data of the additional utterance as the input to the additional hotword model, determining, by the computing device, that the additional utterance does not include the additional hotword; and based on determining that the additional utterance does not include the additional hotword, bypassing, by the computing device, performance of the additional command.

7. The method of claim 1, wherein determining that the utterance includes the hotword comprises: determining audio features of the audio data; based on the audio features, determining a hotword confidence score that reflects a likelihood that the utterance includes the hotword; and based on the hotword confidence score, determining that the utterance includes the hotword.

8. A system comprising: one or more computers; and one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising: determining, by a computing device, a current context associated with the computing device; based on the current context associated with the computing device, determining, by the computing device and from among multiple different commands that the computing device is configured to execute, a command that is currently available for execution by a user to open an application that is not currently running on the computing device, wherein the multiple different commands other than the command are not currently available for execution by the user; determining, by the computing device, a hotword that, when spoken by the user and detected by the computing device, instructs the computing device to execute the command, wherein each of the multiple different commands other than the command corresponds to a respective different hotword for triggering performance of a different respective operation other than opening the application that is not currently running on the computing device; selecting, by the computing device and from among multiple hotword models, a hotword model that is configured to recognize audio of the hotword, wherein each of the multiple hotword models are configured to recognize audio of the respective hotword; receiving, by the computing device, audio data of an utterance; providing, by the computing device, the audio data of the utterance as an input to the hotword model without providing the audio data of the utterance as an input to the multiple hotword models other than the hotword model; based on providing the audio data of the utterance as the input to the hotword model, determining, by the computing device, that the utterance includes the hotword; and based on determining that the utterance includes the hotword, performing, by the computing device, the command by opening the application to commence running on the computing device.

9. The system of claim 8, wherein determining that the utterance includes the hotword comprises determining that the utterance includes the hotword without performing speech recognition on the audio data.

10. The system of claim 8, wherein the audio data only includes the hotword.

11. The system of claim 8, wherein the operations further comprise: receiving, by the computing device, additional audio data of an additional utterance that includes an additional hotword; providing, by the computing devic
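
Claims 1 and 7 together describe routing the audio to only the selected hotword model and deciding detection from a confidence score computed over audio features. The Python sketch below shows one way such a step might look; it is not the patented implementation. The feature extractor (mean absolute amplitude per slice), the cosine-similarity score, the template vectors, and the 0.9 threshold are all assumptions chosen to keep the example small.

```python
import math
from typing import Dict, List, Sequence


def extract_features(samples: Sequence[float], n_bands: int = 4) -> List[float]:
    """Toy features (assumption): mean absolute amplitude of n_bands slices."""
    size = max(1, len(samples) // n_bands)
    return [
        sum(abs(s) for s in samples[i * size:(i + 1) * size]) / size
        for i in range(n_bands)
    ]


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity, returning 0.0 when either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class HotwordModel:
    """Hypothetical per-hotword model: a feature template plus a threshold."""

    def __init__(self, hotword: str, template: Sequence[float],
                 threshold: float = 0.9) -> None:
        self.hotword = hotword
        self.template = list(template)
        self.threshold = threshold
        self.calls = 0  # counts how often this model was given audio

    def score(self, features: Sequence[float]) -> float:
        """Confidence score reflecting how closely the features match."""
        self.calls += 1
        return cosine(features, self.template)

    def detects(self, features: Sequence[float]) -> bool:
        return self.score(features) >= self.threshold


def detect(models: Dict[str, HotwordModel], active_hotword: str,
           samples: Sequence[float]) -> bool:
    """Feed the audio to the selected model only; other models never see it."""
    model = models[active_hotword]
    return model.detects(extract_features(samples))
```

The `calls` counter exists only to make the routing visible: after a call to `detect`, models other than the active one still report zero calls, which mirrors the claim language about not providing the audio to the other hotword models.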

Assignees

Inventors

Classifications

  • of application context · CPC title

  • using non-speech characteristics · CPC title

  • Execution procedure of a spoken command · CPC title

  • Word spotting · CPC title

  • Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech (G10L21/02 takes precedence) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10839803B2 cover?
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for contextual hotwords are disclosed. In one aspect, a method, during a boot process of a computing device, includes the actions of determining, by a computing device, a context associated with the computing device. The actions further include, based on the context associated with the computing d…
Who is the assignee on this patent?
Google LLC
What technology area does this patent fall under?
Primary CPC classification G10L15/22. Mapped technology areas include Physics.
When was this patent published?
Publication date Nov 17, 2020 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).