Dynamic enrollment of user-defined wake-up key-phrase for speech enabled computer system

US10672380B2 · US · B2

Patent metadata
Publication number: US-10672380-B2
Application number: US-201715855379-A
Country: US
Kind code: B2
Filing date: Dec 27, 2017
Priority date: Dec 27, 2017
Publication date: Jun 2, 2020
Grant date: Jun 2, 2020

Abstract

Official abstract text for this publication.

Techniques are provided for wake-on-voice (WOV) key-phrase enrollment. A methodology implementing the techniques according to an embodiment includes generating a WOV key-phrase model based on identification of the sequence of sub-phonetic units of a user-provided key-phrase. The WOV key-phrase model is employed by a WOV processor for detection of the user spoken key-phrase and triggering operation of an automatic speech recognition (ASR) processor in response to the detection. The method further includes updating an ASR language model based on the user-provided key-phrase. The update includes one of embedding the WOV key-phrase model into the ASR language model, converting sub-phonetic units of the WOV key-phrase model and embedding the converted WOV key-phrase model into the ASR language model, or generating an ASR key-phrase model by applying a phoneme-syllable based statistical language model to the user-provided key-phrase and embedding the generated ASR key-phrase model into the ASR language model.
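The enrollment and detection flow summarized in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the toy lexicon, the choice of three sub-phonetic states per phoneme, and the names `grapheme_to_phoneme`, `build_wov_model`, and `WovDetector` are all assumptions made for illustration. A real system would score acoustic frames rather than compare unit labels.

```python
from dataclasses import dataclass

# Hypothetical toy pronunciation lexicon standing in for grapheme-to-phoneme conversion.
TOY_LEXICON = {
    "hello": ["HH", "AH", "L", "OW"],
    "computer": ["K", "AH", "M", "P", "Y", "UW", "T", "ER"],
}

STATES_PER_PHONEME = 3  # assumption: three sub-phonetic states per phoneme


def grapheme_to_phoneme(text):
    """Convert a typed key-phrase to phonemes by lexicon lookup."""
    phonemes = []
    for word in text.lower().split():
        phonemes.extend(TOY_LEXICON[word])
    return phonemes


def build_wov_model(phonemes):
    """Expand phonemes into the ordered sequence of sub-phonetic unit labels."""
    return [f"{p}_{s}" for p in phonemes for s in range(STATES_PER_PHONEME)]


@dataclass
class WovDetector:
    """Tracks progress through the enrolled sub-phonetic sequence."""
    model: list
    position: int = 0

    def feed(self, unit):
        """Consume one decoded unit; return True once the full sequence is seen."""
        if unit == self.model[self.position]:
            self.position += 1          # advance to the next expected unit
        elif self.position > 0 and unit == self.model[self.position - 1]:
            pass                        # same unit repeated across frames: stay
        elif unit == self.model[0]:
            self.position = 1           # restart at the beginning of the phrase
        else:
            self.position = 0           # mismatch: reset
        if self.position == len(self.model):
            self.position = 0
            return True                 # key-phrase detected: trigger the ASR processor
        return False
```

In this sketch a detection is reported only when the units arrive in the enrolled order, which mirrors the "correct sequence of sub-phonetic units" wording in the claims.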

First claim

Opening claim text (preview).

What is claimed is:

1. A processor-implemented method for wake-on-voice (WOV) key-phrase enrollment, the method comprising: generating, by a processor-based system, a WOV key-phrase model based on a user-provided WOV enrollment key-phrase, the WOV key-phrase model employed by a WOV processor for detecting of a correct sequence of sub-phonetic units of the WOV key-phrase spoken by the user and triggering operation of an automatic speech recognition (ASR) processor in response to the WOV key-phrase detection; and updating, by the processor-based system, an ASR language model based on the user-provided WOV enrollment key-phrase, the ASR language model employed by the ASR processor for recognizing speech utterances spoken by the user, wherein updating the ASR language model comprises generating an ASR key-phrase model by applying a phoneme-syllable based statistical language model to the user-provided WOV enrollment key-phrase and incorporating the generated ASR key-phrase model into the ASR language model.

2. The method of claim 1, wherein the user-provided WOV enrollment key-phrase is provided as a text entry, the method further comprising performing a grapheme to phoneme conversion on the text entry for the generation of the WOV key-phrase model.

3. The method of claim 1, wherein the triggering of the ASR processor comprises waking the ASR processor from a lower power consuming idle state to a higher power consuming recognition state.

4. The method of claim 3, wherein the WOV processor consumes less power than the ASR processor when the ASR processor is in the higher power consuming recognition state.

5. A system for wake-on-voice (WOV) key-phrase enrollment, the system comprising: a WOV key-phrase model generation circuit to generate a WOV key-phrase model based on a user-provided WOV enrollment key-phrase, the WOV key-phrase model employed by a WOV processor for detecting of a correct sequence of sub-phonetic units of the WOV key-phrase spoken by the user and triggering operation of an automatic speech recognition (ASR) processor in response to the WOV key-phrase detection; an ASR model update circuit to update an ASR language model based on the user-provided WOV enrollment key-phrase, the ASR language model employed by the ASR processor for recognizing speech utterances spoken by the user; and an ASR key-phrase model generation circuit to generate an ASR key-phrase model by applying a phoneme-syllable based statistical language model to the user-provided WOV enrollment key-phrase and incorporate the generated ASR key-phrase model into the ASR language model.

6. The system of claim 5, wherein the user-provided WOV enrollment key-phrase is provided as a text entry, the system further comprises a grapheme to phoneme conversion circuit to convert the text entry to phonemes for the generation of the WOV key-phrase model.

7. The system of claim 5, wherein the triggering of the ASR processor comprises waking the ASR processor from a lower power consuming idle state to a higher power consuming recognition state.

8. The system of claim 7, wherein the WOV processor consumes less power than the ASR processor when the ASR processor is in the higher power consuming recognition state.

9. A processor-implemented method for wake-on-voice (WOV) key-phrase enrollment, the method comprising: generating, by a processor-based system, a WOV key-phrase model based on a user-provided WOV enrollment key-phrase, the WOV key-phrase model employed by a WOV processor for detecting of a correct sequence of sub-phonetic units of the WOV key-phrase spoken by the user and triggering operation of an automatic speech recognition (ASR) processor in response to the WOV key-phrase detection; and updating, by the processor-based system, an ASR language model based on the user-provided WOV enrollment key-phrase, the ASR language model employed by the ASR processor for recognizing speech utterances spoken by the user, wherein updating the ASR language model comprises performing a sub-phonetic conversion of the WOV key-phrase model and incorporating the converted WOV key-phrase model into the ASR language model.

10. The method of claim 9, wherein the user-provided WOV enrollment key-phrase is provided as a text entry, the method further comprising performing a grapheme to phoneme conversion on the text entry for the generation of the WOV key-phrase model.

11. The method of claim 9, wherein the triggering of the ASR processor comprises waking the ASR processor from a lower power consuming idle state to a higher power consuming recognition state.

12. The method of claim 11, wherein the WOV processor consumes less power than the ASR processor when the ASR processor is in the higher power consuming recognition state.
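The claims pair two effects of enrollment: the ASR processor is woken from an idle state when the key-phrase is detected, and the ASR language model is updated so the key-phrase itself can be recognized. The Python sketch below illustrates only that pairing. The class `ToyAsrProcessor`, its unigram count table, and the `weight` parameter are hypothetical stand-ins; the count table is a deliberately simplified substitute for the phoneme-syllable based statistical language model named in the claims.

```python
from collections import Counter
from enum import Enum


class PowerState(Enum):
    IDLE = "lower power consuming idle state"
    RECOGNIZING = "higher power consuming recognition state"


class ToyAsrProcessor:
    """Hypothetical ASR processor with a toy unigram language model."""

    def __init__(self, corpus):
        self.state = PowerState.IDLE
        # Toy language model: raw token counts (an assumption, not the patent's model).
        self.language_model = Counter(corpus)

    def incorporate_key_phrase(self, key_phrase, weight=1):
        """Add the enrolled key-phrase to the language model as a single token."""
        self.language_model[key_phrase.lower()] += weight

    def wake(self):
        """Called by the WOV processor when the key-phrase is detected."""
        self.state = PowerState.RECOGNIZING

    def probability(self, token):
        """Relative frequency of a token under the toy model."""
        total = sum(self.language_model.values())
        return self.language_model[token] / total if total else 0.0
```

Before `incorporate_key_phrase` is called, the key-phrase has zero probability under the toy model, which illustrates why the claims update the ASR language model in addition to generating the WOV key-phrase model.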

Assignees

Inventors

Classifications

  • Feature extraction for speech recognition; Selection of recognition unit · CPC title

  • Execution procedure of a spoken command · CPC title

  • G10L15/063 (Primary)

    Training · CPC title

  • Word spotting · CPC title

  • Phonemes, fenemes or fenones being the recognition units · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10672380B2 cover?
Techniques are provided for wake-on-voice (WOV) key-phrase enrollment. A methodology implementing the techniques according to an embodiment includes generating a WOV key-phrase model based on identification of the sequence of sub-phonetic units of a user-provided key-phrase. The WOV key-phrase model is employed by a WOV processor for detection of the user spoken key-phrase and triggering operat…
Who is the assignee on this patent?
Intel IP Corp
What technology area does this patent fall under?
Primary CPC classification G10L15/063. Mapped technology areas include Physics.
When was this patent published?
Publication date Jun 2, 2020 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).