Wake word selection assistance architectures and methods

US11790891B2 · US · B2

Patent metadata
Field               Value
Publication number  US-11790891-B2
Application number  US-202117539622-A
Country             US
Kind code           B2
Filing date         Dec 1, 2021
Priority date       May 5, 2019
Publication date    Oct 17, 2023
Grant date          Oct 17, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Generally discussed herein are devices, systems, and methods for custom wake word selection assistance. A method can include receiving, at a device, data indicating a custom wake word provided by a user, determining one or more characteristics of the custom wake word, determining that use of the custom wake word will cause more than a threshold rate of false detections based on the characteristics, rejecting the custom wake word as the wake word for accessing a personal assistant in response to determining that use of the custom wake word will cause more than a threshold rate of false detections, and setting the custom wake word as the wake word in response to determining that use of the custom wake word will not cause more than the threshold rate of false detections.
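
The accept-or-reject flow in the abstract can be sketched roughly as follows. This is an illustrative sketch, not the patented implementation: the names `Decision`, `estimate_false_detection_rate`, and `choose_wake_word` are hypothetical, the detector is a stand-in callable that returns a detection count per clip, and the clips are plain text strings standing in for audio that contains no intentional use of the wake word.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, Tuple


@dataclass
class Decision:
    wake_word: str                    # wake word in effect after the decision
    accepted: bool                    # True if the custom word was set
    false_detections_per_hour: float  # estimated rate used for the decision


def estimate_false_detection_rate(
    detector: Callable[[str], int],
    negative_clips: Iterable[Tuple[str, float]],
) -> float:
    """Return detections per hour over clips that should trigger nothing.

    Each item is (clip, duration_hours). The detector is an assumption:
    any callable that reports how many times it fired on a clip.
    """
    detections = 0
    hours = 0.0
    for clip, duration_hours in negative_clips:
        detections += detector(clip)
        hours += duration_hours
    if hours <= 0:
        raise ValueError("need a positive amount of negative audio")
    return detections / hours


def choose_wake_word(
    current: str,
    candidate: str,
    detector: Callable[[str], int],
    negative_clips: Iterable[Tuple[str, float]],
    max_rate: float,
) -> Decision:
    """Reject the candidate if its estimated rate exceeds max_rate."""
    rate = estimate_false_detection_rate(detector, negative_clips)
    if rate > max_rate:
        return Decision(current, False, rate)   # keep the existing wake word
    return Decision(candidate, True, rate)      # set the custom wake word


# Toy usage: text stands in for audio, substring counting for a detector.
clips = [
    ("turn the lights on please", 0.5),
    ("my computer science class starts soon", 0.5),
]
decision = choose_wake_word(
    current="assistant",
    candidate="computer",
    detector=lambda text: text.count("computer"),
    negative_clips=clips,
    max_rate=0.5,
)
```

In this toy run the candidate "computer" fires once in one hour of material, which exceeds the assumed limit of 0.5 per hour, so the existing wake word is kept.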

First claim

Opening claim text (preview).

What is claimed is:
1. A method for custom wake word selection assistance, the method comprising: receiving, at a device, data indicating a custom wake word provided by a user; determining one or more characteristics of the custom wake word including counting a number of unique phonemes within the custom wake word; and updating the custom wake word as the wake word in response to determining that the number of unique phonemes is greater than a threshold number of unique phonemes.
2. The method of claim 1, wherein the method further includes refraining from updating the custom wake word as the wake word in response to determining the number of unique phonemes is less than a threshold number of unique phonemes.
3. The method of claim 1, further comprising: providing, to the user, a series of possible pronunciations of the custom wake word; receiving, by the user, data indicating which of the possible pronunciations were selected by the user; and determining the one or more characteristics based on the selected pronunciations.
4. The method of claim 3, wherein the possible pronunciations are provided to the user in text form or audio form.
5. The method of claim 1, further comprising receiving an acceptable false acceptance rate from the user.
6. The method of claim 5, wherein determining the one or more characteristics of the custom wake word further includes determining, using a speech recognition model, a false acceptance rate based on audio that includes no utterances of the custom wake word, and the method further comprises rejecting the custom wake word as the wake word in response to determining the false acceptance rate is greater than the acceptable false acceptance rate.
7. The method of claim 1, wherein determining the one or more characteristics of the custom wake word further includes determining, using a speech recognition model, a correct acceptance rate based on audio that includes the custom wake word, the correct acceptance rate indicating a number of correct acceptances of the custom wake word per unit time.
8. The method of claim 7, wherein the audio is synthetically generated using various voice fonts, emotions, and prosody.
9. The method of claim 8, wherein a font of the fonts is determined based on a location of the user.
10. The method of claim 1, wherein determining the one or more characteristics of the custom wake word further includes counting a number of stop sounds and plosives in the custom wake word and rejecting the custom wake word as the wake word in response to determining the number of stop sounds and plosives is less than a threshold number.
11. The method of claim 1, wherein: determining the one or more characteristics of the custom wake word further includes determining at least two characteristics including a number of unique phonemes in the custom wake word and a false acceptance rate of the custom wake word, the method further includes determining a weighted combination of the at least two characteristics, and rejecting the custom wake word as the wake word in response to determining the weight combination is less than a threshold.
12. A system comprising: processing circuitry; memory including instructions that, when executed by the processing circuitry, cause the processing circuitry to perform operations for custom wake word selection, the operations comprising: receiving, at a device, data indicating a custom wake word provided by a user; determining one or more characteristics of the custom wake word including counting a number of unique phonemes within the custom wake word; and updating the custom wake word as the wake word in response to determining that the number of unique phonemes is greater than a threshold number of unique phonemes.
13. The system of claim 12, wherein the operations further include refraining from updating the custom wake word as the wake word in response to determining the number of unique phonemes is less than a threshold number of unique phonemes.
14. The system of claim 12, wherein the operations further include: providing, to the user, a series of possible pronunciations of the custom wake word; receiving, by the user, data indicating which of the possible pronunciations were selected by the user; and determining the one or more characteristics based on the selected pronunciations.
15. The system of claim 14, wherein the possible pronunciations are provided to the user in text form or audio form.
16. The system of claim 12, wherein the operations further comprise receiving an acceptable false acceptance rate from the user.
17. The system of claim 16, wherein determining the one or more characteristics of the custom wake word further includes determining, using a speech recognition model, a false acceptance rate based on audio that includes no utterances of the custom wake word, and the operations further comprise rejecting the custom wake word as the wake word in response to determining the false acceptance rate is greater than the acceptable false acceptance rate.
18. A non-transitory machine-readable medium including instructions that, when executed by a machine, cause the machine to perform operations of custom wake word selection assistance, the operations comprising: Receiving data indicating a custom wake word provided by a user; determining one or more characteristics of the custom wake word including counting a number of unique phonemes within the custom wake word; and updating the custom wake word as the wake word in response to determining that the number of unique phonemes is greater than a threshold number of unique phonemes.
19. The non-transitory machine-readable medium of claim 18, wherein determining the one or more characteristics of the custom wake word further includes determining, using a speech recognition model, a correct acceptance rate based on audio that includes the custom wake word, the correct acceptance rate indicating a number of correct acceptances of the custom wake word per unit time.
20. The non-transitory machine-readable medium of claim 19, wherein: the audio is synthetically generated using various voice fonts, emotions, and prosody; and a font of the fonts is determined based on a location of the user.
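
The phoneme-based checks recited in claims 1, 10, and 11 might look something like the sketch below. Everything here is an assumption made for illustration: the function names are hypothetical, pronunciations are hand-entered ARPAbet-style symbols with stress marks removed, the stop/plosive set is limited to the six oral stops P, B, T, D, K, G, and the claims do not specify how the weighted combination is formed, so the choice to subtract the false acceptance rate is a guess at one plausible form.

```python
from typing import Sequence

# Assumed stop/plosive inventory (ARPAbet oral stops); the patent text
# does not enumerate which phonemes count.
PLOSIVES = frozenset({"P", "B", "T", "D", "K", "G"})


def count_unique_phonemes(phonemes: Sequence[str]) -> int:
    """Number of distinct phoneme symbols in the pronunciation."""
    return len(set(phonemes))


def count_plosives(phonemes: Sequence[str]) -> int:
    """Number of phoneme occurrences that fall in the assumed plosive set."""
    return sum(1 for p in phonemes if p in PLOSIVES)


def passes_phoneme_check(phonemes: Sequence[str], min_unique: int) -> bool:
    """Claim 1 style test: strictly more unique phonemes than the threshold."""
    return count_unique_phonemes(phonemes) > min_unique


def weighted_score(
    unique_phonemes: int,
    false_acceptance_rate: float,
    w_unique: float,
    w_far: float,
) -> float:
    """One possible weighted combination (claim 11 leaves the form open).

    More unique phonemes raise the score; a higher false acceptance rate
    lowers it. A caller would reject the word if the score is below a
    chosen threshold.
    """
    return w_unique * unique_phonemes - w_far * false_acceptance_rate


# Illustrative, hand-entered pronunciations (not looked up from a lexicon).
computer = ["K", "AH", "M", "P", "Y", "UW", "T", "ER"]
mama = ["M", "AA", "M", "AH"]

computer_unique = count_unique_phonemes(computer)    # 8 distinct symbols
computer_plosives = count_plosives(computer)         # K, P, T
mama_ok = passes_phoneme_check(mama, min_unique=5)   # only 3 distinct symbols
score = weighted_score(computer_unique, 1.0, w_unique=1.0, w_far=2.0)
```

With an assumed threshold of five unique phonemes, "computer" passes and "mama" does not. The thresholds and weights are placeholders; the claims leave their values open.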

Assignees

Microsoft Technology Licensing Llc

Inventors

Classifications

  • G10L15/07 (Primary)

    to the speaker · CPC title

  • Speech synthesis; Text to speech systems · CPC title

  • Voice editing, e.g. manipulating the voice of the synthesiser · CPC title

  • Prosody rules derived from text; Stress or intonation · CPC title

  • Feature extraction for speech recognition; Selection of recognition unit · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11790891B2 cover?
It covers devices, systems, and methods for custom wake word selection assistance. The Abstract section above gives the full official summary.
Who is the assignee on this patent?
Microsoft Technology Licensing Llc
What technology area does this patent fall under?
Primary CPC classification G10L15/07. Mapped technology areas include Physics.
When was this patent published?
Publication date Oct 17, 2023 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).