Method and device for voice recognition training

US10510337B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10510337-B2
Application numberUS-201815949145-A
CountryUS
Kind codeB2
Filing dateApr 10, 2018
Priority dateJul 23, 2013
Publication dateDec 17, 2019
Grant dateDec 17, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method on a mobile device for voice recognition training is described. A voice training mode is entered. A voice training sample for a user of the mobile device is recorded. The voice training mode is interrupted to enter a noise indicator mode based on a sample background noise level for the voice training sample and a sample background noise type for the voice training sample. The voice training mode is returned to from the noise indicator mode when the user provides a continuation input that indicates a current background noise level meets an indicator threshold value.

First claim

Opening claim text (preview).

We claim: 1. A method comprising: executing, by a processor of a mobile device, a first mode of the mobile device, the first mode configured to: display on a screen in communication with the processor a first graphical user interface including a prompt instructing a user associated with the mobile device to speak a designated phrase for training a voice recognition system of the mobile device, the voice recognition system configured to recognize a voice of the user; receive a first voice training sample corresponding to the user speaking the designated phrase; and determine whether a noise level for the received first voice training sample exceeds a predetermined threshold; and in response to determining that the noise level for the received first voice training sample exceeds the predetermined threshold, executing, by the processor, a second mode of the mobile device, the second mode configured to display on the screen of the mobile device a second graphical user interface comprising: a notification that recommends an environment conducive to voice training; and a graphical element, wherein the second mode is further configured to enable the graphical element of the second graphical user interface for selection by the user when a background noise level does not exceed the predetermined threshold, the enabled graphical element, when selected by the user, causes the processor to transition from executing in the second mode back to executing in the first mode. 2. The method of claim 1 , further comprising, in response to determining that the noise level for the first voice training sample exceeds the predetermined threshold: ceasing, by the processor, execution of the first mode of the mobile device; and rejecting, by the processor, the first voice training sample from use in training the voice recognition system. 3. The method of claim 1 , wherein the second mode is further configured to not process any voice samples spoken by the user during execution of the second mode of the mobile device. 4. The method of claim 1 , wherein the first mode is further configured to, when the noise level for the received first voice training sample does not exceed the predetermined threshold, process the first voice training sample for use in training the speech recognition system. 5. The method of claim 4 , wherein the first mode is further configured to display again, in the first graphical user interface, the prompt instructing the user to speak the designated phrase. 6. The method of claim 1 , wherein the designated phrase comprises a trigger phrase including one or more words. 7. A mobile device comprising: a processor; a screen in communication with the processor; and memory hardware in communication with the processor and storing instructions, that when executed by the processor, cause the processor to perform one or more operations comprising: executing a first mode of the mobile device, the first mode configured to: display on the screen a first graphical user interface including a prompt instructing a user associated with the mobile device to speak a designated phrase for training a voice recognition system of the mobile device, the voice recognition system configured to recognize a voice of the user; receive a first voice training sample corresponding to the user speaking the designated phrase; and determine whether a noise level for the received first voice training sample exceeds a predetermined threshold; and in response to determining that the noise level for the received first voice training sample exceeds the predetermined threshold, executing a second mode of the mobile device, the second mode configured to display on the screen a second graphical user interface comprising: a notification that recommends an environment conducive to voice training; and a graphical element, wherein the second mode is further configured to enable the graphical element of the second graphical user interface for selection by the user when a background noise level does not exceed the predetermined threshold, the enabled graphical element, when selected by the user, causes the processor to transition from executing in the second mode back to executing in the first mode. 8. The mobile device of claim 7 , wherein the operations further comprise, in response to determining that the noise level for the first voice training sample exceeds the predetermined threshold: ceasing execution of the first mode of the mobile device; and rejecting the first voice training sample from use in training the voice recognition system. 9. The mobile device of claim 7 , wherein the second mode is further configured to not process any voice samples spoken by the user during execution of the second mode of the mobile device. 10. The mobile device of claim 7 , wherein the first mode is further configured to, when the noise level for the received first voice training sample does not exceed the predetermined threshold, process the first voice training sample for use in training the speech recognition system. 11. The mobile device of claim 10 , wherein the first mode is further configured to display again, in the first graphical user interface, the prompt instructing the user to speak the designated phrase. 12. The mobile device of claim 7 , wherein the designated phrase comprises a trigger phrase including one or more words. 13. A method of voice recognition training, the method comprising: receiving, at a processor of a mobile device, a voice training sample corresponding to a user speaking a designated phrase while the mobile device is displaying a first user interface on a screen of the mobile device, the first user interface prompting the user to speak the designated phrase for training voice recognition software configured to recognize a voice of the user; determining, by the processor, whether a noise level for the received voice training sample exceeds a predetermined threshold; and in response to determining that the noise level for the received voice training sample exceeds the predetermined threshold, displaying, by the processor, a second user interface on the screen of the mobile device, the second user interface configured to display: a notification that recommends an environment conducive to voice training; and a graphical element, wherein the second mode is further configured to enable the graphical element of the second graphical user interface for selection by the user when a background noise level does not exceed the predetermined threshold, the enabled graphical element, when selected by the user, causes the processor to transition from executing in the second mode back to executing in the first mode. 14. The method of claim 13 , further comprising, in response to determining that the noise level for the received voice training sample exceeds the predetermined threshold, rejecting, by the mobile device, the voice training sample from use in training the voice recognition software. 15. The method of claim 13 , further comprising, when displaying the second user interface, preventing, by the processor, the mobile device from accepting voice samples spoken by the user. 16. The method of claim 13 , further comprising, when the determined noise level for the received voice training sample does not exceed the predetermined threshold, processing, by the processor, the voice training sample for use in training the speech recognition software. 17. The method of claim 13 , wherein the designated phrase comprises a trigger phrase including one or more words. 18.

Assignees

Inventors

Classifications

  • Interactive procedures · CPC title

  • Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech (G10L21/02 takes precedence) · CPC title

  • for discriminating voice from noise · CPC title

  • Selection of displayed objects or displayed text elements (G06F3/0482 takes precedence) · CPC title

  • Terminal devices · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10510337B2 cover?
A method on a mobile device for voice recognition training is described. A voice training mode is entered. A voice training sample for a user of the mobile device is recorded. The voice training mode is interrupted to enter a noise indicator mode based on a sample background noise level for the voice training sample and a sample background noise type for the voice training sample. The voice tra…
Who is the assignee on this patent?
Google Technology Holdings LLC, Google Llc
What technology area does this patent fall under?
Primary CPC classification G10L15/063. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 17 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).