Language models using spoken language modeling
US-2024386885-A1 · Nov 21, 2024 · US
US10510337B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-10510337-B2 |
| Application number | US-201815949145-A |
| Country | US |
| Kind code | B2 |
| Filing date | Apr 10, 2018 |
| Priority date | Jul 23, 2013 |
| Publication date | Dec 17, 2019 |
| Grant date | Dec 17, 2019 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A method on a mobile device for voice recognition training is described. A voice training mode is entered. A voice training sample for a user of the mobile device is recorded. The voice training mode is interrupted to enter a noise indicator mode based on a sample background noise level for the voice training sample and a sample background noise type for the voice training sample. The voice training mode is returned to from the noise indicator mode when the user provides a continuation input that indicates a current background noise level meets an indicator threshold value.
Opening claim text (preview).
We claim: 1. A method comprising: executing, by a processor of a mobile device, a first mode of the mobile device, the first mode configured to: display on a screen in communication with the processor a first graphical user interface including a prompt instructing a user associated with the mobile device to speak a designated phrase for training a voice recognition system of the mobile device, the voice recognition system configured to recognize a voice of the user; receive a first voice training sample corresponding to the user speaking the designated phrase; and determine whether a noise level for the received first voice training sample exceeds a predetermined threshold; and in response to determining that the noise level for the received first voice training sample exceeds the predetermined threshold, executing, by the processor, a second mode of the mobile device, the second mode configured to display on the screen of the mobile device a second graphical user interface comprising: a notification that recommends an environment conducive to voice training; and a graphical element, wherein the second mode is further configured to enable the graphical element of the second graphical user interface for selection by the user when a background noise level does not exceed the predetermined threshold, the enabled graphical element, when selected by the user, causes the processor to transition from executing in the second mode back to executing in the first mode. 2. The method of claim 1 , further comprising, in response to determining that the noise level for the first voice training sample exceeds the predetermined threshold: ceasing, by the processor, execution of the first mode of the mobile device; and rejecting, by the processor, the first voice training sample from use in training the voice recognition system. 3. The method of claim 1 , wherein the second mode is further configured to not process any voice samples spoken by the user during execution of the second mode of the mobile device. 4. The method of claim 1 , wherein the first mode is further configured to, when the noise level for the received first voice training sample does not exceed the predetermined threshold, process the first voice training sample for use in training the speech recognition system. 5. The method of claim 4 , wherein the first mode is further configured to display again, in the first graphical user interface, the prompt instructing the user to speak the designated phrase. 6. The method of claim 1 , wherein the designated phrase comprises a trigger phrase including one or more words. 7. A mobile device comprising: a processor; a screen in communication with the processor; and memory hardware in communication with the processor and storing instructions, that when executed by the processor, cause the processor to perform one or more operations comprising: executing a first mode of the mobile device, the first mode configured to: display on the screen a first graphical user interface including a prompt instructing a user associated with the mobile device to speak a designated phrase for training a voice recognition system of the mobile device, the voice recognition system configured to recognize a voice of the user; receive a first voice training sample corresponding to the user speaking the designated phrase; and determine whether a noise level for the received first voice training sample exceeds a predetermined threshold; and in response to determining that the noise level for the received first voice training sample exceeds the predetermined threshold, executing a second mode of the mobile device, the second mode configured to display on the screen a second graphical user interface comprising: a notification that recommends an environment conducive to voice training; and a graphical element, wherein the second mode is further configured to enable the graphical element of the second graphical user interface for selection by the user when a background noise level does not exceed the predetermined threshold, the enabled graphical element, when selected by the user, causes the processor to transition from executing in the second mode back to executing in the first mode. 8. The mobile device of claim 7 , wherein the operations further comprise, in response to determining that the noise level for the first voice training sample exceeds the predetermined threshold: ceasing execution of the first mode of the mobile device; and rejecting the first voice training sample from use in training the voice recognition system. 9. The mobile device of claim 7 , wherein the second mode is further configured to not process any voice samples spoken by the user during execution of the second mode of the mobile device. 10. The mobile device of claim 7 , wherein the first mode is further configured to, when the noise level for the received first voice training sample does not exceed the predetermined threshold, process the first voice training sample for use in training the speech recognition system. 11. The mobile device of claim 10 , wherein the first mode is further configured to display again, in the first graphical user interface, the prompt instructing the user to speak the designated phrase. 12. The mobile device of claim 7 , wherein the designated phrase comprises a trigger phrase including one or more words. 13. A method of voice recognition training, the method comprising: receiving, at a processor of a mobile device, a voice training sample corresponding to a user speaking a designated phrase while the mobile device is displaying a first user interface on a screen of the mobile device, the first user interface prompting the user to speak the designated phrase for training voice recognition software configured to recognize a voice of the user; determining, by the processor, whether a noise level for the received voice training sample exceeds a predetermined threshold; and in response to determining that the noise level for the received voice training sample exceeds the predetermined threshold, displaying, by the processor, a second user interface on the screen of the mobile device, the second user interface configured to display: a notification that recommends an environment conducive to voice training; and a graphical element, wherein the second mode is further configured to enable the graphical element of the second graphical user interface for selection by the user when a background noise level does not exceed the predetermined threshold, the enabled graphical element, when selected by the user, causes the processor to transition from executing in the second mode back to executing in the first mode. 14. The method of claim 13 , further comprising, in response to determining that the noise level for the received voice training sample exceeds the predetermined threshold, rejecting, by the mobile device, the voice training sample from use in training the voice recognition software. 15. The method of claim 13 , further comprising, when displaying the second user interface, preventing, by the processor, the mobile device from accepting voice samples spoken by the user. 16. The method of claim 13 , further comprising, when the determined noise level for the received voice training sample does not exceed the predetermined threshold, processing, by the processor, the voice training sample for use in training the speech recognition software. 17. The method of claim 13 , wherein the designated phrase comprises a trigger phrase including one or more words. 18.
Interactive procedures · CPC title
Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech (G10L21/02 takes precedence) · CPC title
for discriminating voice from noise · CPC title
Selection of displayed objects or displayed text elements (G06F3/0482 takes precedence) · CPC title
Terminal devices · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.