Language models using spoken language modeling
US-2024386885-A1 · Nov 21, 2024 · US
US9875744B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9875744-B2 |
| Application number | US-201715467028-A |
| Country | US |
| Kind code | B2 |
| Filing date | Mar 23, 2017 |
| Priority date | Jul 23, 2013 |
| Publication date | Jan 23, 2018 |
| Grant date | Jan 23, 2018 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A method on a mobile device for voice recognition training is described. A voice training mode is entered. A voice training sample for a user of the mobile device is recorded. The voice training mode is interrupted to enter a noise indicator mode based on a sample background noise level for the voice training sample and a sample background noise type for the voice training sample. The voice training mode is returned to from the noise indicator mode when the user provides a continuation input that indicates a current background noise level meets an indicator threshold value.
Opening claim text (preview).
We claim: 1. A computer-implemented method comprising: providing, by a mobile computing device, a first user interface indicating that (i) a background noise indicator mode is active, and (ii) a background noise loudness satisfies a loudness threshold or a background noise variance satisfies a variance threshold, the first user interface including a disabled continuation indicator; upon determining that the background noise loudness no longer satisfies the loudness threshold or the background noise variance no longer satisfies the variance threshold, enabling, by the mobile computing device, the disabled continuation indicator include on the first user interface; and in response to receiving data indicating a selection of the enabled continuation indicator, providing, by the mobile computing device, a second user interface indicating that (i) a voice training mode is active, and (ii) a voice training sample is to be spoken by a user. 2. The method of claim 1 , wherein the first user interface includes a dial-type interface and a needle that indicates a level of the background noise. 3. The method of claim 1 , wherein the first user interface does not indicate that (i) the voice training mode is active, and (ii) the voice training sample is to be spoken by the user. 4. The method of claim 1 , wherein the second user interface does not indicate that (i) the background noise indicator mode is active, and (ii) the background noise loudness satisfies the loudness threshold or the background noise variance satisfies the variance threshold. 5. The method of claim 1 , wherein the loudness threshold is based on the background noise variance. 6. The method of claim 1 , comprising: after providing the second user interface and before the voice training sample is spoken by the user, automatically providing, by the mobile computing device, the first user interface based on the background noise loudness satisfying the loudness threshold or the background noise variance satisfying the variance threshold. 7. A system comprising: one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising: providing, by a mobile computing device, a first user interface indicating that (i) a background noise indicator mode is active, and (ii) a background noise loudness satisfies a loudness threshold or a background noise variance satisfies a variance threshold, the first user interface including a disabled continuation indicator; upon determining that the background noise loudness no longer satisfies the loudness threshold or the background noise variance no longer satisfies the variance threshold, enabling, by the mobile computing device, the disabled continuation indicator include on the first user interface; and in response to receiving data indicating a selection of the enabled continuation indicator, providing, by the mobile computing device, a second user interface indicating that (i) a voice training mode is active, and (ii) a voice training sample is to be spoken by a user. 8. The system of claim 7 , wherein the first user interface includes a dial-type interface and a needle that indicates a level of the background noise. 9. The system of claim 7 , wherein the first user interface does not indicate that (i) the voice training mode is active, and (ii) the voice training sample is to be spoken by the user. 10. The system of claim 7 , wherein the second user interface does not indicate that (i) the background noise indicator mode is active, and (ii) the background noise loudness satisfies the loudness threshold or the background noise variance satisfies the variance threshold. 11. The system of claim 7 , wherein the loudness threshold is based on the background noise variance. 12. The system of claim 7 , wherein the operations further comprise: after providing the second user interface and before the voice training sample is spoken by the user, automatically providing, by the mobile computing device, the first user interface based on the background noise loudness satisfying the loudness threshold or the background noise variance satisfying the variance threshold. 13. A non-transitory computer-readable medium storing software comprising instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising: providing, by a mobile computing device, a first user interface indicating that (i) a background noise indicator mode is active, and (ii) a background noise loudness satisfies a loudness threshold or a background noise variance satisfies a variance threshold, the first user interface including a disabled continuation indicator; upon determining that the background noise loudness no longer satisfies the loudness threshold or the background noise variance no longer satisfies the variance threshold, enabling, by the mobile computing device, the disabled continuation indicator include on the first user interface; and in response to receiving data indicating a selection of the enabled continuation indicator, providing, by the mobile computing device, a second user interface indicating that (i) a voice training mode is active, and (ii) a voice training sample is to be spoken by a user. 14. The medium of claim 13 , wherein the first user interface includes a dial-type interface and a needle that indicates a level of the background noise. 15. The medium of claim 13 , wherein the first user interface does not indicate that (i) the voice training mode is active, and (ii) the voice training sample is to be spoken by the user. 16. The medium of claim 13 , wherein the second user interface does not indicate that (i) the background noise indicator mode is active, and (ii) the background noise loudness satisfies the loudness threshold or the background noise variance satisfies the variance threshold. 17. The medium of claim 13 , wherein the operations further comprise: after providing the second user interface and before the voice training sample is spoken by the user, automatically providing, by the mobile computing device, the first user interface based on the background noise loudness satisfying the loudness threshold or the background noise variance satisfying the variance threshold. 18. The medium of claim 13 , wherein the loudness threshold is based on the background noise variance.
Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech (G10L21/02 takes precedence) · CPC title
Training · CPC title
Training, enrolment or model building · CPC title
Interactive procedures · CPC title
for discriminating voice from noise · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.