Selective speech recognition scoring using articulatory features
US-9355636-B1 · May 31, 2016 · US
US9805712B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9805712-B2 |
| Application number | US-201414896588-A |
| Country | US |
| Kind code | B2 |
| Filing date | Dec 18, 2014 |
| Priority date | Apr 1, 2014 |
| Publication date | Oct 31, 2017 |
| Grant date | Oct 31, 2017 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A method for recognizing a voice and a device for recognizing a voice are provided. The method includes: collecting voice information input by a user; extracting characteristics from the voice information to obtain characteristic information; decoding the characteristic information according to an acoustic model and a language model obtained in advance to obtain recognized voice information, wherein the acoustic model is obtained by data compression in advance.
Opening claim text (preview).
What is claimed is: 1. A method for recognizing a voice, comprising: collecting, with a processor, voice information input by a user; extracting, with the processor, characteristics from the voice information to obtain characteristic information; decoding, with the processor, the characteristic information according to an acoustic model and a language model obtained in advance to obtain recognized voice information, wherein the acoustic model is obtained by data compression in advance; wherein decoding the characteristic information according to an acoustic model and a language model obtained in advance to obtain recognized voice information comprises: performing a data compression on the characteristic information to obtain compressed characteristic information, and calculating the compressed characteristic information according to the acoustic model that is obtained by the data compression in advance to obtain a score of acoustic model; calculating data after acoustic model scoring according to the language model to obtain a score of language model; obtaining the recognized voice information according to the score of acoustic model and the score of language model. 2. The method according to claim 1 , wherein after obtaining characteristic information, the method further comprises: filtering, with the processor, the characteristic information to obtain filtered characteristic information, so as to decode the filtered characteristic information. 3. The method according to claim 2 , wherein filtering the characteristic information comprises: performing an extraction of frame skipping on the characteristic information. 4. The method according to claim 1 , wherein calculating the compressed characteristic information comprises: performing a parallel computation on the compressed characteristic information. 5. The method according to claim 4 , wherein the parallel computation comprises at least one of data parallel computation, instruction parallel computation and thread parallel computation. 6. A device for recognizing a voice, comprising: a collecting module, configured to collect with a processor, voice information input by a user; an extracting module, configured to extract with the processor, characteristics from the voice information to obtain characteristic information; a decoding module, configured to decode with the processor, the characteristic information according to an acoustic model and a language model obtained in advance to obtain recognized voice information, wherein the acoustic model is obtained by data compression in advance; wherein the decoding module is configured to: perform a data compression on the characteristic information to obtain compressed characteristic information, and calculate the compressed characteristic information according to the acoustic model that is obtained by the data compression in advance to obtain a score of acoustic model; calculate data after acoustic model scoring according to the language model to obtain a score of language model; obtain the recognized voice information according to the score of acoustic model and the score of language model. 7. The device according to claim 6 , further comprising: a filtering module, configured to filter with the processor, the characteristic information to obtain filtered characteristic information, so as to decode the filtered characteristic information. 8. The device according to claim 7 , wherein the filtering module is configured to perform an extraction of frame skipping on the characteristic information. 9. The device according to claim 6 , wherein the decoding module calculates the compressed characteristic information by: performing a parallel computation on the compressed characteristic information. 10. The device according to claim 9 , wherein the parallel computation comprises at least one of data parallel computation, instruction parallel computation and thread parallel computation. 11. A mobile device, comprising: one or more processors; a memory; one or more programs, wherein the one or more programs are stored in the memory, and when executed by the one or more processors, perform following operations: collecting voice information input by a user; extracting characteristics from the voice information to obtain characteristic information; decoding the characteristic information according to an acoustic model and a language model obtained in advance to obtain recognized voice information, wherein the acoustic model is obtained by data compression in advance; wherein decoding the characteristic information according to an acoustic model and a language model obtained in advance to obtain recognized voice information comprises: performing a data compression on the characteristic information to obtain compressed characteristic information, and calculating the compressed characteristic information according to the acoustic model that is obtained by the data compression in advance to obtain a score of acoustic model; calculating data after acoustic model scoring according to the language model to obtain a score of language model; obtaining the recognized voice information according to the score of acoustic model and the score of language model.
Feature extraction for speech recognition; Selection of recognition unit · CPC title
Subband vocoders · CPC title
Assessment or evaluation of speech recognition systems · CPC title
using context dependencies, e.g. language models · CPC title
Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.