Method and device for recognizing voice

US9805712B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9805712-B2
Application numberUS-201414896588-A
CountryUS
Kind codeB2
Filing dateDec 18, 2014
Priority dateApr 1, 2014
Publication dateOct 31, 2017
Grant dateOct 31, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for recognizing a voice and a device for recognizing a voice are provided. The method includes: collecting voice information input by a user; extracting characteristics from the voice information to obtain characteristic information; decoding the characteristic information according to an acoustic model and a language model obtained in advance to obtain recognized voice information, wherein the acoustic model is obtained by data compression in advance.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for recognizing a voice, comprising: collecting, with a processor, voice information input by a user; extracting, with the processor, characteristics from the voice information to obtain characteristic information; decoding, with the processor, the characteristic information according to an acoustic model and a language model obtained in advance to obtain recognized voice information, wherein the acoustic model is obtained by data compression in advance; wherein decoding the characteristic information according to an acoustic model and a language model obtained in advance to obtain recognized voice information comprises: performing a data compression on the characteristic information to obtain compressed characteristic information, and calculating the compressed characteristic information according to the acoustic model that is obtained by the data compression in advance to obtain a score of acoustic model; calculating data after acoustic model scoring according to the language model to obtain a score of language model; obtaining the recognized voice information according to the score of acoustic model and the score of language model. 2. The method according to claim 1 , wherein after obtaining characteristic information, the method further comprises: filtering, with the processor, the characteristic information to obtain filtered characteristic information, so as to decode the filtered characteristic information. 3. The method according to claim 2 , wherein filtering the characteristic information comprises: performing an extraction of frame skipping on the characteristic information. 4. The method according to claim 1 , wherein calculating the compressed characteristic information comprises: performing a parallel computation on the compressed characteristic information. 5. The method according to claim 4 , wherein the parallel computation comprises at least one of data parallel computation, instruction parallel computation and thread parallel computation. 6. A device for recognizing a voice, comprising: a collecting module, configured to collect with a processor, voice information input by a user; an extracting module, configured to extract with the processor, characteristics from the voice information to obtain characteristic information; a decoding module, configured to decode with the processor, the characteristic information according to an acoustic model and a language model obtained in advance to obtain recognized voice information, wherein the acoustic model is obtained by data compression in advance; wherein the decoding module is configured to: perform a data compression on the characteristic information to obtain compressed characteristic information, and calculate the compressed characteristic information according to the acoustic model that is obtained by the data compression in advance to obtain a score of acoustic model; calculate data after acoustic model scoring according to the language model to obtain a score of language model; obtain the recognized voice information according to the score of acoustic model and the score of language model. 7. The device according to claim 6 , further comprising: a filtering module, configured to filter with the processor, the characteristic information to obtain filtered characteristic information, so as to decode the filtered characteristic information. 8. The device according to claim 7 , wherein the filtering module is configured to perform an extraction of frame skipping on the characteristic information. 9. The device according to claim 6 , wherein the decoding module calculates the compressed characteristic information by: performing a parallel computation on the compressed characteristic information. 10. The device according to claim 9 , wherein the parallel computation comprises at least one of data parallel computation, instruction parallel computation and thread parallel computation. 11. A mobile device, comprising: one or more processors; a memory; one or more programs, wherein the one or more programs are stored in the memory, and when executed by the one or more processors, perform following operations: collecting voice information input by a user; extracting characteristics from the voice information to obtain characteristic information; decoding the characteristic information according to an acoustic model and a language model obtained in advance to obtain recognized voice information, wherein the acoustic model is obtained by data compression in advance; wherein decoding the characteristic information according to an acoustic model and a language model obtained in advance to obtain recognized voice information comprises: performing a data compression on the characteristic information to obtain compressed characteristic information, and calculating the compressed characteristic information according to the acoustic model that is obtained by the data compression in advance to obtain a score of acoustic model; calculating data after acoustic model scoring according to the language model to obtain a score of language model; obtaining the recognized voice information according to the score of acoustic model and the score of language model.

Assignees

Inventors

Classifications

  • G10L15/02Primary

    Feature extraction for speech recognition; Selection of recognition unit · CPC title

  • Subband vocoders · CPC title

  • G10L15/01Primary

    Assessment or evaluation of speech recognition systems · CPC title

  • using context dependencies, e.g. language models · CPC title

  • Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9805712B2 cover?
A method for recognizing a voice and a device for recognizing a voice are provided. The method includes: collecting voice information input by a user; extracting characteristics from the voice information to obtain characteristic information; decoding the characteristic information according to an acoustic model and a language model obtained in advance to obtain recognized voice information, wh…
Who is the assignee on this patent?
Baidu online network technology beijing co ltd
What technology area does this patent fall under?
Primary CPC classification G10L15/02. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Oct 31 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).