Who is the assignee on this patent?

Baidu online network technology beijing co ltd

What technology area does this patent fall under?

Primary CPC classification G10L15/02. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Oct 31 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Method and device for recognizing voice

US9805712B2 · US · B2

Patent metadata
Field	Value
Publication number	US-9805712-B2
Application number	US-201414896588-A
Country	US
Kind code	B2
Filing date	Dec 18, 2014
Priority date	Apr 1, 2014
Publication date	Oct 31, 2017
Grant date	Oct 31, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for recognizing a voice and a device for recognizing a voice are provided. The method includes: collecting voice information input by a user; extracting characteristics from the voice information to obtain characteristic information; decoding the characteristic information according to an acoustic model and a language model obtained in advance to obtain recognized voice information, wherein the acoustic model is obtained by data compression in advance.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for recognizing a voice, comprising: collecting, with a processor, voice information input by a user; extracting, with the processor, characteristics from the voice information to obtain characteristic information; decoding, with the processor, the characteristic information according to an acoustic model and a language model obtained in advance to obtain recognized voice information, wherein the acoustic model is obtained by data compression in advance; wherein decoding the characteristic information according to an acoustic model and a language model obtained in advance to obtain recognized voice information comprises: performing a data compression on the characteristic information to obtain compressed characteristic information, and calculating the compressed characteristic information according to the acoustic model that is obtained by the data compression in advance to obtain a score of acoustic model; calculating data after acoustic model scoring according to the language model to obtain a score of language model; obtaining the recognized voice information according to the score of acoustic model and the score of language model. 2. The method according to claim 1 , wherein after obtaining characteristic information, the method further comprises: filtering, with the processor, the characteristic information to obtain filtered characteristic information, so as to decode the filtered characteristic information. 3. The method according to claim 2 , wherein filtering the characteristic information comprises: performing an extraction of frame skipping on the characteristic information. 4. The method according to claim 1 , wherein calculating the compressed characteristic information comprises: performing a parallel computation on the compressed characteristic information. 5. The method according to claim 4 , wherein the parallel computation comprises at least one of data parallel computation, instruction parallel computation and thread parallel computation. 6. A device for recognizing a voice, comprising: a collecting module, configured to collect with a processor, voice information input by a user; an extracting module, configured to extract with the processor, characteristics from the voice information to obtain characteristic information; a decoding module, configured to decode with the processor, the characteristic information according to an acoustic model and a language model obtained in advance to obtain recognized voice information, wherein the acoustic model is obtained by data compression in advance; wherein the decoding module is configured to: perform a data compression on the characteristic information to obtain compressed characteristic information, and calculate the compressed characteristic information according to the acoustic model that is obtained by the data compression in advance to obtain a score of acoustic model; calculate data after acoustic model scoring according to the language model to obtain a score of language model; obtain the recognized voice information according to the score of acoustic model and the score of language model. 7. The device according to claim 6 , further comprising: a filtering module, configured to filter with the processor, the characteristic information to obtain filtered characteristic information, so as to decode the filtered characteristic information. 8. The device according to claim 7 , wherein the filtering module is configured to perform an extraction of frame skipping on the characteristic information. 9. The device according to claim 6 , wherein the decoding module calculates the compressed characteristic information by: performing a parallel computation on the compressed characteristic information. 10. The device according to claim 9 , wherein the parallel computation comprises at least one of data parallel computation, instruction parallel computation and thread parallel computation. 11. A mobile device, comprising: one or more processors; a memory; one or more programs, wherein the one or more programs are stored in the memory, and when executed by the one or more processors, perform following operations: collecting voice information input by a user; extracting characteristics from the voice information to obtain characteristic information; decoding the characteristic information according to an acoustic model and a language model obtained in advance to obtain recognized voice information, wherein the acoustic model is obtained by data compression in advance; wherein decoding the characteristic information according to an acoustic model and a language model obtained in advance to obtain recognized voice information comprises: performing a data compression on the characteristic information to obtain compressed characteristic information, and calculating the compressed characteristic information according to the acoustic model that is obtained by the data compression in advance to obtain a score of acoustic model; calculating data after acoustic model scoring according to the language model to obtain a score of language model; obtaining the recognized voice information according to the score of acoustic model and the score of language model.

Assignees

Baidu online network technology beijing co ltd

Inventors

Classifications

G10L15/02Primary
Feature extraction for speech recognition; Selection of recognition unit · CPC title
G10L19/0208
Subband vocoders · CPC title
G10L15/01Primary
Assessment or evaluation of speech recognition systems · CPC title
G10L15/183
using context dependencies, e.g. language models · CPC title
G10L19/022
Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring · CPC title

Patent family

Related publications grouped by family.

View patent family 51040722

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9805712B2 cover?: A method for recognizing a voice and a device for recognizing a voice are provided. The method includes: collecting voice information input by a user; extracting characteristics from the voice information to obtain characteristic information; decoding the characteristic information according to an acoustic model and a language model obtained in advance to obtain recognized voice information, wh…
Who is the assignee on this patent?: Baidu online network technology beijing co ltd
What technology area does this patent fall under?: Primary CPC classification G10L15/02. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Oct 31 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).