Speech search device and speech search method

US2016336007A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2016336007-A1
Application numberUS-201415111860-A
CountryUS
Kind codeA1
Filing dateFeb 6, 2014
Priority dateFeb 6, 2014
Publication dateNov 17, 2016
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Disclosed is a speech search device including a recognizer 2 that refers to an acoustic model and language models having different learning data and performs voice recognition on an input speech, to acquire a recognized character string for each language model, a character string comparator 6 that compares the recognized character string for each language models with the character strings of search target words stored in a character string dictionary, and calculates a character string matching score showing the degree of matching of the recognized character string with respect to each of the character strings of the search target words, to acquire both a character string having the highest character string matching score and this character string matching score for each recognized character strings, and a search result determinator 8 that refers to the acquired score and outputs one or more search target words in descending order of the scores.

First claim

Opening claim text (preview).

1 . A speech search device comprising: a recognizer to refer to an acoustic model and a plurality of language models having different learning data and perform voice recognition on an input speech, to acquire an acoustic likelihood and a language likelihood of a recognized character string for each of said plurality of language models; a character string dictionary storage to store a character string dictionary in which pieces of information showing character strings of search target words each serving as a target for speech search are stored; a character string comparator to compare the recognized character string for each of said plurality of language models, the recognized character string being acquired by said recognizer, with the character strings of the search target words which are stored in said character string dictionary and calculate a character string matching score showing a degree of matching of said recognized character string with respect to each of the character strings of said search target words, to acquire both a character string of a search target word having a highest character string matching score and this character string matching score for each of said recognized character strings; and a search result determinator to calculate a total score as a weighted sum of two or more of said character string matching score acquired by said character string comparator, and the acoustic likelihood and the language likelihood acquired by said recognizer, and output, as a search result, one or more search target words in descending order of calculated total scores. 2 . (canceled) 3 . The speech search device according to claim 1 , wherein said speech search device comprises an acoustic likelihood calculator to refer to a high-accuracy acoustic model having a higher degree of recognition accuracy than said acoustic model which is referred to by said recognizer, and perform an acoustic pattern comparison between the recognized character string for each of said plurality of language models, the recognized character string being acquired by said recognizer, and said input speech, to calculate a comparison acoustic likelihood, and wherein said recognizer acquires a language likelihood of said recognized character string, and said search result determinator calculates a total score as a weighted sum of two or more of the character string matching score acquired by said character string comparator, the comparison acoustic likelihood calculated by said acoustic likelihood calculator, and the language likelihood acquired by said recognizer, and outputs, as a search result, one or more search target words in descending order of calculated total scores. 4 . The speech search device according to claim 1 , wherein said speech search device classifies said plurality of language models into two or more groups, and assigns a recognition process performed by said recognizer to each of said two or more groups. 5 . A speech search device comprising: a recognizer to refer to an acoustic model and at least one language model and perform voice recognition on an input speech, to acquire an acoustic likelihood and a language likelihood of a recognized character string for each of said one or more language models; a character string dictionary storage to store a character string dictionary in which pieces of information showing character strings of search target words each serving as a target for speech search are stored; a character string comparator to acquire an external recognized character string which is acquired by, in an external device, referring to an acoustic model and a language model having learning data different from that of the one or more language models which are referred to by said recognizer, and performing voice recognition on said input speech, compare the external recognized character string acquired thereby and the recognized character string acquired by said recognizer with the character strings of the search target words stored in said character string dictionary, and calculate character string matching scores showing degrees of matching of said external recognized character string and said recognized character string with respect to each of the character strings of said search target words, to acquire both a character string of a search target word having a highest character string matching score and this character string matching score for each of said external recognized character string and said recognized character string; and a search result determinator to calculate a total score as a weighted sum of two or more of said character string matching score acquired by said character string comparator, and the acoustic likelihood and the language likelihood of said recognized character string which are acquired by said recognizer, and an acoustic likelihood and a language likelihood of said external recognized character string which are acquired from said external device, and output, as a search result, one or more search target words in descending order of calculated total scores. 6 . (canceled) 7 . The speech search device according to claim 5 , wherein said speech search device comprises an acoustic likelihood calculator to refer to a high-accuracy acoustic model having a higher degree of recognition accuracy than said acoustic model which is referred to by said recognizer, and perform an acoustic pattern comparison between the recognized character string acquired by said recognizer and the external recognized character string acquired by the external device, and said input speech, to calculate a comparison acoustic likelihood, and wherein said recognizer acquires a language likelihood of said recognized character string, and said search result determinator calculates a total score as a weighted sum of two or more of the character string matching score acquired by said character string comparator, the comparison acoustic likelihood calculated by said acoustic likelihood calculator, the language likelihood of said recognized character string which is acquired by said recognizer, and a language likelihood of said external recognized character string which is acquired from said external device, and outputs, as a search result, one or more search target words in descending order of calculated total scores. 8 . A speech search method comprising the steps of: in a recognizer, referring to an acoustic model and a plurality of language models having different learning data and performing voice recognition on an input speech, to acquire an acoustic likelihood and a language likelihood of a recognized character string for each of said plurality of language models; in a character string comparator, comparing the recognized character string for each of said plurality of language models with character strings of search target words each serving as a target for speech search, the character strings being stored in a character string dictionary, and calculating a character string matching score showing a degree of matching of said recognized character string with respect to each of the character strings of said search target words, to acquire both a character string of a search target word having a highest character string matching score and this character string matching score for each of said recognized character strings; and in a search result determinator, calculating a total score as a weighted sum of two or more of said character string matching score, and said acoustic likelihood and said language likelihood, and outputting, as a search result, one or more search target words in descending order of calculated total scores.

Assignees

Inventors

Classifications

  • for retrieval · CPC title

  • Speech to text systems (G10L15/08 takes precedence) · CPC title

  • using phonetics · CPC title

  • Calculation of difference between files · CPC title

  • using context dependencies, e.g. language models · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2016336007A1 cover?
Disclosed is a speech search device including a recognizer 2 that refers to an acoustic model and language models having different learning data and performs voice recognition on an input speech, to acquire a recognized character string for each language model, a character string comparator 6 that compares the recognized character string for each language models with the character strings o…
Who is the assignee on this patent?
Mitsubishi Electric Corp
What technology area does this patent fall under?
Primary CPC classification G10L15/10. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Nov 17 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).