Method of correcting dictionary, program for correcting dictionary, voice processing apparatus, and robot

US10636415B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10636415-B2
Application numberUS-201715791447-A
CountryUS
Kind codeB2
Filing dateOct 24, 2017
Priority dateOct 31, 2016
Publication dateApr 28, 2020
Grant dateApr 28, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method is provided for correcting a dictionary used in a voice processing apparatus. The method includes first extracting a speech of a target speaker from audio collected by a microphone, and estimating a speech phonemic sequence configuring the speech. The method also includes calculating a match degree, using a first dictionary, between the speech phonemic sequence and a first phonemic sequence that corresponds to a first word registered in the first dictionary, and second extracting the first word corresponding to a highest match degree as a spoken word spoken by the target speaker. The method further includes first correcting a second dictionary based on the highest match degree, the second dictionary indicating a relation between a second word and a third word, and second correcting the second dictionary by correcting the relation between the third word matching the spoken word and the second word.

First claim

Opening claim text (preview).

What is claimed is: 1. A method of correcting a dictionary used in a voice processing apparatus, the method comprising: first extracting a speech of a target speaker from audio collected by a microphone corresponding to the voice processing apparatus; estimating a speech phonemic sequence configuring the speech; calculating a match degree, using a first dictionary, between the speech phonemic sequence and a first phonemic sequence that corresponds to a first word registered in the first dictionary, the first dictionary being stored in a memory included in the voice processing apparatus and indicating a corresponding relation between the first word and the first phonemic sequence; second extracting, from the first dictionary, the first word corresponding to a highest match degree as a spoken word spoken by the target speaker; first correcting a second dictionary based on the highest match degree, the second dictionary indicating a corresponding relation between a second word and a third word, the third word corresponding to a language unit spoken by the target speaker intending to speak the second word; and second correcting the second dictionary by correcting the corresponding relation between the third word matching the spoken word and the second word that corresponds to the third word. 2. The method according to claim 1 , wherein when the spoken word is included in the second dictionary and the highest match degree is at least a first threshold value, the second dictionary is corrected by deleting, from the second dictionary, a combination of the second word matching the spoken word and the third word that corresponds to the second word. 3. The method according to claim 1 , wherein the second dictionary further includes the corresponding relation between the second word and frequency of the target speaker speaking the third word which matches the spoken word, and wherein when the spoken word is registered in the second dictionary as the second word, the highest match degree is at least the first threshold value, and the frequency of the third word that corresponds to the second word which matches the spoken word is at or less than a second threshold value, the second dictionary is corrected by deleting, from the second dictionary, the combination of the second word matching the spoken word and the third word that corresponds to the second word. 4. The method according to claim 1 , wherein the target speaker is a target child who is an object of voice recognition in the voice processing apparatus, wherein whether or not the speaker is the target child is determined by comparing a voiceprint extracted from the speech and the voiceprint of the target child previously recorded in the memory, and wherein the speech phonemic sequence is estimated when the speaker is determined to be the target child. 5. The method according to claim 1 , wherein the memory further stores a history table indicating the corresponding relation between the first word and the match degree, and wherein when a process of recording the first word corresponding to the highest match degree in association with the match degree in the history table is repeated a predetermined number of times, the second dictionary is corrected, based on the match degree in the history table, by correcting the corresponding relation between the second word and the third word registered in the second dictionary. 6. A non-transitory computer readable storage medium encoded with an executable computer program for correcting a dictionary used in a voice processing apparatus, that when executed by a processor causes the processor to perform operations, the operations comprising: first extracting a speech of a target speaker from audio collected from a microphone corresponding to the voice processing apparatus; estimating a speech phonemic sequence configuring the speech; calculating a match degree, using a first dictionary, between the speech phonemic sequence and a first phonemic sequence that corresponds to a first word registered in the first dictionary, the first dictionary being stored in a memory included in the voice processing apparatus and indicating a corresponding relation between the first word and the first phonemic sequence; second extracting, from the first dictionary, the first word corresponding to a highest match degree as a spoken word spoken by the target speaker; first correcting a second dictionary based on the highest match degree, the second dictionary indicating a corresponding relation between a second word and a third word, the third word corresponding to a language unit spoken by the target speaker intending to speak the second word; and second correcting the second dictionary by correcting the corresponding relation between the third word matching the spoken word and second word that corresponds to the third word. 7. A voice processing apparatus, comprising: a processor; and a memory, wherein the processor executes a computer program to perform operations, the operations including first extracting a speech of a target speaker from audio collected from a microphone corresponding to the voice processing apparatus; estimating a speech phonemic sequence configuring the speech; calculating a match degree, using a first dictionary, between the speech phonemic sequence and a first phonemic sequence that corresponds to a first word registered in the first dictionary, the first dictionary being stored in a memory and indicating a corresponding relation between the first word and the first phonemic sequence; second extracting, from the first dictionary, the first word corresponding to a highest match degree as a spoken word spoken by the target speaker; first correcting a second dictionary based on the highest match degree, the second dictionary indicating the corresponding relation between a second word and a third word, the third word corresponding to a language unit spoken by the target speaker intending to speak the second word; and second correcting the second dictionary by correcting the corresponding relation between the third word matching the spoken word and the second word that corresponds to the third word. 8. A robot, comprising: a voice processing apparatus according to claim 7 ; a casing in which the voice processing apparatus is incorporated; and a displacement mechanism displacing the casing.

Assignees

Inventors

Classifications

  • G10L15/063Primary

    Training · CPC title

  • G10L15/187Primary

    Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams · CPC title

  • using lexical or orthographic knowledge sources · CPC title

  • G10L15/02Primary

    Feature extraction for speech recognition; Selection of recognition unit · CPC title

  • Phonemes, fenemes or fenones being the recognition units · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10636415B2 cover?
A method is provided for correcting a dictionary used in a voice processing apparatus. The method includes first extracting a speech of a target speaker from audio collected by a microphone, and estimating a speech phonemic sequence configuring the speech. The method also includes calculating a match degree, using a first dictionary, between the speech phonemic sequence and a first phonemic seq…
Who is the assignee on this patent?
Panasonic Ip Man Co Ltd
What technology area does this patent fall under?
Primary CPC classification G10L15/063. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 28 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).