Sound processing device and sound processing method

US9542937B2 · US · B2

Patent metadata
Field               Value
Publication number  US-9542937-B2
Application number  US-201414148813-A
Country             US
Kind code           B2
Filing date         Jan 7, 2014
Priority date       Jan 15, 2013
Publication date    Jan 10, 2017
Grant date          Jan 10, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A sound processing device includes a noise suppression unit configured to suppress a noise component included in an input sound signal, an auxiliary noise addition unit configured to add auxiliary noise to the input sound signal, whose noise component has been suppressed by the noise suppression unit, to generate an auxiliary noise-added signal, a distortion calculation unit configured to calculate a degree of distortion of the auxiliary noise-added signal, and a control unit configured to control an addition amount by which the auxiliary noise addition unit adds the auxiliary noise based on the degree of distortion calculated by the distortion calculation unit.
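The four units named in the abstract can be sketched as a small pipeline. This is a minimal illustration under stated assumptions, not the patented implementation: the noise gate in `suppress_noise`, the Gaussian white noise in `add_auxiliary_noise`, the candidate search in `control_addition_amount`, and the `target` distortion value are all hypothetical stand-ins, and the distortion measure is passed in by the caller because the abstract does not fix one.

```python
import random
from typing import Callable, List, Sequence, Tuple

Signal = List[float]


def suppress_noise(x: Sequence[float], gate: float) -> Signal:
    """Stand-in noise suppressor (assumption): zero samples whose magnitude is below `gate`."""
    return [s if abs(s) >= gate else 0.0 for s in x]


def add_auxiliary_noise(y: Sequence[float], amount: float, seed: int = 0) -> Signal:
    """Add white Gaussian noise scaled by `amount` (assumed form of the auxiliary noise)."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    return [s + amount * rng.gauss(0.0, 1.0) for s in y]


def control_addition_amount(
    x: Sequence[float],
    distortion: Callable[[Sequence[float], Sequence[float]], float],
    target: float,
    candidates: Sequence[float],
    gate: float = 0.1,
) -> Tuple[float, Signal]:
    """Pick the candidate amount whose distortion is closest to `target`.

    `distortion(original, noise_added)` is supplied by the caller; the search
    over a fixed candidate list is a simplification chosen for this sketch.
    """
    y = suppress_noise(x, gate)

    def gap(amount: float) -> float:
        return abs(distortion(x, add_auxiliary_noise(y, amount)) - target)

    best = min(candidates, key=gap)
    return best, add_auxiliary_noise(y, best)
```

A caller would supply its own distortion measure, for example a mean absolute difference between the original and the noise-added signal, and a list of candidate addition amounts such as `[0.0, 0.1, 0.5]`.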

First claim

Opening claim text (preview).

What is claimed is:

1. A sound processing device, comprising: at least one processor; and at least one memory including computer program code, wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the sound processing device to suppress a noise component included in an input sound signal, add auxiliary noise to the input sound signal, whose noise component has been suppressed by the noise suppression unit, to generate an auxiliary noise-added signal, calculate a degree of distortion of the auxiliary noise-added signal, estimate a speech recognition rate corresponding to the degree of distortion, calculate a kurtosis ratio which is a ratio of a kurtosis of the auxiliary noise-added signal to a kurtosis of the input sound signal as the degree of distortion, determine an addition amount based on the kurtosis ratio as an index value indicating the degree of distortion, calculate a power spectrum based on the noise component, calculate a complex noise-removed spectrum by subtracting the noise power from the power spectrum, transform the complex noise-removed spectrum into the input sound signal whose noise component has been suppressed, calculate a differential addition amount which is a difference between the determined addition amount and an ideal addition amount in which the speech recognition rate is the highest, control the addition amount, by which the auxiliary noise is added to the input sound signal whose noise component has been suppressed to maximize a speech recognition rate based on the kurtosis ratio, by using the differential addition amount, and perform a speech recognition process on the auxiliary noise-added signal.

2. The sound processing device according to claim 1, wherein the at least one memory and the computer program code are further configured to, with the at least one processor, cause the sound processing device to estimate the speech recognition rate based on the degree of distortion of the auxiliary noise-added signal generated by suppressing the noise component with at least two types of suppression amounts, to select the suppression amount with which the estimated speech recognition rate is maximized, and to cause the noise suppression unit to suppress the noise component with the selected suppression amount.

3. The sound processing device according to claim 2, wherein the at least one memory and the computer program code are further configured to, with the at least one processor, cause the sound processing device to control the addition amount of the auxiliary noise so as to maximize the estimated speech recognition rate with the selected suppression amount.

4. The sound processing device according to claim 1, wherein the at least one memory and the computer program code are further configured to, with the at least one processor, cause the sound processing device to calculate the degree of distortion for each component of the auxiliary noise-added signal, and perform the speech recognition process so that the larger the degree of distortion of a component becomes, the smaller an influence of the component becomes.

5. A sound processing method, comprising: detecting a noise component included in an input sound signal and suppressing the noise component detected from the input sound signal; adding auxiliary noise to the input sound signal, whose noise component has been suppressed in the noise suppression step, to generate an auxiliary noise-added signal; calculating a degree of distortion of the auxiliary noise-added signal; estimating a speech recognition rate corresponding to the degree of distortion; calculating a kurtosis ratio which is a ratio of a kurtosis of the auxiliary noise-added signal to a kurtosis of the input sound signal as the degree of distortion; determining an addition amount based on the kurtosis ratio as an index value indicating the degree of distortion; calculating a power spectrum based on the noise component; calculating a complex noise-removed spectrum by subtracting the noise power from the power spectrum; transforming the complex noise-removed spectrum into the input sound signal whose noise component has been suppressed; calculating a differential addition amount which is a difference between the determined addition amount and an ideal addition amount in which the speech recognition rate is the highest; controlling the addition amount, by which the auxiliary noise is added to the input sound signal whose noise component has been suppressed to maximize a speech recognition rate based on the kurtosis ratio, by using the differential addition amount; and performing a speech recognition process on the auxiliary noise-added signal.
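Three of the computations recited in the claims (subtracting noise power from a power spectrum, the kurtosis ratio, and the differential addition amount) can be illustrated in a few lines. The following is a rough sketch under assumptions that the claim text does not settle: kurtosis is taken as the fourth central moment of the time-domain samples divided by the squared variance, the per-bin noise power estimate is supplied by the caller, negative power after subtraction is floored at zero, and the differential amount is taken as determined minus ideal. All function names are hypothetical.

```python
import cmath
import math
from typing import List, Sequence


def dft(x: Sequence[float]) -> List[complex]:
    """Naive O(n^2) discrete Fourier transform, adequate for short frames."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft(spec: Sequence[complex]) -> List[float]:
    """Inverse DFT; only the real part is returned."""
    n = len(spec)
    return [(sum(spec[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n).real
            for t in range(n)]


def spectral_subtract(x: Sequence[float], noise_power: Sequence[float]) -> List[float]:
    """Subtract a per-bin noise power estimate from the frame's power spectrum.

    The input phase is kept, the cleaned power is floored at zero (assumption),
    and the complex noise-removed spectrum is transformed back to the time domain.
    """
    cleaned_spec = []
    for bin_value, n_pow in zip(dft(x), noise_power):
        power = abs(bin_value) ** 2
        cleaned = max(power - n_pow, 0.0)
        gain = math.sqrt(cleaned / power) if power > 0.0 else 0.0
        cleaned_spec.append(gain * bin_value)
    return idft(cleaned_spec)


def kurtosis(x: Sequence[float]) -> float:
    """Fourth central moment over squared variance; assumes a non-constant signal."""
    n = len(x)
    mean = sum(x) / n
    m2 = sum((s - mean) ** 2 for s in x) / n
    m4 = sum((s - mean) ** 4 for s in x) / n
    return m4 / (m2 * m2)


def kurtosis_ratio(noise_added: Sequence[float], original: Sequence[float]) -> float:
    """Kurtosis of the auxiliary noise-added signal over kurtosis of the input signal."""
    return kurtosis(noise_added) / kurtosis(original)


def differential_addition_amount(determined: float, ideal: float) -> float:
    """Difference between the determined and the ideal amount (sign convention assumed)."""
    return determined - ideal
```

With these definitions, a kurtosis ratio of 1 means the processed signal has the same kurtosis as the input. How the ratio maps to an addition amount, and how the ideal amount is obtained, are not specified in the text shown here, so both are left as inputs.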

Assignees

Honda Motor Co Ltd

Inventors

Classifications

  • G10L15/20 (Primary)

    Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech (G10L21/02 takes precedence)

  • Noise filtering

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9542937B2 cover?
A sound processing device includes a noise suppression unit configured to suppress a noise component included in an input sound signal, an auxiliary noise addition unit configured to add auxiliary noise to the input sound signal, whose noise component has been suppressed by the noise suppression unit, to generate an auxiliary noise-added signal, a distortion calculation unit configured to calcu…
Who is the assignee on this patent?
Honda Motor Co Ltd
What technology area does this patent fall under?
Primary CPC classification G10L15/20. Mapped technology areas include Physics.
When was this patent published?
Publication date: Jan 10, 2017 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).