Adaptive phase difference based noise reduction for automatic speech recognition (ASR)

US9449594B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9449594-B2
Application numberUS-201314124790-A
CountryUS
Kind codeB2
Filing dateSep 17, 2013
Priority dateSep 17, 2013
Publication dateSep 20, 2016
Grant dateSep 20, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Embodiments of a system and method for adapting a phase difference-based noise reduction system are generally described herein. In some embodiments, spatial information associated with a first and second audio signal are determined, wherein the first and second audio signals including a target audio inside a beam and noise from outside the beam. A signal-to-noise ratio (SNR) associated with the audio signals is estimated. A mapping of phase differences to gain factors is adapted for determination of attenuation factors for attenuating frequency bins associated with noise outside the beam. Spectral subtraction is performed to remove estimated noise from the single-channel signal based on a weighting that affects frequencies associated with a target signal less. Frequency dependent attenuation factors are applied to attenuate frequency bins outside the beam to produce a target signal having noise reduced.

First claim

Opening claim text (preview).

What is claimed is: 1. An adaptive phase difference-based noise reduction system, comprising: a first channel for receiving a first audio signal at a first microphone; a second channel for receiving a second audio signal at a second microphone, the first and second audio signals including a target audio inside a beam and noise from outside the beam; a processor, coupled to the first and second channel, the processor arranged to: determine spatial information associated with the first audio signal and with the second audio signal; adjust phase differences by applying an adjustment for a phase difference greater than π equal to −2π+ the phase difference greater than π and applying an adjustment for a phase difference less than −π equal to 2π+the phase difference less than −π; to estimate a signal-to-noise ratio (SNR) on a single-channel signal derived from the first and second audio signals using the determined spatial information; adapt a mapping of the adjusted phase differences to gain factors for determination of frequency dependent attenuation factors for attenuating frequency bins associated with noise outside the beam, wherein to adapt the mapping of the adjusted phase differences to gain factors includes: calculating phase difference errors for the frequency bins to be respectively equal to an absolute value of a phase difference divided by a phase-threshold vector component and subtracting a beam factor, components of a phrase-threshold vector corresponding to the frequency bins; and clamping negative phase difference error values to zero to prevent frequency bins inside an inner portion of the beam from being attenuated; and apply the frequency dependent attenuation factors to attenuate frequency bins outside the beam to produce a target signal having noise reduced. 2. The adaptive phase difference-based noise reduction system of claim 1 , wherein the processor is further arranged to perform spectral subtraction on the single-channel signal derived from the first and second audio signals to remove estimated noise from the single-channel signal based on one selected from a group consisting of a weighting that affects frequencies associated with a target signal less and the estimated SNR. 3. The adaptive phase difference-based noise reduction system of claim 1 , wherein the processor determines spatial information by adjusting the phase difference with a computed offset to allow beam steering. 4. The adaptive phase difference-based noise reduction system of claim 1 , wherein the processor calculates the phase difference error by scaling phase differences to match time differences of arrival between the first audio signal of the first channel and the second audio signal of the second channel for each frequency bin. 5. The adaptive phase difference-based noise reduction system of claim 1 , wherein the processor estimates the signal-to-noise ratio (SNR) on the single-channel signal derived from the first and second audio signals using the determined spatial information by estimating a current signal-to-noise-ratio (SNR) based on the calculated phase difference error for differentiating frequency bins from inside the beam from frequency bins outside the beam, and the single-channel signal. 6. The adaptive phase difference-based noise reduction system of claim 1 , wherein the processor is further arranged to downmix the first and second audio signals of the first and second channels to derive the single-channel signal and to estimates the SNR by computing a ratio of noise energy outside of the beam and speech energy inside the beam. 7. The adaptive phase difference-based noise reduction system of claim 6 , wherein the processor computes the ratio of noise energy outside of the beam and speech energy inside the beam by determining magnitudes of frequencies outside of the beam when a phase difference error is determined to be greater than zero, calculating a weighted time average of the magnitudes of frequencies outside of the beam, determining magnitudes of frequencies inside of the beam, estimating an instantaneous signal energy inside the beam and calculating a weighted time average of the estimation of the instantaneous signal energy inside the beam. 8. The adaptive phase difference-based noise reduction system of claim 1 , wherein the processor adapts the mapping of phase differences to gain factors for determination of attenuation factors by computing attenuation factors based on a predetermined parameter, the estimated SNR and a calculated phase difference error for each of the frequency bins. 9. The adaptive phase difference-based noise reduction system of claim 1 , wherein the processor is further arranged to calculate phase difference errors between more than two channels and combining attenuation factors obtained for each channel pair. 10. The adaptive phase difference-based noise reduction system of claim 1 , wherein the processor derives the single-channel signal by processing more than two input audio signals of more than two channels to generate the single-channel signal. 11. A method for adapting a phase difference-based noise reduction system, comprising determining spatial information associated with a first audio signal received in a first channel from a first microphone and with a second audio signal received in a second channel from a second microphone, the first and second audio signals including a target audio inside a beam and noise from outside the beam; adjusting phase differences by applying an adjustment for a phase difference greater than π equal to −2π+ the phase difference greater than π and applying an adjustment for a phase difference less than π equal to 2π+ the phase difference less than −π; estimating a signal-to-noise ratio (SNR) on a single-channel signal derived from the first and second audio signals using the determined spatial information; adapting a mapping of the adjusted phase differences to gain factors for determination of frequency dependent attenuation factors for attenuating frequency bins associated with noise outside the beam, wherein adapting the mapping from the phase differences to gain factors includes: calculating phase difference errors for the frequency bins to be respectively equal to an absolute value of a phase difference divided by a phase-threshold vector component and subtracting a beam factor, components of a phase-threshold vector corresponding to the frequency bins; and clamping negative phase difference error values to zero to prevent frequency bins inside an inner portion of the beam from being attenuated; and applying the frequency dependent attenuation factors to attenuate frequency bins outside the beam to produce a target signal having noise reduced. 12. The method of claim 11 further comprises performing spectral subtraction on a single-channel signal derived from the first and second audio signals to remove estimated noise from the single-channel signal based on a weighting that affects frequencies associated with a target signal less, wherein the performing spectral subtraction on a single-channel signal further comprises subtracting noise from the single channel signal based on the estimated SNR. 13. The method of claim 11 , wherein the determining spatial information further comprises adjusting the phase difference with a computed offset to allow beam steering. 14. The method of claim 11 , wherein the determining the phase difference errors further comprises scaling phase differences to match time differences of arrival between the first audio signal of the first channel and the second audio signal of the second channel for each frequency bin and where

Assignees

Inventors

Classifications

  • Microphone arrays; Beamforming · CPC title

  • the noise being separate speech, e.g. cocktail party · CPC title

  • for combining the signals of two or more microphones (specially adapted for hearing aids H04R25/407) · CPC title

  • audio processing specific to telephonic conferencing, e.g. spatial distribution, mixing of participants (echo suppression in two-way loud-speaking telephone systems H04M9/02; sound field processing per se H04S7/30) · CPC title

  • Processing in the frequency domain · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9449594B2 cover?
Embodiments of a system and method for adapting a phase difference-based noise reduction system are generally described herein. In some embodiments, spatial information associated with a first and second audio signal are determined, wherein the first and second audio signals including a target audio inside a beam and noise from outside the beam. A signal-to-noise ratio (SNR) associated with the…
Who is the assignee on this patent?
Intel Corp
What technology area does this patent fall under?
Primary CPC classification G10L21/0232. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Sep 20 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).