Split-domain speech signal enhancement

US10741192B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10741192-B2
Application numberUS-201815973214-A
CountryUS
Kind codeB2
Filing dateMay 7, 2018
Priority dateMay 7, 2018
Publication dateAug 11, 2020
Grant dateAug 11, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method and an apparatus for estimating speech signal in split-domain is disclosed. The method includes performing LP analysis on a noisy speech signal to generate a first plurality of LPC and a first residual signal. The method also includes estimating speech LPC spectrum to generate cleaned LPC. The method further includes estimating speech residual spectrum to generate cleaned residual signal. The method also includes synthesizing output signals based on the cleaned LPC and the cleaned residual signal.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for estimating speech signal at an electronic device, the method comprising: receiving, at a microphone, input signals, wherein the input signals include at least a noise signal component and a speech signal component; determining, by the electronic device, whether to perform a first filtering operation based on a characteristic of the input signals; performing, by the electronic device, the first filtering operation on a first portion of the input signals to generate a plurality of first linear predictive filter coefficients (LPC) and a first residual signal; calculating, by the electronic device, frequency response of the plurality of the first LPC to generate a first magnitude spectrum and a first phase spectrum, wherein the first magnitude spectrum corresponds to magnitude component of the frequency response and the first phase spectrum corresponds to phase component of the frequency response; converting, by the electronic device, the first residual signal into frequency-domain signal to generate a second magnitude spectrum and a second phase spectrum, wherein the second magnitude spectrum corresponds to magnitude component of the first residual signal in frequency domain and the second phase spectrum corresponds to phase component of the first residual signal in frequency domain; estimating, by the electronic device, a third magnitude spectrum based on the first magnitude spectrum, wherein the third magnitude spectrum corresponds to the speech signal component; estimating, by the electronic device, a fourth magnitude spectrum based on the second magnitude spectrum, wherein the fourth magnitude spectrum corresponds to the speech signal component; and synthesizing output signals, by the electronic device, based on the third magnitude spectrum and the fourth magnitude spectrum. 2. The method of claim 1 , wherein synthesizing the output signals comprises: calculating, by the electronic device, a plurality of second linear predictive filter coefficients (LPC) based on the third magnitude spectrum; and performing, by the electronic device, a second filtering operation based at least in part on the plurality of the second LPC to generate the output signals. 3. The method of claim 2 , wherein synthesizing the output signals comprises converting, by the electronic device, the fourth magnitude spectrum into time-domain signal to generate a second residual signal, wherein the second filtering operation to generate the output signals is based on the second residual signal. 4. The method of claim 1 , wherein estimating the third magnitude spectrum is based on one among a non-negative matrix factorization technique and a neural network based technique. 5. The method of claim 1 , wherein estimating the fourth magnitude spectrum is based on one among a non-negative matrix factorization technique and a neural network based technique. 6. The method of claim 1 , wherein estimating the third magnitude spectrum comprises estimating a plurality of weights based at least on one among a speech dictionary and a noise dictionary trained in linear predictive filter coefficients (LPC) domain. 7. The method of claim 1 , wherein estimating the fourth magnitude spectrum comprises estimating a plurality of weights based at least on one among a speech dictionary and a noise dictionary trained in residual signal domain. 8. The method of claim 7 , wherein at least one weight of the plurality of weights is perceptually weighted or filtered to enhance periodicity. 9. The method of claim 2 , wherein calculating the plurality of the second LPC is further based on the first phase spectrum. 10. The method of claim 3 , wherein converting the fourth magnitude spectrum into time-domain signal is further based on the second phase spectrum. 11. The method of claim 2 , wherein the first filtering operation corresponds to linear predictive analysis filtering and the second filtering operation corresponds to linear predictive synthesis filtering. 12. The method of claim 6 , wherein estimating the third magnitude spectrum comprises: estimating a first plurality of weight vector based on the speech dictionary; and estimating a second plurality of weight vector based on the noise dictionary, wherein the third magnitude spectrum is based on the first plurality of weight vector. 13. The method of claim 6 , wherein estimating the fourth magnitude spectrum comprises: estimating a third plurality of weight vector based on the speech dictionary; and estimating a fourth plurality of weight vector based on the noise dictionary, wherein the fourth magnitude spectrum is based on the third plurality of weight vector. 14. An apparatus for estimating speech signal, comprising: a microphone configured to receive input signals, wherein the input signals include at least a noise signal component and a speech signal component; a memory configured to store the input signals; and a processor coupled to the memory, the processor configured to: perform a first filtering operation on a first portion of the input signals to generate a plurality of first linear predictive filter coefficients (LPC) and a first residual signal; calculate frequency response of the plurality of the first LPC to generate a first magnitude spectrum and a first phase spectrum, wherein the first magnitude spectrum corresponds to magnitude component of the frequency response and the first phase spectrum corresponds to phase component of the frequency response; convert the first residual signal into frequency-domain signal to generate a second magnitude spectrum and a second phase spectrum, wherein the second magnitude spectrum corresponds to magnitude component of the first residual signal in frequency domain and the second phase spectrum corresponds to phase component of the first residual signal in frequency domain; estimate a third magnitude spectrum based on the first magnitude spectrum, wherein the third magnitude spectrum corresponds to the speech signal component; estimate a fourth magnitude spectrum based on the second magnitude spectrum, wherein the fourth magnitude spectrum corresponds to the speech signal component; convert, based on the second phase spectrum, the fourth magnitude spectrum into time-domain signal to generate a second residual signal; and synthesize output signals based on the third magnitude spectrum and the second residual signal. 15. The apparatus of claim 14 , wherein the processor is further configured to determine whether to perform the first filtering operation based on a characteristic of the input signals. 16. The apparatus of claim 14 , wherein the processor is configured to synthesize the output signals based on a plurality of second linear predictive filter coefficients (LPC) that is based on the third magnitude spectrum. 17. The apparatus of claim 14 , wherein the processor is configured to estimate the third magnitude spectrum based on one among a non-negative matrix factorization technique and a neural network based technique. 18. The apparatus of claim 14 , wherein the processor is configured to estimate the fourth magnitude spectrum based on one among a non-negative matrix factorization technique and a neural network based technique. 19. The apparatus of claim 14 , wherein the processor is further configured to estimate a plurality of weights based at least on one among a speech dictionary and a noise dictionary trained in linear predictive filter coefficients (LPC) domain. 20. The apparatus of claim 14 , wher

Assignees

Inventors

Classifications

  • Pitch determination of speech signals · CPC title

  • the extracted parameters being prediction coefficients · CPC title

  • Noise filtering · CPC title

  • Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters · CPC title

  • using neural networks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10741192B2 cover?
A method and an apparatus for estimating speech signal in split-domain is disclosed. The method includes performing LP analysis on a noisy speech signal to generate a first plurality of LPC and a first residual signal. The method also includes estimating speech LPC spectrum to generate cleaned LPC. The method further includes estimating speech residual spectrum to generate cleaned residual sign…
Who is the assignee on this patent?
Qualcomm Inc
What technology area does this patent fall under?
Primary CPC classification G10L21/0208. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 11 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).