Packet loss concealment for bandwidth extension of speech signals

US9280978B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9280978-B2
Application numberUS-201313851245-A
CountryUS
Kind codeB2
Filing dateMar 27, 2013
Priority dateMar 27, 2012
Publication dateMar 8, 2016
Grant dateMar 8, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Disclosed is a speech receiving apparatus. A low-band PLC module and a synthesis filter reconstructs a low-band speech signal of a lost frame from a previous good frame. A high-band PLC module reconstructs a high-band speech signal of the lost frame from the previous good frame. A transforming part transforms the low-band speech signal into a frequency range. A bandwidth extending part generates at least an extended MDCT coefficient as information for the high-band speech signal from the low-band speech signal transformed by the transforming part. A smoothing part smoothes the extended MDCT coefficient. An inverse transforming part inversely transforms the extended MDCT coefficient smoothed by the smoothing part to a time domain. A synthesizing part synthesizes the low-band speech signal, and the high-band speech signal which is inverse-transformed by the inverse transforming part and reconstructed, to output a wideband speech signal.

First claim

Opening claim text (preview).

What is claimed is: 1. A speech receiving apparatus comprising: a low-band packet loss concealment (PLC) module and a synthesis filter reconstructing a low-band speech signal of a lost frame from a previous good frame; a high-band PLC module reconstructing a high-band speech signal of the lost frame from the previous good frame; a transforming part transforming the low-band speech signal to a frequency domain; a bandwidth extending part generating at least an extended modified discrete cosine transform (MDCT) coefficient as information for the high-band speech signal from the low-band speech signal transformed by the transforming part; a smoothing part smoothing the extended MDCT coefficient; an inverse transforming part inversely transforming the extended MDCT coefficient smoothed by the smoothing part to a time domain; and a synthesizing part synthesizing the low-band speech signal, and the high-band speech signal that is inverse-transformed by the inverse transforming part and reconstructed, to output a wideband speech signal; wherein the bandwidth extending part performs spectral folding of low-band MDCT coefficients to generate at least a part of the extended MDCT coefficients. 2. The speech receiving apparatus of claim 1 , wherein the bandwidth extending part comprises at least two processing parts generating the extended MDCT coefficient by a different process according to the frequency range. 3. The speech receiving apparatus of claim 1 , wherein the bandwidth extending part comprises a spectral folding part and a spectral smoothing part, generating at least a part of the extended MDCT coefficients by folding and smoothing the MDCT coefficients of the low-band speech signal. 4. The speech receiving apparatus of claim 1 , wherein the bandwidth extending part comprises a voiced/unvoiced speech determining part utilizing the MDCT coefficients of the low-band speech signal by different processes according to a voiced or unvoiced speech. 5. The speech receiving apparatus of claim 4 , wherein the bandwidth extending part comprises a voiced speech processing part performing a harmonic spectral folding when an input speech is determined to be the voiced speech by the voiced/unvoiced speech determining part. 6. The speech receiving apparatus of claim 4 , wherein the bandwidth extending part comprises an unvoiced speech processing part performing a spectral folding of a high autocorrelation section from the low band when an input speech is determined to be the unvoiced speech by the voiced/unvoiced speech determining part. 7. The speech receiving apparatus of claim 4 , wherein the voiced/unvoiced speech determining part determines the voiced or unvoiced speech according to a tilt of a spectral tilt parameter. 8. The speech receiving apparatus of claim 1 , wherein, in the bandwidth extending part, the extended MDCT coefficient for a second frequency range is generated by folding the MDCT coefficient of the low-band speech signal, the extended MDCT coefficient for a third frequency range higher than the second frequency range is generated by folding and smoothing the MDCT coefficient of the low-band speech signal, the extended MDCT coefficient for a first frequency range lower than the second frequency range is generated by differently processing the MDCT coefficient of the low-band speech signal according to whether an input speech is a voiced or unvoiced speech. 9. The speech receiving apparatus of claim 8 , wherein the first frequency range is 4-4.6 kHz, the second frequency range is 4.6-5.5 kHz, and the third frequency range is 5.5-7 kHz. 10. The speech receiving apparatus of claim 1 , wherein the bandwidth extending part comprises a shaping part shaping the extended MDCT coefficient that is generated by a different process according to the frequency range and then synthesized. 11. A speech receiving method comprising: reconstructing a low-band speech signal of a lost frame from a previous good frame; transforming the reconstructed low-band speech signal to a frequency domain to provide a low-band modified discrete cosine transform (MDCT) coefficient; processing the low-band MDCT coefficient by different methods according to the frequency ranges of the high band, which are classified into at least two cases, to provide an extended MDCT coefficient of a high-band speech signal; inversely transforming the extended MDCT coefficient to a time domain to reconstruct the high-band speech signal; and synthesizing the reconstructed high-band speech signal and the low-band speech signal; wherein a second frequency range that is a part of the extended MDCT coefficients is obtained by folding the low-band MDCT coefficient. 12. The speech receiving method of claim 11 , prior to the reconstructing of the high-band speech signal, further comprising smoothing the high-band extended MDCT coefficient using the high-band MDCT coefficient reconstructed in the previous good frame in order to inhibit the high-band extended MDCT coefficients from being abruptly changed. 13. The speech receiving method of claim 11 , wherein a third frequency range that is a part of the extended MDCT coefficients and is higher than the second frequency range is obtained by folding and smoothing the low-band MDCT coefficient. 14. The speech receiving method of claim 11 , wherein a third frequency range that is a part of the extended MDCT coefficients utilizes the low-band MDCT coefficient by using different methods according to whether an input speech is a voiced or unvoiced speech. 15. The speech receiving method of claim 14 , wherein, when the input speech is the voiced speech, the extended MDCT coefficient is obtained by using the low-band MDCT coefficient by a harmonic spectral replication method. 16. The speech receiving method of claim 14 , wherein, when the input speech is the unvoiced speech, the extended MDCT coefficient is obtained by using the low-band MDCT coefficient by an autocorrelation spectral replication method.

Assignees

Inventors

Classifications

  • Details of processing therefor · CPC title

  • G10L19/005Primary

    Correction of errors induced by the transmission channel, if related to the coding algorithm · CPC title

  • using orthogonal transformation · CPC title

  • G10L19/02Primary

    using spectral analysis, e.g. transform vocoders or subband vocoders · CPC title

  • H04M11/06Primary

    Simultaneous speech and data transmission, e.g. telegraphic transmission over the same conductors · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9280978B2 cover?
Disclosed is a speech receiving apparatus. A low-band PLC module and a synthesis filter reconstructs a low-band speech signal of a lost frame from a previous good frame. A high-band PLC module reconstructs a high-band speech signal of the lost frame from the previous good frame. A transforming part transforms the low-band speech signal into a frequency range. A bandwidth extending part generate…
Who is the assignee on this patent?
Kwangju Inst Sci & Tech
What technology area does this patent fall under?
Primary CPC classification G10L19/005. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Mar 08 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).