Bandwidth extension method and apparatus using high frequency excitation signal and high frequency energy

US9666201B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9666201-B2
Application numberUS-201615068908-A
CountryUS
Kind codeB2
Filing dateMar 14, 2016
Priority dateSep 26, 2013
Publication dateMay 30, 2017
Grant dateMay 30, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present invention provides a bandwidth extension method and apparatus. The method includes: acquiring a bandwidth extension parameter, where the bandwidth extension parameter includes one or more of the following parameters: a linear predictive coefficient (LPC), a line spectral frequency (LSF) parameter, a pitch period, a decoding rate, an adaptive codebook contribution, and an algebraic codebook contribution; and performing, according to the bandwidth extension parameter, bandwidth extension on a decoded low-frequency signal, to obtain a high frequency band signal. The high frequency band signal recovered by using the bandwidth extension method and apparatus in the embodiments of the present invention is close to an original high frequency band signal, and the quality is satisfactory.

First claim

Opening claim text (preview).

The invention claimed is: 1. A decoder implemented bandwidth extension method, comprising: receiving a bit stream encoded from an audio signal; performing decoding operations on the bit stream, wherein a low frequency signal is generated via the decoding operations, wherein a collection of parameters is acquired via the decoding operations, and wherein the collection of parameters comprises one or more of the following parameters: a linear predictive coefficient (LPC), a set of line spectral frequency (LSF) parameters, a pitch period, a decoding rate, an adaptive codebook contribution, and an algebraic codebook contribution; predicting a high-frequency gain according to the LPC, and any one of or a combination of: a voicing factor, a noise gate factor, a spectrum tilt factor, and a classification parameter; predicting a high frequency excitation signal by selecting a frequency band from a low frequency excitation signal according to a difference value between the LSF parameters, wherein the low frequency excitation signal is represented by a sum of the adaptive codebook contribution and the algebraic codebook contribution; and generating a high frequency band signal from the high frequency excitation signal and the high frequency gain to recover the audio signal. 2. The method according to claim 1 , wherein the high frequency excitation signal is predicted according to the decoding rate. 3. The method according to claim 1 , wherein predicting the high-frequency gain comprise: computing an initial high-frequency gain according to the LPC; and correcting the initial high-frequency gain according to a first correction factor to obtain the high-frequency gain, wherein the first correction factor comprises one or more of the following parameters: a voicing factor, a noise gate factor, and a spectrum tilt factor. 4. The method according to claim 3 , wherein the first correction factor is determined according to the decoded low-frequency signal. 5. The method according to claim 3 , further comprising: correcting the high-frequency gain and the high frequency excitation signal according to a second correction factor; wherein the second correction factor comprises at least one of a classification parameter and a signal type. 6. The method according to claim 3 , wherein the high frequency excitation signal is based on a weighted combination of the predicted high frequency excitation signal and a random noise signal, wherein a weight of the weighted combination is determined according to a value of a classification parameter and/or a voicing factor of the decoded low-frequency signal. 7. The method according to claim 1 , wherein the high-frequency gain is corrected according to the pitch period. 8. The method according to claim 1 , wherein the generation of the high frequency band signal comprises: correcting the high frequency excitation signal by using the predicted high-frequency gain, and passing the corrected high frequency excitation signal through a LPC synthesis filter to obtain the high frequency band signal. 9. A bandwidth extension apparatus having a processor coupled to a memory storing instructions, wherein the processor executes the instructions to: receive a bit stream encoded from an audio signal; perform decoding operations on the bit stream, wherein a low frequency signal is generated via the decoding operations, wherein a collection of parameters is acquired via the decoding operations, and wherein the collection of parameters comprises one or more of the following parameters: a linear predictive coefficient (LPC), a set of line spectral frequency (LSF) parameters, a pitch period, a decoding rate, an adaptive codebook contribution, and an algebraic codebook contribution; predict a high-frequency gain according to the LPC, and any one of or a combination of: a voicing factor, a noise gate factor, a spectrum tilt factor, and a classification parameter; predict a high frequency excitation signal by selecting a frequency band from a low frequency excitation signal according to a difference value between the LSF parameters, wherein the low frequency excitation signal is represented by a sum of the adaptive codebook contribution and the algebraic codebook contribution; and generate a high frequency band signal from the high frequency excitation signal and the high frequency gain to recover the audio signal. 10. The apparatus according to claim 9 , wherein the high frequency excitation signal is predicted according to the decoding rate. 11. The apparatus according to claim 9 , wherein the processor is further configured to compute an initial high-frequency gain according to the LPC; and correct the initial high-frequency gain according to a first correction factor to obtain the high-frequency gain, wherein the first correction factor comprises one or more of the following parameters: a voicing factor, a noise gate factor, and a spectrum tilt factor. 12. The apparatus according to claim 11 , wherein the first correction factor is determined according to the decoded low-frequency signal. 13. The apparatus according to claim 11 , wherein the processor is further configured to correct the high-frequency gain and the high frequency excitation signal according to a second correction factor; wherein the second correction factor comprises at least one of a classification parameter and a signal type. 14. The apparatus according to claim 11 , wherein the high frequency excitation signal is based on a weighted combination of the predicted high frequency-excitation signal and a random noise signal, wherein a weight of the weighted combination is determined according to a value of a classification parameter and/or a voicing factor of the decoded low-frequency signal. 15. The apparatus according to claim 14 , wherein the processor is further configured to: correct the high frequency excitation signal by using the predicted high frequency gain, and passing the corrected high frequency excitation signal through a LPC synthesis filter to obtain the high frequency band signal. 16. The apparatus according to claim 9 , wherein the high-frequency gain is corrected according to the pitch period. 17. A non-transitory computer-readable storage medium containing computer instructions that, when executed by a processor, cause the processor to perform the steps of: receiving a bit stream encoded from an audio signal; performing decoding operations on the bit stream, wherein a low frequency signal is generated via the decoding operations, wherein a collection of parameters is acquired via the decoding operations, and wherein the collection of parameters comprises one or more of the following parameters: a linear predictive coefficient (LPC), a set of line spectral frequency (LSF) parameters, a pitch period, a decoding rate, an adaptive codebook contribution, and an algebraic codebook contribution; predicting a high-frequency gain according to the LPC, and any one of or a combination of: a voicing factor, a noise gate factor, a spectrum tilt factor, and a classification parameter; predicting a high frequency excitation signal by selecting a frequency band from a low frequency excitation signal according to a difference value between the LSF parameters, wherein the low frequency excitation signal is represented by a sum of the adaptive codebook contribution and the algebraic codebook contribution; and generating a high frequency band signal from the high frequency excitation signal and the high frequency gain to recover the audio signal.

Assignees

Inventors

Classifications

  • G10L19/087Primary

    using mixed excitation models, e.g. MELP, MBE, split band LPC or HVXC · CPC title

  • Codebook adaptations · CPC title

  • Pitch determination of speech signals · CPC title

  • the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders · CPC title

  • G10L21/038Primary

    using band spreading techniques · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9666201B2 cover?
The present invention provides a bandwidth extension method and apparatus. The method includes: acquiring a bandwidth extension parameter, where the bandwidth extension parameter includes one or more of the following parameters: a linear predictive coefficient (LPC), a line spectral frequency (LSF) parameter, a pitch period, a decoding rate, an adaptive codebook contribution, and an algebraic c…
Who is the assignee on this patent?
Huawei Tech Co Ltd
What technology area does this patent fall under?
Primary CPC classification G10L19/087. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 30 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).