US9697843B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9697843-B2 |
| Application number | US-201414265693-A |
| Country | US |
| Kind code | B2 |
| Filing date | Apr 30, 2014 |
| Priority date | Apr 30, 2014 |
| Publication date | Jul 4, 2017 |
| Grant date | Jul 4, 2017 |
Official abstract text for this publication.
A particular method includes determining, at a device, a voicing classification of an input signal. The input signal corresponds to an audio signal. The method also includes controlling an amount of an envelope of a representation of the input signal based on the voicing classification. The method further includes modulating a white noise signal based on the controlled amount of the envelope. The method also includes generating a high band excitation signal based on the modulated white noise signal.
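The steps in the abstract can be illustrated with a minimal sketch. It assumes the envelope is obtained by low-pass filtering the absolute value of the signal representation with a one-pole filter, and that the voicing class selects the cut-off frequency (higher for strongly voiced, as in claim 7). The names `CUTOFF_HZ`, `envelope`, and `high_band_excitation`, the cut-off values, and the sample rate are all hypothetical and are not taken from the patent.

```python
import math
import random

# Hypothetical cut-off frequencies in Hz per voicing class (illustrative values only).
# Claim 7 only requires that strongly voiced uses a greater cut-off than strongly unvoiced.
CUTOFF_HZ = {
    "strongly_voiced": 800.0,
    "weakly_voiced": 400.0,
    "weakly_unvoiced": 200.0,
    "strongly_unvoiced": 50.0,
}


def envelope(representation, cutoff_hz, sample_rate):
    """One-pole low-pass of |x|; a higher cut-off lets the envelope vary faster."""
    alpha = 1.0 - math.exp(-2.0 * math.pi * cutoff_hz / sample_rate)
    state = 0.0
    env = []
    for x in representation:
        state += alpha * (abs(x) - state)
        env.append(state)
    return env


def high_band_excitation(representation, voicing, sample_rate=16000, seed=0):
    """Modulate white Gaussian noise by the voicing-controlled envelope."""
    rng = random.Random(seed)
    env = envelope(representation, CUTOFF_HZ[voicing], sample_rate)
    noise = [rng.gauss(0.0, 1.0) for _ in representation]
    return [e * n for e, n in zip(env, noise)]


# Usage: a pulse train standing in for a (hypothetical) low band excitation.
pulses = [1.0 if i % 100 == 0 else 0.0 for i in range(1600)]
excitation = high_band_excitation(pulses, "strongly_voiced")
```

With the higher cut-off, the envelope follows the individual pulses closely, so the modulated noise keeps more of the temporal structure of voiced speech; with the lower cut-off the envelope is nearly flat and the output stays close to plain noise.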
Opening claim text (preview).
What is claimed is:

1. A method comprising: extracting a voicing classification parameter of an input signal based on a received bitstream, wherein the input signal corresponds to an audio signal; controlling a frequency range of an envelope of a representation of the input signal based on the voicing classification parameter, the frequency range controlled based on a cut-off frequency of a low-pass filter applied to the representation of the input signal; modulating a white noise signal based on the controlled frequency range of the envelope; and generating a high band excitation signal corresponding to a decoded version of the audio signal based on the modulated white noise signal.
2. The method of claim 1, further comprising controlling a magnitude of the envelope.
3. The method of claim 1, further comprising controlling at least one of a shape of the envelope or a gain of the envelope.
4. The method of claim 3, wherein an extent of variation of the shape of the envelope is greater when the voicing classification parameter corresponds to strongly voiced than when the voicing classification parameter corresponds to strongly unvoiced.
5. The method of claim 1, wherein the voicing classification parameter indicates whether the input signal is a strongly voiced signal, a weakly voiced signal, a weakly unvoiced signal, or a strongly unvoiced signal.
6. The method of claim 1, further comprising determining the cut-off frequency based on the voicing classification parameter.
7. The method of claim 1, wherein the cut-off frequency is greater when the voicing classification parameter corresponds to strongly voiced than when the voicing classification parameter corresponds to strongly unvoiced.
8. The method of claim 1, wherein extracting the voicing classification parameter is performed by a decoder.
9. The method of claim 1, wherein controlling the frequency range of the envelope of the representation of the input signal based on the voicing classification parameter is performed by a mobile communication device.
10. The method of claim 1, wherein controlling the frequency range of the envelope of the representation of the input signal based on the voicing classification parameter is performed by a fixed location communication unit.
11. The method of claim 1, wherein controlling the frequency range of the envelope of the representation comprises adjusting the representation of the input signal in a transform domain.
12. The method of claim 1, wherein the representation of the input signal includes a low band excitation signal of an encoded version of the audio signal or a high band excitation signal of the encoded version of the audio signal.
13. The method of claim 1, wherein the representation of the input signal includes a harmonically extended excitation signal and wherein the harmonically extended excitation signal is generated from a low band excitation signal of an encoded version of the audio signal.
14. The method of claim 1, further comprising generating a scaled white noise signal by combining a scaled unmodulated white noise signal with a scaled modulated white noise signal, wherein the high band excitation signal is based on the scaled white noise signal.
15. The method of claim 1, wherein the envelope comprises a time-varying envelope, and further comprising updating the envelope more than once per frame of the input signal.
16. An apparatus comprising: a voicing classifier configured to extract a voicing classification parameter of an input signal based on a received bitstream, wherein the input signal corresponds to an audio signal; an envelope adjuster configured to control a frequency range of an envelope of a representation of the input signal based on the voicing classification parameter, the frequency range controlled based on a cut-off frequency of a low-pass filter applied to the representation of the input signal; a modulator configured to modulate a white noise signal based on the controlled frequency range of the envelope; and an output circuit configured to generate a high band excitation signal based on the modulated white noise signal.
17. The apparatus of claim 16, wherein the envelope adjuster is configured to control, based on the voicing classification parameter, at least one of a shape of the envelope, a magnitude of the envelope, or a gain of the envelope.
18. The apparatus of claim 17, wherein at least one of the shape of the envelope, the magnitude of the envelope, or the gain of the envelope is controlled by adjusting one or more poles of linear predictive coding (LPC) coefficients based on the voicing classification parameter.
19. The apparatus of claim 17, wherein at least one of the shape of the envelope, the magnitude of the envelope, or the gain of the envelope is configured to be controlled based on adjusted coefficients of a filter, the adjusted coefficients determined based on the voicing classification parameter, and wherein the modulator is configured to apply the filter to the white noise signal to generate the modulated white noise signal.
20. The apparatus of claim 16, further comprising an antenna; and a receiver coupled to the antenna and configured to receive the bitstream.
21. The apparatus of claim 20, wherein the receiver, the voicing classifier, the envelope adjuster, the modulator, and the output circuit are integrated into a mobile communication device.
22. The apparatus of claim 20, wherein the receiver, the voicing classifier, the envelope adjuster, the modulator, and the output circuit are integrated into a fixed location communication unit.
23. The apparatus of claim 16, further comprising: a high band encoder configured to encode a high band portion of the audio signal based on the high band excitation signal; and a transmitter configured to transmit an encoded audio signal to another device, wherein the encoded audio signal is an encoded version of the audio signal.
24. A computer-readable storage device storing instructions that, when executed by at least one processor, cause the at least one processor to: extract a voicing classification parameter of an input signal based on a received bitstream, wherein the input signal corresponds to an audio signal; control a frequency range of an envelope of a representation of the input signal based on the voicing classification parameter, the frequency range controlled based on a cut-off frequency of a low-pass filter applied to the representation of the input signal; modulate a white noise signal based on the controlled frequency range of the envelope; and generate a high band excitation signal based on the modulated white noise signal.
25. The computer-readable storage device of claim 24, wherein the instructions are further executable to cause the at least one processor to control a shape of the envelope based on the voicing classification parameter.
26. The computer-readable storage device of claim 24, wherein the instructions are further executable to cause the at least one processor to control at least one of a magnitude of the envelope or a gain of the envelope.
27. An apparatus comprising: means for extracting a voicing classification parameter of an input signal based on a received bitstream, wherein the input signal corresponds to an audio signal; means for controlling a frequency range of an envelope of a representation of the inp
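
The combination recited in claim 14 (a scaled unmodulated white noise signal added to a scaled modulated white noise signal) might look like the following sketch. It assumes the same noise sequence feeds both branches and that the two gains are supplied by the caller; the claim fixes neither choice, and the function name `mix_noise` and its parameters are hypothetical.

```python
import random


def mix_noise(envelope, modulated_gain, unmodulated_gain, seed=0):
    """Return modulated_gain * (envelope * noise) + unmodulated_gain * noise.

    Assumption: one shared Gaussian noise sequence is used for both branches.
    """
    rng = random.Random(seed)
    noise = [rng.gauss(0.0, 1.0) for _ in envelope]
    return [
        modulated_gain * e * n + unmodulated_gain * n
        for e, n in zip(envelope, noise)
    ]


# Usage: a flat envelope with equal gains simply reproduces the noise sequence.
scaled = mix_noise([1.0] * 8, modulated_gain=0.5, unmodulated_gain=0.5)
```

Shifting weight toward the modulated branch makes the result track the envelope more strongly, while shifting it toward the unmodulated branch makes it closer to stationary noise.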
Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters · CPC title
Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding · CPC title
Noise filtering · CPC title