Method and apparatus for processing audio signal using spectral data of audio signal

US9275648B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9275648-B2
Application numberUS-74714808-A
CountryUS
Kind codeB2
Filing dateDec 18, 2008
Priority dateDec 18, 2007
Publication dateMar 1, 2016
Grant dateMar 1, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method of processing an audio signal is disclosed. The present invention includes receiving spectral data corresponding to a first band in a frequency band including the first band and a second band, determining a copy band based on frequency information of the copy band corresponding to a partial band of the first band, and generating spectral data of a target band corresponding to the second band using the spectral data of the copy band, wherein the copy band exists in an upper part of the first band.

First claim

Opening claim text (preview).

What is claimed is: 1. A method of decoding an audio signal, comprising: receiving, by a decoding apparatus, spatial information and a narrow-band downmix signal, the narrow-band downmix signal corresponding to a first band from a bitstream; decoding, by the decoding apparatus, the narrow-band downmix signal using one of an audio coding scheme and a speech coding scheme, wherein the speech coding scheme includes Linear Prediction Coding (LPC); obtaining, by the decoding apparatus, spectral data corresponding to the first band from the narrow-band downmix signal; determining, by the decoding apparatus, a copy band based on frequency information of the copy band, the copy band corresponding to a partial band of the first band; time-dilating the spectral signal of the copy band, and decimating the time dilated spectral signal, wherein the time dilating step dilates a time domain signal using a phase vocoder scheme; generating, by the decoding apparatus, spectral data of a target band based on the decimated spectral signal, the target band corresponding to the second band using spectral data of the copy band; and generating an output signal using a broadband downmix signal and spatial information, wherein the broadband downmix signal includes the spectral data of the target band and the spectral data of the first band, the output signal including at least two channels, wherein the copy band exists in an upper part of the first band, wherein the frequency information of the copy band comprises at least one of a start frequency, a start band, and index information indicating the start band, and wherein the at least one of a start frequency, a start band, and index information indicating the start band is determined variably per frame and is formed from a high frequency band or a low frequency band based on a numerical value of a brightness of sound for the audio signal using a spectral centroid. 2. The method of claim 1 , wherein the spectral data of the target band is generated by using at least one of gain information corresponding to a gain between the spectral data of the copy band and the target band, and harmonic information of the copy band. 3. The method of claim 1 , wherein the spatial information is used to generate the output signal by upmixing the broadband downmix signal and includes at least one of channel level difference information, inter-channel correlation information, channel prediction coefficient and downmix gain information. 4. The method of claim 1 , wherein a bandwidth of the target band is different from that of the copy band, and at least two bandwidths of the target band are generated using the copy band. 5. The method of claim 4 , wherein different gain information is applied to each bandwidth of the target band, and each of the gain information is obtained using an energy ratio between the spectral data of the copy band to the target band. 6. An apparatus configured to decode an audio signal, comprising: a de-multiplexer configured to receive spatial information and a narrow-band downmix signal, the narrow-band downmix signal corresponding to a first band from a bitstream; an audio signal decoding unit implemented by a processor to decode the narrow-band downmix signal using an audio coding scheme; a speech signal decoding unit implemented by the processor to decode the narrow-band downmix signal using a speech coding scheme including Linear Prediction Coding (LPC); a copy band determining unit implemented by the processor to obtain spectral data corresponding to the first band from the narrow-band downmix signal, and to determine a copy band based on frequency information of the copy band, the copy band corresponding to a partial band of the first band; a target band information generating unit implemented by the processor to time-dilate the spectral signal of the copy band, and decimate the time dilated spectral signal, wherein the time-dilating of the spectral signal of the copy band includes dilating a time domain signal using a phase vocoder scheme, the target band information generating unit further configured to generate spectral data of a target band, the target band corresponding to a second band using the spectral data of the copy band; and a multi-channel generating unit implemented by the processor to generate an output signal using a broadband downmix signal and spatial information, wherein the broadband downmix signal includes the spectral data of the target band and the spectral data of the first band, the output signal including at least two channels, wherein the copy band exists in an upper part of the first band, wherein the frequency information of the copy band comprises at least one of a start frequency, a start band, and index information indicating the start band, and wherein the at least one of a start frequency, a start band, and index information indicating the start band is determined variably per frame and is formed from a high frequency band or a low frequency band based on a numerical value of a brightness of sound for the audio signal using a spectral centroid. 7. The apparatus of claim 6 , wherein the spectral data of the target band is generated using at least one of gain information corresponding to a gain between the spectral data of the copy band and the target band, and harmonic information of the copy band. 8. The apparatus of claim 6 , wherein the spatial information is used to generate the output signal by upmixing the broadband downmix signal and includes at least one of channel level difference information, inter-channel correlation information, channel prediction coefficient and downmix gain information. 9. A non-transitory computer-readable storage medium having recorded thereon a computer program for executing an audio decoding method, the audio decoding method comprising: receiving spatial information and a narrow-band downmix signal, the narrow-band downmix signal corresponding to a first band from a bitstream; decoding the narrow-band downmix signal using one of an audio coding scheme and a speech coding scheme, wherein the speech coding scheme includes Linear Prediction Coding (LPC); obtaining spectral data corresponding to the first band from the narrow-band downmix signal; determining a copy band based on frequency information of the copy band corresponding to a partial band of the first band; time-dilating the spectral signal of the copy band, and decimating the time dilated spectral signal, wherein the time dilating step dilates a time domain signal using a phase vocoder scheme; generating spectral data of a target band corresponding to a second band using spectral data of the copy band; and generating an output signal using a broadband downmix signal and spatial information, wherein the broadband downmix signal includes the spectral data of the target band and the spectral data of the first band, wherein the copy band exists in an upper part of the first band, and wherein the at least one of a start frequency, a start band, and index information indicating the start band is determined variably per frame and is formed from a high frequency band or a low frequency band based on a numerical value of a brightness of sound for the audio signal using a spectral centroid.

Assignees

Inventors

Classifications

  • using subband decomposition · CPC title

  • Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding · CPC title

  • using band spreading techniques · CPC title

  • Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9275648B2 cover?
A method of processing an audio signal is disclosed. The present invention includes receiving spectral data corresponding to a first band in a frequency band including the first band and a second band, determining a copy band based on frequency information of the copy band corresponding to a partial band of the first band, and generating spectral data of a target band corresponding to the secon…
Who is the assignee on this patent?
Lee Hyun Kook, Kim Dong Soo, Yoon Sung Yong, and 3 more
What technology area does this patent fall under?
Primary CPC classification G10L19/0204. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Mar 01 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).