What technology area does this patent fall under?

Primary CPC classification G10L19/008. Mapped technology areas include Physics.

When was this patent published?

Publication date May 19, 2016 (kind code A1). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Noise filling in multichannel audio coding

US2016140974A1 · US · A1

Patent metadata
Field	Value
Publication number	US-2016140974-A1
Application number	US-201615002375-A
Country	US
Kind code	A1
Filing date	Jan 20, 2016
Priority date	Jul 22, 2013
Publication date	May 19, 2016
Grant date	—

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

In multichannel audio coding, an improved coding efficiency is achieved by the following measure: the noise filling of zero-quantized scale factor bands is performed using noise filling sources other than artificially generated noise or spectral replica. In particular, the coding efficiency in multichannel audio coding may be rendered more efficient by performing the noise filling based on noise generated using spectral lines from a previous frame of, or a different channel of the current frame of, the multichannel audio signal.
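The idea in the abstract can be illustrated with a minimal sketch. It assumes a spectrum split into scale factor bands by a hypothetical `band_offsets` list, one linear gain per band in `scale_factors`, and a previous-frame downmix spectrum `prev_downmix` of the same length. The unit-RMS normalisation of the source is an assumption made for illustration; the publication only states that the noise level is adjusted using the band's scale factor. The inverse transform that follows in a real decoder is omitted.

```python
import math

def fill_zero_bands(quantized, band_offsets, scale_factors, prev_downmix):
    """Dequantize a spectrum, filling all-zero bands from the co-located
    lines of the previous frame's downmix (illustrative sketch only)."""
    out = [0.0] * len(quantized)
    for b in range(len(band_offsets) - 1):
        lo, hi = band_offsets[b], band_offsets[b + 1]
        gain = scale_factors[b]  # assumption: a linear gain per band
        if any(quantized[lo:hi]):
            # At least one non-zero line: ordinary dequantization.
            for k in range(lo, hi):
                out[k] = gain * quantized[k]
        else:
            # Zero-quantized band: use the previous downmix as the noise
            # source, normalised to unit RMS (assumption), then scaled.
            src = prev_downmix[lo:hi]
            rms = math.sqrt(sum(x * x for x in src) / len(src))
            if rms > 0.0:
                for k in range(lo, hi):
                    out[k] = gain * prev_downmix[k] / rms
    return out

# Two bands of two lines each; the second band is quantized to zero.
spectrum = fill_zero_bands(
    quantized=[2, 0, 0, 0],
    band_offsets=[0, 2, 4],
    scale_factors=[0.5, 2.0],
    prev_downmix=[1.0, -1.0, 3.0, -4.0],
)
```

With this normalisation, the filled band keeps the spectral shape of the previous downmix while its RMS level equals the band's scale factor.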

First claim

Opening claim text (preview).

1. A parametric frequency-domain audio decoder, configured to identify first scale factor bands of a spectrum of a first channel of a current frame of a multichannel audio signal, within which all spectral lines are quantized to zero, and second scale factor bands of the spectrum, within which at least one spectral line is quantized to non-zero; fill the spectral lines within a predetermined scale factor band of the first scale factor bands with noise generated using spectral lines of a downmix of a previous frame of the multichannel audio signal, with adjusting a level of the noise using a scale factor of the predetermined scale factor band; dequantize the spectral lines within the second scale factor bands using scale factors of the second scale factor bands; and inverse transform the spectrum acquired from the first scale factor bands filled with the noise the level of which is adjusted using the scale factors of the first scale factor bands, and the second scale factor bands dequantized using the scale factors of the second scale factor bands, so as to acquire a time domain portion of the first channel of the multichannel audio signal.
2. The parametric frequency-domain audio decoder according to claim 1, further configured to, in the filling, adjust a level of a co-located portion of a spectrum of the downmix of the previous frame, spectrally co-located to the predetermined scale factor band, using the scale factor of the predetermined scale factor band, and add the co-located portion having its level adjusted, to the predetermined scale factor band.
3. The parametric frequency-domain audio decoder according to claim 2, further configured to predict a subset of the scale factor bands from a different channel or downmix of the current frame to acquire an inter-channel prediction, and use the predetermined scale factor band filled with the noise, and the second scale factor bands dequantized using the scale factors of the second scale factor bands as a prediction residual of the inter-channel prediction to acquire the spectrum.
4. The parametric frequency-domain audio decoder according to claim 3, further configured to, in predicting the subset of the scale factor bands, perform an imaginary part estimation of the different channel or downmix of the current frame using the spectrum of the downmix of the previous frame.
5. The parametric frequency-domain audio decoder according to claim 1, wherein the current channel and the other channel are subject to MS coding in the data stream, and the parametric frequency-domain audio decoder is configured to subject the spectrum to MS decoding.
6. The parametric frequency-domain audio decoder according to claim 1, further configured to sequentially extract the scale factors of the first and second scale factor bands from a data stream using context-adaptive entropy decoding with context determination depending on, and/or using predictive decoding with spectral prediction depending on, already extracted scale factors in a spectral neighborhood of a currently extracted scale factor, with the scale factors spectrally arranged according to a spectral order among the first and second scale factor bands.
7. The parametric frequency-domain audio decoder according to claim 1, further configured such that the noise is additionally generated using pseudorandom or random noise.
8. The parametric frequency-domain audio decoder according to claim 7, further configured to adjust a level of the pseudorandom or random noise equally for the first scale factor bands, according to a noise parameter signaled in a data stream for the current frame.
9. The parametric frequency-domain audio decoder according to claim 1, further configured to equally modify the scale factors of the first scale factor bands relative to the scale factors of the second scale factor bands using a modifying parameter signaled in a data stream for the current frame.
10. A parametric frequency-domain audio encoder, configured to quantize spectral lines of a spectrum of a first channel of a current frame of a multichannel audio signal using preliminary scale factors of scale factor bands within the spectrum; identify first scale factor bands in the spectrum within which all spectral lines are quantized to zero, and second scale factor bands of the spectrum within which at least one spectral line is quantized to non-zero, within a prediction and/or rate control loop, fill the spectral lines within a predetermined scale factor band of the first scale factor bands with noise generated using spectral lines of a downmix of a previous frame of the multichannel audio signal, with adjusting a level of the noise using an actual scale factor of the predetermined scale factor band; and signal the actual scale factor for the predetermined scale factor band instead of the preliminary scale factor.
11. The parametric frequency-domain audio encoder according to claim 10, further configured to calculate the actual scale factor for the predetermined scale factor band based on a level of an un-quantized version of the spectral lines of the spectrum of the first channel within the predetermined scale factor band and additionally based on the spectral lines of the downmix of the previous frame of the multichannel audio signal or spectral lines of a different channel of the current frame of the multichannel audio signal.
12. A parametric frequency-domain audio decoder, configured to identify first scale factor bands of a spectrum of a first channel of a current frame of a multichannel audio signal, within which all spectral lines are quantized to zero, and second scale factor bands of the spectrum, within which at least one spectral line is quantized to non-zero; fill the spectral lines within a predetermined scale factor band of the first scale factor bands with noise generated using spectral lines of a different channel of the current frame of the multichannel audio signal, with adjusting a level of the noise using a scale factor of the predetermined scale factor band; dequantize the spectral lines within the second scale factor bands using scale factors of the second scale factor bands; and inverse transform the spectrum acquired from the first scale factor bands filled with the noise the level of which is adjusted using the scale factors of the first scale factor bands, and the second scale factor bands dequantized using the scale factors of the second scale factor bands, so as to acquire a time domain portion of the first channel of the multichannel audio signal.
13. The parametric frequency-domain audio decoder according to claim 12, further configured to, in the filling, adjust a level of a co-located portion of a spectrum of the downmix of the previous frame, spectrally co-located to the predetermined scale factor band, using the scale factor of the predetermined scale factor band, and add the co-located portion having its level adjusted, to the predetermined scale factor band.
14. The parametric frequency-domain audio decoder according to claim 13, further configured to predict a subset of the scale factor bands from a different channel or downmix of the current frame to acquire an inter-channel prediction, and use the predetermined scale factor band filled with the noise, and the second scale factor bands dequantized using the scale factors of the second scale factor bands as a prediction residual of the inter-channel prediction to acquire the spectrum.
15. The parametric frequency-domain audio decoder according to claim 14, further configured to, in predicting the subset of
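On the encoder side, claim 11 bases the actual scale factor on the level of the un-quantized lines and on the lines of the noise source. One possible reading, sketched here under assumptions, is a gain that makes the scaled source band match the RMS level of the original band. The function names, the use of RMS as the level measure, and the fallback to the preliminary value when the source is silent are all hypothetical and not taken from the publication.

```python
import math

def rms(values):
    """Root-mean-square level of a list of spectral lines."""
    return math.sqrt(sum(v * v for v in values) / len(values))

def actual_scale_factor(unquantized_band, source_band, preliminary):
    """Gain that scales source_band to the level of unquantized_band.

    Assumption: 'level' means RMS, and the scale factor is a linear gain.
    """
    source_level = rms(source_band)
    if source_level == 0.0:
        # Assumption: with a silent source there is nothing to scale,
        # so the preliminary scale factor is kept.
        return preliminary
    return rms(unquantized_band) / source_level

# The original band is ten times quieter than the source band.
gain = actual_scale_factor([0.3, -0.4], [3.0, -4.0], preliminary=1.0)
```

Signalling this gain instead of the preliminary scale factor lets a decoder that fills the band from the same source reproduce roughly the original band level.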

Assignees

Fraunhofer Ges Forschung

Inventors

Classifications

G10L19/008 · Primary
Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title
H04S2400/03
Aspects of down-mixing multi-channel audio to configurations with lower numbers of playback channels, e.g. 7.1 -> 5.1 (H04S2400/01 takes precedence) · CPC title
G10L19/028 · Primary
Noise substitution, i.e. substituting non-tonal spectral components by noisy source (comfort noise for discontinuous speech transmission G10L19/012) · CPC title
G10L19/035
Scalar quantisation · CPC title
H04S3/008
in which the audio signals are in digital form, i.e. employing more than two discrete digital channels (data reduction aspects thereof based on psychoacoustics G10L19/02) · CPC title

Patent family

Related publications grouped by family.

View patent family 48832792

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2016140974A1 cover?: In multichannel audio coding, an improved coding efficiency is achieved by the following measure: the noise filling of zero-quantized scale factor bands is performed using noise filling sources other than artificially generated noise or spectral replica. In particular, the coding efficiency in multichannel audio coding may be rendered more efficient by performing the noise filling based on nois…
Who is the assignee on this patent?: Fraunhofer Ges Forschung