Concept for generating an enhanced sound field description or a modified sound field description using a multi-point sound field description
US-2024098445-A1 · Mar 21, 2024 · US
US2016140974A1 · US · A1
| Field | Value |
|---|---|
| Publication number | US-2016140974-A1 |
| Application number | US-201615002375-A |
| Country | US |
| Kind code | A1 |
| Filing date | Jan 20, 2016 |
| Priority date | Jul 22, 2013 |
| Publication date | May 19, 2016 |
| Grant date | — |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
In multichannel audio coding, an improved coding efficiency is achieved by the following measure: the noise filling of zero-quantized scale factor bands is performed using noise filling sources other than artificially generated noise or spectral replica. In particular, the coding efficiency in multichannel audio coding may be rendered more efficient by performing the noise filling based on noise generated using spectral lines from a previous frame of, or a different channel of the current frame of, the multichannel audio signal.
Opening claim text (preview).
1 . A parametric frequency-domain audio decoder, configured to identify first scale factor bands of a spectrum of a first channel of a current frame of a multichannel audio signal, within which all spectral lines are quantized to zero, and second scale factor bands of the spectrum, within which at least one spectral line is quantized to non-zero; fill the spectral lines within a predetermined scale factor band of the first scale factor bands with noise generated using spectral lines of a downmix of a previous frame of the multichannel audio signal, with adjusting a level of the noise using a scale factor of the predetermined scale factor band; dequantize the spectral lines within the second scale factor bands using scale factors of the second scale factor bands; and inverse transform the spectrum acquired from the first scale factor bands filled with the noise the level of which is adjusted using the scale factors of the first scale factor bands, and the second scale factor bands dequantized using the scale factors of the second scale factor bands, so as to acquire a time domain portion of the first channel of the multichannel audio signal. 2 . The parametric frequency-domain audio decoder according to claim 1 , further configured to, in the filling, adjust a level of a co-located portion of a spectrum of the downmix of the previous frame, spectrally co-located to the predetermined scale factor band, using the scale factor of the predetermined scale factor band, and add the co-located portion having its level adjusted, to the predetermined scale factor band. 3 . The parametric frequency-domain audio decoder according to claim 2 , further configured to predict a subset of the scale factor bands from a different channel or downmix of the current frame to acquire an inter-channel prediction, and use the predetermined scale factor band filled with the noise, and the second scale factor bands dequantized using the scale factors of the second scale factor bands as a prediction residual of the inter-channel prediction to acquire the spectrum. 4 . The parametric frequency-domain audio decoder according to claim 3 , further configured to, in predicting the subset of the scale factor bands, perform an imaginary part estimation of the different channel or downmix of the current frame using the spectrum of the downmix of the previous frame. 5 . The parametric frequency-domain audio decoder according to claim 1 , wherein the current channel and the other channel are subject to MS coding in the data stream, and the parametric frequency-domain audio decoder is configured to subject the spectrum to MS decoding. 6 . The parametric frequency-domain audio decoder according to claim 1 , further configured to sequentially extract the scale factors of the first and second scale factor bands from a data stream using context-adaptive entropy decoding with context determination depending on, and/or using predictive decoding with spectral prediction depending on, already extracted scale factors in a spectral neighborhood of a currently extracted scale factor, with the scale factors spectrally arranged according to a spectral order among the first and second scale factor bands. 7 . The parametric frequency-domain audio decoder according to claim 1 , further configured such that the noise is additionally generated using pseudorandom or random noise. 8 . The parametric frequency-domain audio decoder according to claim 7 , further configured to adjust a level of the pseudorandom or random noise equally for the first scale factor bands, according to a noise parameter signaled in a data stream for the current frame. 9 . The parametric frequency-domain audio decoder according to claim 1 , further configured to equally modify the scale factors of the first scale factor bands relative to the scale factors of the second scale factor bands using a modifying parameter signaled in a data stream for the current frame. 10 . A parametric frequency-domain audio encoder, configured to quantize spectral lines of a spectrum of a first channel of a current frame of a multichannel audio signal using preliminary scale factors of scale factor bands within the spectrum; identify first scale factor bands in the spectrum within which all spectral lines are quantized to zero, and second scale factor bands of the spectrum within which at least one spectral line is quantized to non-zero, within a prediction and/or rate control loop, fill the spectral lines within a predetermined scale factor band of the first scale factor bands with noise generated using spectral lines of a downmix of a previous frame of the multichannel audio signal, with adjusting a level of the noise using an actual scale factor of the predetermined scale factor band; and signal the actual scale factor for the predetermined scale factor band instead of the preliminary scale factor. 11 . The parametric frequency-domain audio encoder according to claim 10 , further configured to calculate the actual scale factor for the predetermined scale factor band based on a level of an un-quantized version of the spectral lines of the spectrum of the first channel within the predetermined scale factor band and additionally based on the spectral lines of the downmix of the previous frame of the multichannel audio signal or spectral lines of a different channel of the current frame of the multichannel audio signal. 12 . A parametric frequency-domain audio decoder, configured to Identify first scale factor bands of a spectrum of a first channel of a current frame of a multichannel audio signal, within which all spectral lines are quantized to zero, and second scale factor bands of the spectrum, within which at least one spectral line is quantized to non-zero; Fill the spectral lines within a predetermined scale factor band of the first scale factor bands with noise generated using spectral lines of a different channel of the current frame of the multichannel audio signal, with adjusting a level of the noise using a scale factor of the predetermined scale factor band; dequantize the spectral lines within the second scale factor bands using scale factors of the second scale factor bands; and inverse transform the spectrum acquired from the first scale factor bands filled with the noise the level of which is adjusted using the scale factors of the first scale factor bands, and the second scale factor bands dequantized using the scale factors of the second scale factor bands, so as to acquire a time domain portion of the first channel of the multichannel audio signal. 13 . The parametric frequency-domain audio decoder according to claim 12 , further configured to, in the filling, adjust a level of a co-located portion of a spectrum of the downmix of the previous frame, spectrally co-located to the predetermined scale factor band, using the scale factor of the predetermined scale factor band, and add the co-located portion having its level adjusted, to the predetermined scale factor band. 14 . The parametric frequency-domain audio decoder according to claim 13 , further configured to predict a subset of the scale factor bands from a different channel or downmix of the current frame to acquire an inter-channel prediction, and use the predetermined scale factor band filled with the noise, and the second scale factor bands dequantized using the scale factors of the second scale factor bands as a prediction residual of the inter-channel prediction to acquire the spectrum. 15 . The parametric frequency-domain audio decoder according to claim 14 , further configured to, in predicting the subset of
Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title
Aspects of down-mixing multi-channel audio to configurations with lower numbers of playback channels, e.g. 7.1 -> 5.1 (H04S2400/01 takes precedence) · CPC title
Noise substitution, i.e. substituting non-tonal spectral components by noisy source (comfort noise for discontinuous speech transmission G10L19/012) · CPC title
Scalar quantisation · CPC title
in which the audio signals are in digital form, i.e. employing more than two discrete digital channels (data reduction aspects thereof based on psychoacoustics G10L19/02) · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.