Concept for generating an enhanced sound field description or a modified sound field description using a multi-point sound field description
US-2024098445-A1 · Mar 21, 2024 · US
US2016155448A1 · US · A1
| Field | Value |
|---|---|
| Publication number | US-2016155448-A1 |
| Application number | US-201414900389-A |
| Country | US |
| Kind code | A1 |
| Filing date | Jun 27, 2014 |
| Priority date | Jul 5, 2013 |
| Publication date | Jun 2, 2016 |
| Grant date | — |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
The present document relates to multichannel audio coding and more precisely to techniques for discrete multichannel audio encoding and decoding. In particular, the present document relates to systems and method for coding soundfields. An audio encoder ( 200 ) configured to encode a frame of a soundfield signal ( 110 ) comprising a plurality of audio signals is described. The audio encoder ( 200 ) comprises a transform determination unit ( 203, 204 ) configured to determine an energy-compacting orthogonal transform (V) based on the frame of the soundfield signal ( 110 ). Furthermore, the encoder ( 200 ) comprises a transform unit ( 202 ) configured to apply the energy-compacting orthogonal transform (V) to the frame of the soundfield signal ( 110 ), and configured to provide a frame of a rotated soundfield signal ( 112 ) comprising a plurality of rotated audio signals (E 1 , E 2 , E 3 ). The audio encoder ( 200 ) comprises a waveform encoding unit ( 103 ) configured to encode a first rotated audio signal (E 1 ) of the plurality of rotated audio signals (E 1 , E 2 , E 3 ), and a parametric encoding unit ( 104 ) configured to determine a set of spatial parameters (ae 2 , be 2 ) for determining a second rotated audio signal (E 2 ) of the plurality of rotated audio signals (E 1 , E 2 , E 3 ) based on the first rotated audio signal (E 1 ).
Opening claim text (preview).
What is claimed is: 1 - 43 . (canceled) 44 . An audio encoder configured to encode a frame of a soundfield signal comprising a plurality of audio signals, the audio encoder comprising a transform determination unit configured to determine an energy-compacting orthogonal transform based on the frame of the soundfield signal; a transform unit configured to apply the energy-compacting orthogonal transform to a frame derived from the frame of the soundfield signal, and to provide a frame of a rotated soundfield signal comprising a plurality of rotated audio signals; a waveform encoding unit configured to encode a first rotated audio signal of the plurality of rotated audio signals; and a parametric encoding unit configured to determine a set of spatial parameters for determining a second rotated audio signal of the plurality of rotated audio signals based on the first rotated audio signal. 45 . The audio encoder of claim 44 , wherein the parametric encoding unit is configured to determine the set of spatial parameters based on the signal model E 2 =ae 2 *E 1 +be 2*decorr2( E 1), with ae 2 being a second prediction parameter, be 2 being a second energy adjustment gain, E 1 being the first rotated audio signal, E 2 being the second rotated audio signal, and decorr 2 (E 1 ) being a second decorrelated version of the first rotated audio signal; wherein the set of spatial parameters comprises the second prediction parameter and the second energy adjustment gain. 46 . The audio encoder of claim 44 , wherein the parametric encoding unit is configured to determine a second prediction parameter based on the second rotated audio signal and based on the first rotated audio signal; and the second prediction parameter enables a corresponding decoder to estimate a correlated component of the second rotated audio signal based on the first rotated audio signal. 47 . The audio encoder of claim 46 , wherein the parametric encoding unit is configured to determine the second prediction parameter such that a mean square error of a prediction residual between the second rotated audio signal and the correlated component of the second rotated audio signal is reduced. 48 . The audio encoder of claim 47 , wherein the parametric encoding unit is configured to determine the second prediction parameter using the formula: ae 2=( E 1 T *E 2)/( E 1 T *E 1), with E 1 being the first rotated audio signal, E 2 being the second rotated audio signal, ae 2 being the second prediction parameter, and T indicating a vector transposition. 49 . The audio encoder of claim 44 , wherein the parametric encoding unit is configured to determine a second energy adjustment gain based on the second rotated audio signal and based on the first rotated audio signal; and the second energy adjustment gain enables a corresponding decoder to estimate a decorrelated component of the second rotated audio signal based on the first rotated audio signal. 50 . The audio encoder of claim 49 , wherein the parametric encoding unit is configured to determine the second energy adjustment gain based on a ratio of an amplitude of the prediction residual and an amplitude of the first rotated audio signal. 51 . The audio encoder of claim 50 , wherein the parametric encoding unit is configured to determine the second energy adjustment gain based on a ratio of the root mean square of the prediction residual and the root mean square of the first rotated audio signal. 52 . The audio encoder of claim 44 , further comprising a time-to-frequency analysis unit configured to convert a frame of a soundfield signal into a plurality of sub-bands, such that a plurality of sub-band signals are provided for the plurality of rotated audio signals, respectively; wherein the parametric encoding unit is configured to determine a different set of spatial parameters for each of the plurality of sub-band signals of the second rotated audio signal. 53 . The audio encoder of claim 44 , wherein the transform determination unit is configured to determine a covariance matrix based on the plurality of audio signals of the frame of the soundfield signal; and perform an eigenvalue decomposition of the covariance matrix to provide the energy compacting transform. 54 . The audio encoder of claim 44 , further comprising a non-adaptive transform unit configured to apply a non-adaptive transform to the frame of the soundfield signal to provide a transformed soundfield signal comprising a plurality of transformed audio signals; wherein the transform determination unit is configured to determine the energy-compacting orthogonal transform based on the transformed soundfield signal. 55 . The audio encoder of claim 44 , wherein the soundfield signal comprises at least three audio signals which are indicative at least of an azimuth distribution of talkers around a terminal of a teleconferencing system; the parametric encoding unit configured to determine a further set of spatial parameters for determining a third rotated audio signal of the plurality of rotated audio signals based on the first rotated audio signal. 56 . The audio encoder of claim 54 , wherein the audio encoder comprises a multi-channel encoding unit configured to waveform encode one or more sub-bands of the plurality of rotated audio signals; the encoder is configured to provide a start band; one or more sub-bands of the plurality of rotated audio signals below the start band are encoded using the multi-channel encoding unit; and one or more sub-bands of the plurality of rotated audio signals at or above the start band are encoded using the waveform encoding unit and the parametric encoding unit. 57 . The audio encoder of claim 44 , wherein the waveform encoding unit is configured to encode the first rotated audio signal into a down-mix bit-stream to be provided to a corresponding decoder. 58 . An audio decoder configured to provide a frame of a reconstructed soundfield signal comprising a plurality of reconstructed audio signals, from a spatial bit-stream and from a down-mix bit-stream; the decoder comprising a waveform decoding unit configured to determine from the down-mix bit-stream a first reconstructed rotated audio signal of a plurality of reconstructed rotated audio signals; a parametric decoding unit configured to extract a set of spatial parameters from the spatial bit-stream; and determine a second reconstructed rotated audio signal of the plurality of reconstructed rotated audio signals, based on the set of spatial parameters and based on the first reconstructed rotated audio signal; a transform decoding unit configured to extract a set of transform parameters indicative of an energy-compacting orthogonal transform which has been determined by a corresponding encoder based on a corresponding frame of a soundfield signal which is to be reconstructed; and an inverse transform unit configured to apply the inverse of the energy-compacting orthogonal transform to the plurality of reconstructed rotated audio signals to yield an inverse transformed soundfield signal; wherein the reconstructed soundfield signal is determined based on the inverse transformed soundfield signal. 59 . The decoder of claim 58 , wherein the set of spatial parameters comprises a second energy adjustment gain; the parametric decoding unit is configured to determine a second decorrelated signal based on the first reconstructed rotated audio signal; and the parametric decoding unit is configured to determine a decorrelated component of the second reconstructed r
using subband decomposition · CPC title
using orthogonal transformation · CPC title
Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title
Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.