Multichannel audio signal processing method and device
US-2017134873-A1 · May 11, 2017 · US
US10645515B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-10645515-B2 |
| Application number | US-201916357180-A |
| Country | US |
| Kind code | B2 |
| Filing date | Mar 18, 2019 |
| Priority date | Jul 1, 2014 |
| Publication date | May 5, 2020 |
| Grant date | May 5, 2020 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Disclosed are a multi-channel audio signal processing method and a multi-channel audio signal processing apparatus. The multi-channel audio signal processing method may generate N channel output signals from N/2 channel downmix signals based on an N-N/2-N structure.
Opening claim text (preview).
What is claimed is: 1. A method of processing a multi-channel audio signal, the method comprising: identifying a residual signal and N/2 channel downmix signals; applying the residual signal and N/2 channel downmix signals into a pre-decorrelator matrix of a N-N/2-N structure defined based on bsTreeConfig; applying an output result of the pre-decorrelator matrix into mix matrix of the N-N/2-N structure; outputting a N channel output signal as an output result of the mix matrix, wherein the number of OTT box of the N-N/2-N structure is same as the number of a channel for the N/2 channel downmix signals. 2. The method of claim 1 , wherein the N/2 decorrelators correspond to the N/2 OTT boxes, when a Low Frequency Enhancement (LFE) channel is not included in the N channel output signals. 3. The method of claim 1 , wherein indices of the decorrelators are repeatedly reused based on the reference value, when the number of decorrelators exceeds a reference value of a modulo operation. 4. The method of claim 1 , wherein, when an LFE channel is included in the N channel output signals, the decorrelators corresponding to the remaining number excluding the number of LFE channels from N/2 are used, and the LTE channel does not use an OTT box decorrelator. 5. The method of claim 1 , wherein, when a temporal shaping tool is not used, a single vector including the second signal, the decorrelated signal derived from the decorrelator, and the residual signal derived from the decorrelator is input to the second matrix. 6. The method of claim 1 , wherein, when a temporal shaping tool is used, a vector corresponding to a direct signal including the second signal and the residual signal derived from the decorrelator and a vector corresponding to a diffuse signal including the decorrelated signal derived from the decorrelator are input to the second matrix. 7. The method of claim 6 , wherein the generating of the N channel output signals comprises shaping a temporal envelope of an output signal by applying a scale factor based on the diffuse signal and the direct signal to a diffuse signal portion of the output signal, when a Subband Domain Time Processing (STP) is used. 8. The method of claim 6 , wherein the generating of the N channel output signals comprises flattening and reshaping an envelope corresponding to a direct signal portion for each channel of N channel output signals when a Guided Envelope Shaping (GES) is used. 9. The method of claim 1 , wherein a size of the first matrix is determined based on the number of downmix signal channels and the number of decorrelators to which the first matrix is to be applied, and an element of the first matrix is determined based on a Channel Level Difference (CLD) parameter or a Channel Prediction Coefficient (CPC) parameter. 10. An apparatus for processing a multi-channel audio signal, the apparatus comprising: one or more processor configured to: identify a residual signal and N/2 channel downmix signals generated from N channel input signals; generate a first signal by applying the residual signal and N/2 channel downmix signals into a pre-decorrelator matrix; generate a second signal by applying the residual signal and N/2 channel downmix signals into the pre-decorrelator matrix, output a N channel output signal by applying the first signal and second signal into mix matrix, wherein the first signal is decorrelated based on N/2 decorrelators, and the second signal is not decorrelated based on the N/2 decorrelators. 11. The apparatus of claim 10 , wherein the N/2 decorrelators correspond to the N/2 OTT boxes, when a Low Frequency Enhancement (LFE) channel is not included in the N channel output signals. 12. The apparatus of claim 10 , wherein indices of the decorrelators are repeatedly reused based on the reference value, when the number of decorrelators exceeds a reference value of a modulo operation. 13. The apparatus of claim 10 , wherein, when an LFE channel is included in the N channel output signals, the decorrelators corresponding to the remaining number excluding the number of LFE channels from N/2 are used, and the LTE channel does not use an OTT box decorrelator. 14. The apparatus of claim 10 , wherein, when a temporal shaping tool is not used, a single vector including the second signal, the decorrelated signal derived from the decorrelator, and the residual signal derived from the decorrelator is input to the second matrix. 15. The apparatus of claim 10 , wherein, when a temporal shaping tool is used, a vector corresponding to a direct signal including the second signal and the residual signal derived from the decorrelator and a vector corresponding to a diffuse signal including the decorrelated signal derived from the decorrelator are input to the second matrix. 16. The apparatus of claim 15 , wherein the processor is configured to perform shaping a temporal envelope of an output signal by applying a scale factor based on the diffuse signal and the direct signal to a diffuse signal portion of the output signal, when a Subband Domain Time Processing (STP) is used. 17. The apparatus of claim 15 , wherein the processor is configured to perform flattening and reshaping an envelope corresponding to a direct signal portion for each channel of N channel output signals when a Guided Envelope Shaping (GES) is used. 18. The apparatus of claim 10 , wherein a size of the first matrix is determined based on the number of downmix signal channels and the number of decorrelators to which the first matrix is to be applied, and an element of the first matrix is determined based on a Channel Level Difference (CLD) parameter or a Channel Prediction Coefficient (CPC) parameter.
Generation or adaptation of the Low Frequency Effect [LFE] channel, e.g. distribution or signal processing · CPC title
using sound class specific coding, hybrid encoders or object based coding · CPC title
Aspects of down-mixing multi-channel audio to configurations with lower numbers of playback channels, e.g. 7.1 -> 5.1 (H04S2400/01 takes precedence) · CPC title
in which the audio signals are in digital form, i.e. employing more than two discrete digital channels (data reduction aspects thereof based on psychoacoustics G10L19/02) · CPC title
Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.