Multichannel audio signal processing method and device

US10645515B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10645515-B2
Application numberUS-201916357180-A
CountryUS
Kind codeB2
Filing dateMar 18, 2019
Priority dateJul 1, 2014
Publication dateMay 5, 2020
Grant dateMay 5, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Disclosed are a multi-channel audio signal processing method and a multi-channel audio signal processing apparatus. The multi-channel audio signal processing method may generate N channel output signals from N/2 channel downmix signals based on an N-N/2-N structure.

First claim

Opening claim text (preview).

What is claimed is: 1. A method of processing a multi-channel audio signal, the method comprising: identifying a residual signal and N/2 channel downmix signals; applying the residual signal and N/2 channel downmix signals into a pre-decorrelator matrix of a N-N/2-N structure defined based on bsTreeConfig; applying an output result of the pre-decorrelator matrix into mix matrix of the N-N/2-N structure; outputting a N channel output signal as an output result of the mix matrix, wherein the number of OTT box of the N-N/2-N structure is same as the number of a channel for the N/2 channel downmix signals. 2. The method of claim 1 , wherein the N/2 decorrelators correspond to the N/2 OTT boxes, when a Low Frequency Enhancement (LFE) channel is not included in the N channel output signals. 3. The method of claim 1 , wherein indices of the decorrelators are repeatedly reused based on the reference value, when the number of decorrelators exceeds a reference value of a modulo operation. 4. The method of claim 1 , wherein, when an LFE channel is included in the N channel output signals, the decorrelators corresponding to the remaining number excluding the number of LFE channels from N/2 are used, and the LTE channel does not use an OTT box decorrelator. 5. The method of claim 1 , wherein, when a temporal shaping tool is not used, a single vector including the second signal, the decorrelated signal derived from the decorrelator, and the residual signal derived from the decorrelator is input to the second matrix. 6. The method of claim 1 , wherein, when a temporal shaping tool is used, a vector corresponding to a direct signal including the second signal and the residual signal derived from the decorrelator and a vector corresponding to a diffuse signal including the decorrelated signal derived from the decorrelator are input to the second matrix. 7. The method of claim 6 , wherein the generating of the N channel output signals comprises shaping a temporal envelope of an output signal by applying a scale factor based on the diffuse signal and the direct signal to a diffuse signal portion of the output signal, when a Subband Domain Time Processing (STP) is used. 8. The method of claim 6 , wherein the generating of the N channel output signals comprises flattening and reshaping an envelope corresponding to a direct signal portion for each channel of N channel output signals when a Guided Envelope Shaping (GES) is used. 9. The method of claim 1 , wherein a size of the first matrix is determined based on the number of downmix signal channels and the number of decorrelators to which the first matrix is to be applied, and an element of the first matrix is determined based on a Channel Level Difference (CLD) parameter or a Channel Prediction Coefficient (CPC) parameter. 10. An apparatus for processing a multi-channel audio signal, the apparatus comprising: one or more processor configured to: identify a residual signal and N/2 channel downmix signals generated from N channel input signals; generate a first signal by applying the residual signal and N/2 channel downmix signals into a pre-decorrelator matrix; generate a second signal by applying the residual signal and N/2 channel downmix signals into the pre-decorrelator matrix, output a N channel output signal by applying the first signal and second signal into mix matrix, wherein the first signal is decorrelated based on N/2 decorrelators, and the second signal is not decorrelated based on the N/2 decorrelators. 11. The apparatus of claim 10 , wherein the N/2 decorrelators correspond to the N/2 OTT boxes, when a Low Frequency Enhancement (LFE) channel is not included in the N channel output signals. 12. The apparatus of claim 10 , wherein indices of the decorrelators are repeatedly reused based on the reference value, when the number of decorrelators exceeds a reference value of a modulo operation. 13. The apparatus of claim 10 , wherein, when an LFE channel is included in the N channel output signals, the decorrelators corresponding to the remaining number excluding the number of LFE channels from N/2 are used, and the LTE channel does not use an OTT box decorrelator. 14. The apparatus of claim 10 , wherein, when a temporal shaping tool is not used, a single vector including the second signal, the decorrelated signal derived from the decorrelator, and the residual signal derived from the decorrelator is input to the second matrix. 15. The apparatus of claim 10 , wherein, when a temporal shaping tool is used, a vector corresponding to a direct signal including the second signal and the residual signal derived from the decorrelator and a vector corresponding to a diffuse signal including the decorrelated signal derived from the decorrelator are input to the second matrix. 16. The apparatus of claim 15 , wherein the processor is configured to perform shaping a temporal envelope of an output signal by applying a scale factor based on the diffuse signal and the direct signal to a diffuse signal portion of the output signal, when a Subband Domain Time Processing (STP) is used. 17. The apparatus of claim 15 , wherein the processor is configured to perform flattening and reshaping an envelope corresponding to a direct signal portion for each channel of N channel output signals when a Guided Envelope Shaping (GES) is used. 18. The apparatus of claim 10 , wherein a size of the first matrix is determined based on the number of downmix signal channels and the number of decorrelators to which the first matrix is to be applied, and an element of the first matrix is determined based on a Channel Level Difference (CLD) parameter or a Channel Prediction Coefficient (CPC) parameter.

Assignees

Inventors

Classifications

  • Generation or adaptation of the Low Frequency Effect [LFE] channel, e.g. distribution or signal processing · CPC title

  • using sound class specific coding, hybrid encoders or object based coding · CPC title

  • Aspects of down-mixing multi-channel audio to configurations with lower numbers of playback channels, e.g. 7.1 -> 5.1 (H04S2400/01 takes precedence) · CPC title

  • H04S3/008Primary

    in which the audio signals are in digital form, i.e. employing more than two discrete digital channels (data reduction aspects thereof based on psychoacoustics G10L19/02) · CPC title

  • G10L19/008Primary

    Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10645515B2 cover?
Disclosed are a multi-channel audio signal processing method and a multi-channel audio signal processing apparatus. The multi-channel audio signal processing method may generate N channel output signals from N/2 channel downmix signals based on an N-N/2-N structure.
Who is the assignee on this patent?
Electronics & Telecommunications Res Inst
What technology area does this patent fall under?
Primary CPC classification H04S3/008. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue May 05 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).