Multi-channel audio processing

US9584235B2 · US · B2

Patent metadata
Field | Value
Publication number | US-9584235-B2
Application number | US-200913516362-A
Country | US
Kind code | B2
Filing date | Dec 16, 2009
Priority date | Dec 16, 2009
Publication date | Feb 28, 2017
Grant date | Feb 28, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method including: receiving at least a first input audio channel and a second input audio channel; and using an inter-channel prediction model to form at least an inter-channel direction of reception parameter.

First claim

Opening claim text (preview).

I claim:

1. A method comprising: receiving a first input audio channel and a second input audio channel that jointly represent a spatial audio image; determining a first metric as a prediction gain of an inter-channel prediction model that predicts the first input audio channel based at least in part on the second audio input channel, wherein the prediction model is one of an autoregressive model, a moving average model, and an autoregressive moving average model and a second metric as a prediction gain of an inter-channel prediction model that predicts the second input audio channel based at least in part on the first audio input channel, wherein the prediction model is one of an autoregressive model, a moving average model, and an autoregressive moving average model, wherein determining the first metric comprises computing the respective prediction gain as the ratio between energy of the predicted first input audio channel and the energy of a prediction error signal determined as the difference between the first input audio channel and the predicted first input audio channel, and wherein determining the second metric comprises computing the respective prediction gain as the ratio between energy of the predicted second input audio channel and the energy of a prediction error signal determined as the difference between the second input audio channel and the predicted second input audio channel; computing a comparison value that compares the first metric and the second metric; and computing at least one inter-channel direction of reception parameter based on the comparison value.

2. A method as claimed in claim 1, further comprising providing an output signal comprising a downmixed signal and the at least one inter-channel direction of reception parameter.

3. A method as claimed in claim 1, further comprising: using the first metric as an operand of a slowly varying function to obtain a modified first metric; using the second metric as an operand of the same slowly varying function to obtain a modified second metric; determining as the comparison value, a difference between the modified first metric and the modified second metric.

4. A method as claimed in claim 3, wherein the comparison value is a difference between a logarithm of the first metric and the logarithm of the second metric.

5. A method as claimed in claim 1, further comprising: mapping the inter-channel direction of reception parameter to the comparison value using a mapping function calibrated from the obtained comparison value and an associated inter-channel direction of reception parameter.

6. A method as claimed in claim 5, wherein the associated inter-channel direction of reception parameter is determined using at least one of an absolute inter-channel time difference parameter and an absolute inter-channel level difference parameter.

7. A method as claimed in claim 5, further comprising recalibrating the mapping function intermittently.

8. A method as claimed in claim 5, wherein the mapping function is a function of time and sub band and is determined using available obtained comparison values and associated inter-channel direction of reception parameters.

9. A method as claimed in claim 1, wherein the inter-channel prediction model represents a predicted sample of an audio channel in terms of a different audio channel.

10. A method as claimed in claim 9, further comprising minimizing a cost function for the predicted sample to determine a inter-channel prediction model and using the determined inter-channel prediction model to determine at least one inter-channel parameter.

11. A method as claimed in claim 1, further comprising segmenting at least the first input audio channel and second input audio channel in the time slots in the time domain and sub bands in the frequency domain and using an inter-channel prediction model to form an inter-channel direction of reception parameter for each of a plurality of sub bands.

12. A method as claimed in claim 1 further comprising using at least one selection criterion for selecting an inter-channel prediction model for use, wherein the at least one selection criterion is based upon a performance measure of the inter-channel prediction model.

13. A method as claimed in claim 12, wherein the performance measure is prediction gain.

14. A method as claimed in claim 1 comprising selecting an inter-channel prediction model for use from a plurality of inter-channel prediction models.

15. A non-transitory computer readable medium storing a program of instructions, execution of which by at least on processor configures an apparatus to perform the method of claim 1.

16. A non-transitory computer readable medium storing a program of instructions, execution of which by at least on processor configures an apparatus to at least: receive a first input audio channel and a second input audio channel that jointly represent a spatial audio image; determine a first metric as a prediction gain of an inter-channel prediction model that predicts the first input audio channel based at least in part on the second audio input channel, wherein the prediction model is one of an autoregressive model, a moving average model, and an autoregressive moving average model, and a second metric as a prediction gain of an inter-channel prediction model that predicts the second input audio channel based at least in part on the first audio input channel, wherein the prediction model is one of an autoregressive model, a moving average model, and an autoregressive moving average model, wherein determining the first metric comprises computing the respective prediction gain as the ratio between energy of the predicted first input audio channel and the energy of a prediction error signal determined as the difference between the first input audio channel and the predicted first input audio channel, and wherein determining the second metric comprises computing the respective prediction gain as the ratio between energy of the predicted second input audio channel and the energy of a prediction error signal determined as the difference between the second input audio channel and the predicted second input audio channel; compute a comparison value that compares the first metric and the second metric; and compute at least one inter-channel direction of reception parameter based on the comparison value.

17. A non-transitory computer readable medium as claimed in claim 16, wherein the apparatus is further configured to: use the first metric as an operand of a slowly varying function to obtain a modified first metric; use the second metric as an operand of the same slowly varying function to obtain a modified second metric; and determine as the comparison value, a difference between the modified first metric and the modified second metric.

18. A non-transitory computer readable medium as claimed in claim 16, wherein the comparison value is a difference between a logarithm of the first metric and the logarithm of the second metric.

19. An apparatus comprising: at least one processor; memory storing a program of instructions; wherein the memory storing the program of instructions is configured to, with the at least one processor, cause the apparatus to at least: receive a first input audio channel and a second input audio channel that jointly represent a spatial audio image; determine a first metric as a prediction gain of an inter-channel prediction model that predicts the first input audio channel based at least in part on the second audio input chann
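
The computation recited in claims 1 and 4 can be illustrated with a short Python sketch. This is a minimal illustration under stated assumptions, not the patented implementation: the function names (solve_linear, prediction_gain, comparison_value) are hypothetical, the predictor is assumed to be a causal moving-average (FIR) model of order 4 fitted by least squares, and a tiny diagonal term is added so the fit does not fail on silent input. The sketch stops at the comparison value, because the mapping to a direction of reception parameter (claims 5 to 8) depends on a calibration that this page does not detail.

```python
import math
import random


def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - tail) / m[r][r]
    return x


def prediction_gain(target, source, order=4):
    """Energy of the predicted channel divided by energy of the prediction error.

    Assumption: a causal moving-average (FIR) predictor,
    target[t] ~ sum_k b[k] * source[t - k], fitted by least squares.
    """
    times = range(order - 1, len(target))
    # Normal equations R b = p. The tiny diagonal term is an assumption that
    # keeps the system solvable when the source channel is silent.
    R = [[sum(source[t - i] * source[t - j] for t in times)
          + (1e-9 if i == j else 0.0)
          for j in range(order)] for i in range(order)]
    p = [sum(source[t - i] * target[t] for t in times) for i in range(order)]
    b = solve_linear(R, p)
    predicted = [sum(b[k] * source[t - k] for k in range(order)) for t in times]
    error = [target[t] - y for t, y in zip(times, predicted)]
    predicted_energy = sum(y * y for y in predicted)
    error_energy = sum(e * e for e in error)
    return predicted_energy / max(error_energy, 1e-12)


def comparison_value(first, second, order=4):
    """Log of the first metric minus log of the second metric (as in claim 4)."""
    g1 = prediction_gain(first, second, order)  # first predicted from second
    g2 = prediction_gain(second, first, order)  # second predicted from first
    return math.log10(max(g1, 1e-12)) - math.log10(max(g2, 1e-12))


if __name__ == "__main__":
    rng = random.Random(0)
    first = [rng.gauss(0.0, 1.0) for _ in range(400)]
    # Synthetic second channel: delayed, attenuated copy of the first plus weak noise.
    second = [0.0, 0.0] + [0.5 * v for v in first[:-2]]
    second = [v + 0.01 * rng.gauss(0.0, 1.0) for v in second]
    print(comparison_value(first, second))
```

In this synthetic case the second channel lags the first, so a causal predictor models the second channel well from the first but not the other way round, and the comparison value comes out negative. Swapping the two channels flips the sign, which is the asymmetry the claims rely on.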

Assignees

Inventors

Classifications

  • in which the audio signals are in digital form, i.e. employing more than two discrete digital channels (data reduction aspects thereof based on psychoacoustics G10L19/02) · CPC title

  • Application of parametric coding in stereophonic audio systems · CPC title

  • H04H40/36 · Primary

    specially adapted for stereophonic broadcast receiving · CPC title

  • G10L19/008 · Primary

    Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title

  • Microphone arrays; Beamforming · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9584235B2 cover?
A method including: receiving at least a first input audio channel and a second input audio channel; and using an inter-channel prediction model to form at least an inter-channel direction of reception parameter.
Who is the assignee on this patent?
Ojala Pasi, Nokia Technologies Oy
What technology area does this patent fall under?
Primary CPC classification H04H40/36. Mapped technology areas include Electricity.
When was this patent published?
Publication date Feb 28, 2017 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).