Who is the assignee on this patent?

Ojala Pasi, Nokia Technologies Oy

What technology area does this patent fall under?

Primary CPC classification H04H40/36. Mapped technology areas include Electricity.

When was this patent published?

Publication date Tue Feb 28 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Multi-channel audio processing

US9584235B2 · US · B2

Patent metadata
Field	Value
Publication number	US-9584235-B2
Application number	US-200913516362-A
Country	US
Kind code	B2
Filing date	Dec 16, 2009
Priority date	Dec 16, 2009
Publication date	Feb 28, 2017
Grant date	Feb 28, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method including: receiving at least a first input audio channel and a second input audio channel; and using an inter-channel prediction model to form at least an inter-channel direction of reception parameter.

First claim

Opening claim text (preview).

I claim: 1. A method comprising: receiving a first input audio channel and a second input audio channel that jointly represent a spatial audio image; determining a first metric as a prediction gain of an inter-channel prediction model that predicts the first input audio channel based at least in part on the second audio input channel, wherein the prediction model is one of an autoregressive model, a moving average model, and an autoregressive moving average model and a second metric as a prediction gain of an inter-channel prediction model that predicts the second input audio channel based at least in part on the first audio input channel, wherein the prediction model is one of an autoregressive model, a moving average model, and an autoregressive moving average model, wherein determining the first metric comprises computing the respective prediction gain as the ratio between energy of the predicted first input audio channel and the energy of a prediction error signal determined as the difference between the first input audio channel and the predicted first input audio channel, and wherein determining the second metric comprises computing the respective prediction gain as the ratio between energy of the predicted second input audio channel and the energy of a prediction error signal determined as the difference between the second input audio channel and the predicted second input audio channel; computing a comparison value that compares the first metric and the second metric; and computing at least one inter-channel direction of reception parameter based on the comparison value. 2. A method as claimed in claim 1 , further comprising providing an output signal comprising a downmixed signal and the at least one inter-channel direction of reception parameter. 3. A method as claimed in claim 1 , further comprising: using the first metric as an operand of a slowly varying function to obtain a modified first metric; using the second metric as an operand of the same slowly varying function to obtain a modified second metric; determining as the comparison value, a difference between the modified first metric and the modified second metric. 4. A method as claimed in claim 3 , wherein the comparison value is a difference between a logarithm of the first metric and the logarithm of the second metric. 5. A method as claimed in claim 1 , further comprising: mapping the inter-channel direction of reception parameter to the comparison value using a mapping function calibrated from the obtained comparison value and an associated inter-channel direction of reception parameter. 6. A method as claimed in claim 5 , wherein the associated inter-channel direction of reception parameter is determined using at least one of an absolute inter-channel time difference parameter and an absolute inter-channel level difference parameter. 7. A method as claimed in claim 5 , further comprising recalibrating the mapping function intermittently. 8. A method as claimed in claim 5 , wherein the mapping function is a function of time and sub band and is determined using available obtained comparison values and associated inter-channel direction of reception parameters. 9. A method as claimed in claim 1 , wherein the inter-channel prediction model represents a predicted sample of an audio channel in terms of a different audio channel. 10. A method as claimed in claim 9 , further comprising minimizing a cost function for the predicted sample to determine a inter-channel prediction model and using the determined inter-channel prediction model to determine at least one inter-channel parameter. 11. A method as claimed in claim 1 , further comprising segmenting at least the first input audio channel and second input audio channel in the time slots in the time domain and sub bands in the frequency domain and using an inter-channel prediction model to form an inter-channel direction of reception parameter for each of a plurality of sub bands. 12. A method as claimed in claim 1 further comprising using at least one selection criterion for selecting an inter-channel prediction model for use, wherein the at least one selection criterion is based upon a performance measure of the inter-channel prediction model. 13. A method as claimed in claim 12 , wherein the performance measure is prediction gain. 14. A method as claimed in claim 1 comprising selecting an inter-channel prediction model for use from a plurality of inter-channel prediction models. 15. A non-transitory computer readable medium storing a program of instructions, execution of which by at least on processor configures an apparatus to perform the method of claim 1 . 16. A non-transitory computer readable medium storing a program of instructions, execution of which by at least on processor configures an apparatus to at least: receive a first input audio channel and a second input audio channel that jointly represent a spatial audio image; determine a first metric as a prediction gain of an inter-channel prediction model that predicts the first input audio channel based at least in part on the second audio input channel, wherein the prediction model is one of an autoregressive model, a moving average model, and an autoregressive moving average model, and a second metric as a prediction gain of an inter-channel prediction model that predicts the second input audio channel based at least in part on the first audio input channel, wherein the prediction model is one of an autoregressive model, a moving average model, and an autoregressive moving average model, wherein determining the first metric comprises computing the respective prediction gain as the ratio between energy of the predicted first input audio channel and the energy of a prediction error signal determined as the difference between the first input audio channel and the predicted first input audio channel, and wherein determining the second metric comprises computing the respective prediction gain as the ratio between energy of the predicted second input audio channel and the energy of a prediction error signal determined as the difference between the second input audio channel and the predicted second input audio channel; compute a comparison value that compares the first metric and the second metric; and compute at least one inter-channel direction of reception parameter based on the comparison value. 17. A non-transitory computer readable medium as claimed in claim 16 , wherein the apparatus is further configured to: use the first metric as an operand of a slowly varying function to obtain a modified first metric; use the second metric as an operand of the same slowly varying function to obtain a modified second metric; and determine as the comparison value, a difference between the modified first metric and the modified second metric. 18. A non-transitory computer readable medium as claimed in claim 16 , wherein the comparison value is a difference between a logarithm of the first metric and the logarithm of the second metric. 19. An apparatus comprising: at least one processor; memory storing a program of instructions; wherein the memory storing the program of instructions is configured to, with the at least one processor, cause the apparatus to at least: receive a first input audio channel and a second input audio channel that jointly represent a spatial audio image; determine a first metric as a prediction gain of an inter-channel prediction model that predicts the first input audio channel based at least in part on the second audio input chann

Assignees

Inventors

Ojala Pasi

Classifications

H04S3/008
in which the audio signals are in digital form, i.e. employing more than two discrete digital channels (data reduction aspects thereof based on psychoacoustics G10L19/02) · CPC title
H04S2420/03
Application of parametric coding in stereophonic audio systems · CPC title
H04H40/36Primary
specially adapted for stereophonic broadcast receiving · CPC title
G10L19/008Primary
Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title
G10L2021/02166
Microphone arrays; Beamforming · CPC title

Patent family

Related publications grouped by family.

View patent family 42144823

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9584235B2 cover?: A method including: receiving at least a first input audio channel and a second input audio channel; and using an inter-channel prediction model to form at least an inter-channel direction of reception parameter.
Who is the assignee on this patent?: Ojala Pasi, Nokia Technologies Oy
What technology area does this patent fall under?: Primary CPC classification H04H40/36. Mapped technology areas include Electricity.
When was this patent published?: Publication date Tue Feb 28 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).