Apparatus and method for merging geometry-based spatial audio coding streams

US9484038B2 · US · B2

Patent metadata
Publication number: US-9484038-B2
Application number: US-201213445585-A
Country: US
Kind code: B2
Filing date: Apr 12, 2012
Priority date: Dec 2, 2011
Publication date: Nov 1, 2016
Grant date: Nov 1, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An apparatus for generating a merged audio data stream is provided. The apparatus includes a demultiplexer for obtaining a plurality of single-layer audio data streams, wherein each input audio data stream includes one or more layers, wherein the demultiplexer is adapted to demultiplex each one of one or more input audio data streams having one or more layers into two or more demultiplexed audio data streams having exactly one layer. Furthermore, the apparatus includes a merging module for generating the merged audio data stream based on the plurality of single-layer audio data streams. Each layer of the input data audio streams, of the demultiplexed audio data streams, of the single-layer data streams and of the merged audio data stream includes a pressure value of a pressure signal, a position value and a diffuseness value as audio data.
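The layer structure and the demultiplexing step described in the abstract can be pictured with a small sketch. This is only an illustration under stated assumptions, not the patented implementation: the names `Layer`, `Stream`, and `demultiplex` are hypothetical, positions are taken to be 2D coordinates, diffuseness is assumed to lie in [0, 1], and a stream is modelled for a single time-frequency bin.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Layer:
    """One layer of a stream (hypothetical model, one time-frequency bin)."""
    pressure: complex              # pressure value of the pressure signal
    position: Tuple[float, float]  # position value of a sound source (2D is an assumption)
    diffuseness: float             # diffuseness value, assumed to lie in [0, 1]


# A stream is modelled here as the list of its layers.
Stream = List[Layer]


def demultiplex(streams: List[Stream]) -> List[Stream]:
    """Split every input stream into streams that hold exactly one layer each.

    Together, the output streams contain all layers of the inputs.
    """
    return [[layer] for stream in streams for layer in stream]
```

For example, one two-layer stream and one single-layer stream yield three single-layer streams, which is the form the merging module is described as consuming.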

First claim

Opening claim text (preview).

The invention claimed is:

1. An apparatus for generating a merged audio data stream, wherein the apparatus is implemented using a hardware apparatus or a computer, wherein the apparatus comprises: a demultiplexer for acquiring a plurality of single-layer audio data streams, wherein the demultiplexer is adapted to receive one or more input audio data streams, wherein each input audio data stream comprises one or more layers, wherein the demultiplexer is adapted to demultiplex each one of the input audio data streams comprising one or more layers into two or more demultiplexed audio data streams comprising exactly one layer, such that the two or more demultiplexed audio data streams together comprise the one or more layers of the input audio data stream, to acquire two or more of the single-layer audio data streams; and a merging module for generating the merged audio data stream, comprising one or more layers, based on the plurality of single-layer audio data streams, wherein each layer of the input data audio streams, of the demultiplexed audio data streams, of the single-layer data streams and of the merged audio data stream comprises a pressure value of a pressure signal, a position value and a diffuseness value as audio data, wherein the position value indicates a position of a sound source.

2. An apparatus according to claim 1, wherein the audio data is defined for a time-frequency bin of a plurality of time-frequency bins.

3. An apparatus according to claim 2, wherein the merging module furthermore comprises a pressure merging unit, wherein the pressure merging unit is adapted to determine a first group comprising one or more single-layer audio data streams of the plurality of single-layer audio data streams and to determine a second group comprising one or more different single-layer audio data streams of the plurality of single-layer audio data streams, wherein a cost value of each of the single-layer audio data streams of the first group is greater than a cost value of each of the single-layer audio data streams of the second group, or wherein the cost value of each of the single-layer audio data streams of the first group is smaller than the cost value of each of the single-layer audio data streams of the second group, wherein the pressure merging unit is adapted to generate the one or more pressure values of the one or more layers of the merged audio data stream, such that each pressure value of each of the single-layer audio data streams of the first group is a pressure value of one of the layers of the merged audio data stream, and such that a combination of the pressure values of the single-layer audio data streams of the second group is a pressure value of one of the layers of the merged audio data stream.

4. An apparatus according to claim 2, wherein the merging module furthermore comprises a diffuseness merging unit, wherein the diffuseness merging unit is adapted to determine a third group comprising one or more single-layer audio data streams of the plurality of single-layer audio data streams and to determine a fourth group comprising one or more different single-layer audio data streams of the plurality of single-layer audio data streams, wherein a cost value of each of the single-layer audio data streams of the third group is greater than a cost value of each of the single-layer audio data streams of the fourth group, or wherein the cost value of each of the single-layer audio data streams of the third group is smaller than the cost value of each of the single-layer audio data streams of the fourth group, wherein the diffuseness merging unit is adapted to generate the one or more diffuseness values of the one or more layers of the merged audio data stream, such that each diffuseness value of each of the single-layer audio data streams of the third group is a diffuseness value of one of the layers of the merged audio data stream, and such that a combination of the diffuseness values of the single-layer audio data streams of the fourth group is a diffuseness value of one of the layers of the merged audio data stream.

5. An apparatus according to claim 2, wherein the merging module furthermore comprises a position mixing unit, wherein the position mixing unit is adapted to determine a fifth group comprising one or more single-layer audio data streams of the plurality of single-layer audio data streams, wherein a cost value of each of the single-layer audio data streams of the fifth group is greater than a cost value of any single-layer audio data streams not comprised in the fifth group of the plurality of single-layer audio data streams, or wherein the cost value of each of the single-layer audio data streams of the fifth group is smaller than the cost value of any single-layer audio data streams not comprised in the fifth group of the plurality of single-layer audio data streams, wherein the position value unit is adapted to generate the one or more position values of the one or more layers of the merged audio data stream, such that each position value of each of the single-layer audio data streams of the fifth group is a position value of one of the layers of the merged audio data stream.

6. An apparatus according to claim 2, wherein the merging module furthermore comprises a sound scene adaption module for manipulating the position value of one or more of the single-layer audio data streams of the plurality of single-layer audio data streams.

7. An apparatus according to claim 6, wherein the sound scene adaption module is adapted to manipulate the position value of the one or more of the single-layer audio data streams of the plurality of single-layer audio data streams applying a rotation, a translation or a non-linear transformation on the position value.

8. An apparatus according to claim 1, wherein the merging module comprises a cost function module for assigning a cost value to each one of the single-layer audio data streams, and wherein the merging module is adapted to generate the merged audio data stream based on the cost values assigned to the single-layer audio data streams.

9. An apparatus according to claim 8, wherein the cost function module is adapted to assign the cost value to each one of the single-layer audio data streams depending on at least one of the pressure values or the diffuseness values of the single-layer audio data stream.

10. An apparatus according to claim 9, wherein the cost function module is adapted to assign the cost value to each audio data stream of the group of single-layer audio data streams by applying the formula: $f_i(\Psi_i, P_i) = (1 - \Psi_i) \cdot |P_i|^2$, wherein $P_i$ is the pressure value and $\Psi_i$ is the diffuseness value of the layer of an i-th audio data stream of the group of single-layer audio data streams.

11. An apparatus according to claim 1, wherein the demultiplexer is adapted to modify a magnitude of one of the pressure values of one of the demultiplexed audio data streams by multiplying the magnitude by a scalar value.

12. An apparatus according to claim 1, wherein the demultiplexer comprises a plurality of demultiplexing units, wherein each one of the demultiplexing units is configured to demultiplex one or more of the input audio data streams.

13. An apparatus according to claim 1, wherein the apparatus furthermore comprises an artificial source generator for generating an artificial data stream comprising exactly one layer, wherein the artificial source generator is adapted to receive pressure information being represented in a time domain and to receive a position information, wherein the artificial source generator is adapted to replicate t
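The cost function of claim 10 and the group-based pressure merging of claim 3 can be sketched together as follows. This is a minimal illustration under assumptions, not the patent's implementation: the function names and the `max_layers` parameter are hypothetical, the first group is taken to be the streams with the highest cost (the claim also permits the opposite ordering), and the "combination" of the second group's pressure values is assumed to be a plain sum, which the claim text does not specify.

```python
from typing import List, Tuple


def cost(diffuseness: float, pressure: complex) -> float:
    """Cost value from claim 10: f_i = (1 - psi_i) * |P_i|^2."""
    return (1.0 - diffuseness) * abs(pressure) ** 2


def merge_pressures(layers: List[Tuple[complex, float]],
                    max_layers: int) -> List[complex]:
    """Merge pressure values for one time-frequency bin.

    layers: (pressure, diffuseness) of each single-layer stream.
    max_layers: hypothetical limit on layers in the merged stream.
    """
    if max_layers < 1:
        raise ValueError("max_layers must be at least 1")
    # Rank streams from highest to lowest cost (assumed ordering).
    ranked = sorted(layers, key=lambda l: cost(l[1], l[0]), reverse=True)
    if len(ranked) <= max_layers:
        return [p for p, _ in ranked]
    first_group = ranked[:max_layers - 1]   # each pressure passes through as its own layer
    second_group = ranked[max_layers - 1:]  # pressures are combined into one layer
    merged = [p for p, _ in first_group]
    merged.append(sum(p for p, _ in second_group))  # "combination" assumed to be a sum
    return merged


layers = [(2 + 0j, 0.0), (1 + 0j, 0.5), (0.5 + 0j, 0.5), (1j, 0.9)]
print(merge_pressures(layers, max_layers=2))
```

With `max_layers=2`, the stream with the highest cost keeps its own layer and the remaining three pressure values are summed into the second layer. Diffuseness and position merging (claims 4 and 5) follow the same grouping idea but are left out here because the claims do not state how those values are combined.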

Classifications

  • based on separation criteria, e.g. independent component analysis · CPC title

  • G10L19/008 · Primary

    Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title

  • Stereophonic arrangements (stereophonic pick-ups H04R9/16, H04R11/12, H04R17/08, H04R19/10) · CPC title

  • Application of ambisonics in stereophonic audio systems · CPC title

  • Voice signal separating · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9484038B2 cover?
An apparatus for generating a merged audio data stream is provided. The apparatus includes a demultiplexer for obtaining a plurality of single-layer audio data streams, wherein each input audio data stream includes one or more layers, wherein the demultiplexer is adapted to demultiplex each one of one or more input audio data streams having one or more layers into two or more demultiplexed audi…
Who is the assignee on this patent?
Del Galdo Giovanni, Thiergart Oliver, Herre Juergen, and 5 more
What technology area does this patent fall under?
Primary CPC classification G10L19/008. Mapped technology areas include Physics.
When was this patent published?
Publication date Nov 1, 2016 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).