Optimal mixing matrices and usage of decorrelators in spatial audio processing

US11282485B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11282485-B2
Application numberUS-202016987264-A
CountryUS
Kind codeB2
Filing dateAug 6, 2020
Priority dateAug 17, 2011
Publication dateMar 22, 2022
Grant dateMar 22, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An apparatus for generating an audio output signal having two or more audio output channels from an audio input signal having two or more audio input channels includes a provider and a signal processor. The provider is adapted to provide first covariance properties of the audio input signal. The signal processor is adapted to generate the audio output signal by applying a mixing rule on at least two of the two or more audio input channels. The signal processor is configured to determine the mixing rule based on the first covariance properties of the audio input signal and based on second covariance properties of the audio output signal, the second covariance properties being different from the first covariance properties.

First claim

Opening claim text (preview).

The invention claimed is: 1. An apparatus for generating an audio output signal comprising two or more audio output channels from an audio input signal comprising two or more audio input channels, comprising: a provider for providing first covariance properties of the audio input signal, and a signal processor for generating the audio output signal by applying a mixing rule on at least two of the two or more audio input channels, wherein the signal processor is configured to determine the mixing rule based on the first covariance properties of the audio input signal and based on second covariance properties of the audio output signal, the second covariance properties being different from the first covariance properties. 2. The apparatus according to claim 1 , wherein the provider is adapted to provide the first covariance properties, wherein the first covariance properties comprise a first state for a first time-frequency bin, and wherein the first covariance properties comprise a second state, being different from the first state, for a second time-frequency bin, being different from the first time-frequency bin. 3. The apparatus according to claim 1 , wherein the signal processor is adapted to determine the mixing rule based on the second covariance properties, wherein the second covariance properties comprise a third state for a third time-frequency bin, and wherein the second covariance properties comprise a fourth state, being different from the third state for a fourth time-frequency bin, being different from the third time-frequency bin. 4. The apparatus according to claim 1 , wherein the signal processor is adapted to generate the audio output signal by applying the mixing rule such that each one of the two or more audio output channels depends on each one of the two or more audio input channels. 5. The apparatus according to claim 1 , wherein the signal processor is adapted to determine the mixing rule such that an error measure is minimized. 6. The apparatus according to claim 5 , wherein the signal processor is adapted to determine the mixing rule such that the mixing rule depends on ∥ y ref −y∥ 2 , wherein y ref =Qx, wherein x is the audio input signal, wherein Q is a mapping matrix, and wherein y is the audio output signal. 7. The apparatus according to claim 1 , wherein the signal processor is configured to determine the mixing rule by determining the second covariance properties, wherein the signal processor is configured to determine the second covariance properties based on the first covariance properties. 8. The apparatus according to claim 1 , wherein the signal processor is adapted to determine a mixing matrix as the mixing rule, wherein the signal processor is adapted to determine the mixing matrix based on the first covariance properties and based on the second covariance properties. 9. The apparatus according to claim 1 , wherein the provider is adapted to provide the first covariance properties by determining a first covariance matrix of the audio input signal, and wherein the signal processor is configured to determine the mixing rule based on a second covariance matrix of the audio output signal as the second covariance properties. 10. The apparatus according to claim 9 , wherein the provider is adapted to determine the first covariance matrix, such that each diagonal value of the first covariance matrix indicates an energy of one of the audio input channels, and such that each value of the first covariance matrix, which is not a diagonal value indicates an inter-channel correlation between a first audio input channel and a different second audio input channel. 11. The apparatus according to claim 9 , wherein the signal processor is configured to determine the mixing rule based on the second covariance matrix, wherein each diagonal value of the second covariance matrix indicates an energy of one of the audio output channels, and wherein each value of the second covariance matrix, which is not a diagonal value, indicates an inter-channel correlation between a first audio output channel and a second audio output channel. 12. The apparatus according to claim 1 , wherein the signal processor is adapted to determine a mixing matrix as the mixing rule, wherein the signal processor is adapted to determine the mixing matrix based on the first covariance properties and based on the second covariance properties, wherein the provider is adapted provide the first covariance properties by determining a first covariance matrix of the audio input signal, and wherein the signal processor is configured to determine the mixing rule based on a second covariance matrix of the audio output signal as the second covariance properties, wherein the signal processor is adapted to determine the mixing matrix such that: M=K y PK x −1 , such that, K x K x T =C x , K y K y T =C y wherein M is the mixing matrix, wherein C x is the first covariance matrix, wherein C y is the second covariance matrix, wherein K x T is a first transposed matrix of a first decomposed matrix K x , wherein K y T is a second transposed matrix of a second decomposed matrix K y , wherein K x −1 is an inverse matrix of the first decomposed matrix K x , and wherein P is a first unitary matrix. 13. The apparatus according to claim 12 , wherein the signal processor is adapted to determine the mixing matrix such that M=K y PK x −1 , wherein P=VΛU T , wherein U T is a third transposed matrix of a second unitary matrix U, wherein V is a third unitary matrix, wherein Λ is an identity matrix appended with zeros, wherein USV T =K x T Q x T K y , wherein Q T is a fourth transposed matrix of the mapping matrix Q, wherein V T is a fifth transposed matrix of the third unitary matrix V, and wherein S is a diagonal matrix. 14. The apparatus according to claim 1 , wherein the signal processor is adapted to determine a mixing matrix as the mixing rule, wherein the signal processor is adapted to determine the mixing matrix based on the first covariance properties and based on the second covariance properties, wherein the provider is adapted to provide the first covariance properties by determining a first covariance matrix of the audio input signal, and wherein the signal processor is configured to determine the mixing rule based on a second covariance matrix of the audio output signal as the second covariance properties, wherein the signal processor is adapted to determine the mixing rule by modifying at least some diagonal values of a diagonal matrix S x when the values of the diagonal matrix S x are zero or smaller than a threshold value, such that the values are greater than or equal to the threshold value, wherein the diagonal matrix depends on the first covariance matrix. 15. The apparatus according to claim 14 , wherein the signal processor is configured to modify the at least some diagonal values of the diagonal matrix S x , wherein K x =U x S x V x T , and wherein C x =K x K x T , wherein C x is the first covariance matrix, wherein S x is the diagonal matrix, wherein U, is a second matrix, V x T is a third transposed matrix, and wherein K x T is a fourth transposed matrix of the fifth matrix K x , and wherein V x and U x are unitary matrices. 16. The apparatus according to claim 14 , wherein the signal processor is adapted to generate the audio output signal by applying the mixing matrix on at least two of the two or more audio input channels to acquire an intermediate signa

Assignees

Inventors

Classifications

  • G10L19/008Primary

    Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title

  • Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis (in musical instruments G10H) · CPC title

  • G10H1/183Primary

    Channel-assigning means for polyphonic instruments · CPC title

  • Lossless audio signal coding; Perfect reconstruction of coded audio signal by transmission of coding error (G10L19/24 takes precedence) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11282485B2 cover?
An apparatus for generating an audio output signal having two or more audio output channels from an audio input signal having two or more audio input channels includes a provider and a signal processor. The provider is adapted to provide first covariance properties of the audio input signal. The signal processor is adapted to generate the audio output signal by applying a mixing rule on at leas…
Who is the assignee on this patent?
Fraunhofer Ges Forschung
What technology area does this patent fall under?
Primary CPC classification G10L19/008. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Mar 22 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).