Method and apparatus for determining directions of uncorrelated sound sources in a higher order ambisonics representation of a sound field

US9622008B2 · US · B2

Patent metadata
Publication number: US-9622008-B2
Application number: US-201414766739-A
Country: US
Kind code: B2
Filing date: Feb 7, 2014
Priority date: Feb 8, 2013
Publication date: Apr 11, 2017
Grant date: Apr 11, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Higher Order Ambisonics (HOA) represents three-dimensional sound. HOA provides high spatial resolution and facilitates analyzing of the sound field with respect to dominant sound sources. The invention aims to identify independent dominant sound sources constituting the sound field, and to track their temporal trajectories. Known applications are searching for all potential candidates for dominant sound source directions by looking at the directional power distribution of the original HOA representation, whereas in the invention all components which are correlated with the signals of previously found sound sources are removed. By such operation the problem of erroneously detecting many instead of only one correct sound source can be avoided in case its contributions to the sound field are highly directionally dispersed.
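The idea in the abstract (pick the strongest direction, then strip everything correlated with that source before searching again) can be illustrated with a small Python sketch. This is not the patented procedure. It assumes a first-order representation (four coefficients in ACN order with N3D scaling), a hand-picked set of six test directions, a simple beam as the "general plane wave" estimate, and a least-squares projection as the decorrelation step. All function names are hypothetical.

```python
import math

def mode_vector(u):
    """First-order real spherical harmonics (ACN order, N3D scaling)
    for a unit direction vector u = (x, y, z). Assumed convention."""
    x, y, z = u
    r3 = math.sqrt(3.0)
    return [1.0, r3 * y, r3 * z, r3 * x]

def dot(a, b):
    return sum(p * q for p, q in zip(a, b))

def beam_signal(hoa, u):
    """Steer a simple beam toward u: weighted sum of the HOA channels."""
    w = mode_vector(u)
    n_samples = len(hoa[0])
    return [sum(w[n] * hoa[n][t] for n in range(len(w)))
            for t in range(n_samples)]

def find_uncorrelated_directions(hoa, test_dirs, n_sources):
    """Greedy search: choose the test direction with the highest beam
    power in the residual, then remove from every residual channel the
    part that is correlated with that direction's signal."""
    residual = [list(ch) for ch in hoa]
    found = []
    for _ in range(n_sources):
        signals = [beam_signal(residual, u) for u in test_dirs]
        powers = [dot(s, s) for s in signals]
        best = max(range(len(test_dirs)), key=powers.__getitem__)
        if powers[best] <= 1e-12:
            break  # nothing left in the residual
        found.append(test_dirs[best])
        sig = signals[best]
        for ch in residual:
            gain = dot(ch, sig) / powers[best]  # least-squares projection
            for t in range(len(ch)):
                ch[t] -= gain * sig[t]
    return found

# Toy frame: two mutually orthogonal sine signals arriving from +x and +y.
T = 64
s1 = [math.sin(2 * math.pi * 3 * t / T) for t in range(T)]
s2 = [0.5 * math.sin(2 * math.pi * 7 * t / T) for t in range(T)]
y1, y2 = mode_vector((1, 0, 0)), mode_vector((0, 1, 0))
hoa_frame = [[y1[n] * s1[t] + y2[n] * s2[t] for t in range(T)]
             for n in range(4)]
test_dirs = [(1, 0, 0), (-1, 0, 0), (0, 1, 0),
             (0, -1, 0), (0, 0, 1), (0, 0, -1)]
found = find_uncorrelated_directions(hoa_frame, test_dirs, 2)
print(found)
```

Because the projection removes a source's contribution from every coefficient channel rather than from one direction only, a source whose energy is spread over several test directions is less likely to be reported more than once.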

First claim

Opening claim text (preview).

The invention claimed is:

1. A method for determining directions of uncorrelated sound sources in a Higher Order Ambisonics (HOA) representation of a sound field, comprising: in a current time frame of HOA coefficients, searching preliminary direction estimates of dominant sound sources; and determining HOA sound field components based on corresponding dominant sound sources, wherein a current direction estimate is determined based on a residual HOA representation which represents an original HOA representation from which all components correlated with signals of previously found sound sources have been removed, wherein the current direction estimate is selected out of a set of predefined test directions, based on a power of a related general plane wave of the residual HOA representation, impinging from a direction on a listener position, relative to respective power of all other test directions, and wherein the current direction estimate for the current time frame of HOA coefficients is assigned to at least a dominant sound source of a previous time frame of HOA coefficients and is smoothed with respect to a time trajectory.

2. The method of claim 1, wherein the smoothing is based on a Bayesian inference process that exploits a statistical a priori sound source movement model and directional power distributions of the dominant sound source components of the original HOA representation.

3. The method of claim 2, wherein the statistical a priori model statistically predicts a movement of individual sound sources based on their direction in the previous time frame and movement between the previous time frame and a penultimate time frame.

4. The method of claim 2, wherein direction estimates are assigned to dominant sound sources of the previous time frame of HOA coefficients based on a joint minimization of angles between pairs of a direction estimate and a direction of a previously found sound source, and maximization of an absolute value of a correlation coefficient between the pairs of the directional signals related to a direction estimate and to a dominant sound source found in the previous time frame of HOA coefficients.

5. A method for determining directions of uncorrelated sound sources in a Higher Order Ambisonics (HOA) representation of a sound field, comprising: in a current time frame of HOA coefficients, searching preliminary direction estimates of dominant sound sources, and determining HOA sound field components based on corresponding dominant sound sources, and determining corresponding directional signals; assigning the dominant sound sources to corresponding sound sources active in a previous time frame of the HOA coefficients based on a comparison of the preliminary direction estimates of the current time frame and smoothed directions of sound sources active in the previous time frame, wherein the assignment is further based on a correlation of directional signals of the current time frame and directional signals of sound sources active in the previous time frame, resulting in an assignment function; determining smoothed dominant source directions based on the assignment function, the smoothed dominant source directions in the previous time frame, indices of active dominant sound sources in the previous time frame, respective source movement angles between the penultimate time frame and the previous time frame, and the HOA sound field components based on the corresponding dominant sound sources; and determining indices and directions of the active dominant sound sources of the current time frame based on the smoothed dominant source directions, a frame delayed version of directions of the active dominant sound sources of the previous time frame and a frame delayed version of indices of the active dominant sound sources of the previous time frame, wherein the directional signals of sound sources active in the previous time frame are determined based on mode matching based on the frame delayed version of directions of the active dominant sound sources of the previous time frame and the HOA coefficients of the previous time frame, and wherein the source movement angles between the penultimate time frame and the previous time frame is determined based on the frame delayed version of directions of the active dominant sound sources of the previous time frame and a further frame delayed version thereof.

6. An apparatus for determining directions of uncorrelated sound sources in a Higher Order Ambisonics (HOA) representation of a sound field, comprising: a processor configured to search in a current time frame of HOA coefficients preliminary direction estimates of dominant sound sources, and to determine HOA sound field components based on corresponding dominant sound sources, the processor further configured to determine corresponding directional signals; wherein the processor is further configured to assign the dominant sound sources to corresponding sound sources active in a previous time frame of the HOA coefficients based on a comparison of the preliminary direction estimates of the current time frame and smoothed directions of sound sources active in the previous time frame, wherein the assignment is further based on a correlation of the directional signals of the current time frame and directional signals of sound sources active in the previous time frame, resulting in an assignment function; wherein the processor is further configured to determine smoothed dominant source directions based on the assignment function, the smoothed dominant source directions in the previous time frame, indices of active dominant sound sources in the previous time frame, respective source movement angles between the penultimate time frame and the previous time frame, and the HOA sound field components based on the corresponding dominant sound sources, wherein the processor is further configured to determine indices and directions of active dominant sound sources of the current time frame based on the smoothed dominant source directions, a frame delayed version of directions of the active dominant sound sources of the previous time frame and a frame delayed version of indices of the active dominant sound sources of the previous time frame, wherein the directional signals of sound sources active in the previous time frame are determined based on mode matching based on frame delayed version of directions of the active dominant sound sources of said previous time frame and the HOA coefficients of the previous time frame, and wherein the source movement angles between the penultimate time frame and the previous time frame is determined based on the frame delayed version of directions of the active dominant sound sources of the previous time frame and a further frame delayed version thereof.

7. The method of claim 5, wherein the determination of the detected dominant directional signals and the corresponding preliminary direction estimates, further includes: determining an HOA sound field component based on a subtraction of the corresponding dominant sound sources from the current time frame of HOA coefficients in order to obtain a corresponding residual HOA representation, wherein the subtraction processing is repeatedly performed for each case of a remaining residual HOA representation for further sound field components, wherein the sound field components are excluded for further direction searches.

8. The method of claim 7, further comprising determining a representation for a predefined number of discrete test directions which are nearly uniformly distributed on a unit sphere, wherein directional power distribution is analyzed for presence of a dominant sound source, and based on a determination of an absence of a dominant sound source, the direction
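
The assignment step described in claim 4 (pairing current direction estimates with sources from the previous frame by jointly favoring small angles and high absolute correlation) can be sketched in Python as follows. The claim text shown here does not say how the two criteria are combined, so the cost used below (angle divided by pi, minus the absolute correlation coefficient) and the brute-force search over permutations are assumptions made for illustration only. The function names are hypothetical, and directions are assumed to be unit vectors.

```python
import itertools
import math

def angle_between(u, v):
    """Angle in radians between two unit vectors."""
    c = sum(a * b for a, b in zip(u, v))
    return math.acos(max(-1.0, min(1.0, c)))

def correlation(a, b):
    """Pearson correlation coefficient of two equal-length signals."""
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    num = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    den = math.sqrt(sum((x - ma) ** 2 for x in a) *
                    sum((y - mb) ** 2 for y in b))
    return num / den if den > 0.0 else 0.0

def assign_sources(cur_dirs, cur_sigs, prev_dirs, prev_sigs):
    """Return a tuple p where current estimate i is assigned to previous
    source p[i]. Requires len(prev_dirs) >= len(cur_dirs)."""
    best, best_cost = None, math.inf
    for perm in itertools.permutations(range(len(prev_dirs)), len(cur_dirs)):
        # Assumed joint cost: small angle is good, high |correlation| is good.
        cost = sum(
            angle_between(cur_dirs[i], prev_dirs[j]) / math.pi
            - abs(correlation(cur_sigs[i], prev_sigs[j]))
            for i, j in enumerate(perm)
        )
        if cost < best_cost:
            best, best_cost = perm, cost
    return best

# Toy example: the two sources swapped order between frames.
sig_a = [1.0, -1.0, 1.0, -1.0]
sig_b = [1.0, 1.0, -1.0, -1.0]
prev_dirs, prev_sigs = [(1, 0, 0), (0, 1, 0)], [sig_a, sig_b]
cur_dirs, cur_sigs = [(0, 1, 0), (1, 0, 0)], [sig_b, sig_a]
assignment = assign_sources(cur_dirs, cur_sigs, prev_dirs, prev_sigs)
print(assignment)
```

Exhaustive permutation search is only practical for a handful of sources; a dedicated assignment algorithm would be the usual choice for larger problems.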

Assignees

Inventors

Classifications

  • G10L19/008 · Primary

    Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title

  • Application of ambisonics in stereophonic audio systems · CPC title

  • H04S3/00 · Primary

    Systems employing more than two channels, e.g. quadraphonic (H04S5/00, H04S7/00 take precedence) · CPC title

  • Voice signal separating · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9622008B2 cover?
Higher Order Ambisonics (HOA) represents three-dimensional sound. HOA provides high spatial resolution and facilitates analyzing of the sound field with respect to dominant sound sources. The invention aims to identify independent dominant sound sources constituting the sound field, and to track their temporal trajectories. Known applications are searching for all potential candidates for domin…
Who is the assignee on this patent?
Dolby Laboratories Licensing Corp
What technology area does this patent fall under?
Primary CPC classification G10L19/008. Mapped technology areas include Physics.
When was this patent published?
Publication date Apr 11, 2017 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).