Concept for generating an enhanced sound field description or a modified sound field description using a multi-point sound field description
US-2024098445-A1 · Mar 21, 2024 · US
US9622008B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9622008-B2 |
| Application number | US-201414766739-A |
| Country | US |
| Kind code | B2 |
| Filing date | Feb 7, 2014 |
| Priority date | Feb 8, 2013 |
| Publication date | Apr 11, 2017 |
| Grant date | Apr 11, 2017 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Higher Order Ambisonics (HOA) represents three-dimensional sound. HOA provides high spatial resolution and facilitates analyzing of the sound field with respect to dominant sound sources. The invention aims to identify independent dominant sound sources constituting the sound field, and to track their temporal trajectories. Known applications are searching for all potential candidates for dominant sound source directions by looking at the directional power distribution of the original HOA representation, whereas in the invention all components which are correlated with the signals of previously found sound sources are removed. By such operation the problem of erroneously detecting many instead of only one correct sound source can be avoided in case its contributions to the sound field are highly directionally dispersed.
Opening claim text (preview).
The invention claimed is: 1. A method for determining directions of uncorrelated sound sources in a Higher Order Ambisonics (HOA) representation of a sound field, comprising: in a current time frame of HOA coefficients, searching preliminary direction estimates of dominant sound sources; and determining HOA sound field components based on corresponding dominant sound sources, wherein a current direction estimate is determined based on a residual HOA representation which represents an original HOA representation from which all components correlated with signals of previously found sound sources have been removed, wherein the current direction estimate is selected out of a set of predefined test directions, based on a power of a related general plane wave of the residual HOA representation, impinging from a direction on a listener position, relative to respective power of all other test directions, and wherein the current direction estimate for the current time frame of HOA coefficients is assigned to at least a dominant sound source of a previous time frame of HOA coefficients and is smoothed with respect to a time trajectory. 2. The method of claim 1 , wherein the smoothing is based on a Bayesian inference process that exploits a statistical a priori sound source movement model and directional power distributions of the dominant sound source components of the original HOA representation. 3. The method of claim 2 , wherein the statistical a priori model statistically predicts a movement of individual sound sources based on their direction in the previous time frame and movement between the previous time frame and a penultimate time frame. 4. The method of claim 2 , wherein direction estimates are assigned to dominant sound sources of the previous time frame of HOA coefficients based on a joint minimization of angles between pairs of a direction estimate and a direction of a previously found sound source, and maximization of an absolute value of a correlation coefficient between the pairs of the directional signals related to a direction estimate and to a dominant sound source found in the previous time frame of HOA coefficients. 5. A method for determining directions of uncorrelated sound sources in a Higher Order Ambisonics (HOA) representation of a sound field, comprising: in a current time frame of HOA coefficients, searching preliminary direction estimates of dominant sound sources, and determining HOA sound field components based on corresponding dominant sound sources, and determining corresponding directional signals; assigning the dominant sound sources to corresponding sound sources active in a previous time frame of the HOA coefficients based on a comparison of the preliminary direction estimates of the current time frame and smoothed directions of sound sources active in the previous time frame, wherein the assignment is further based on a correlation of directional signals of the current time frame and directional signals of sound sources active in the previous time frame, resulting in an assignment function; determining smoothed dominant source directions based on the assignment function, the smoothed dominant source directions in the previous time frame, indices of active dominant sound sources in the previous time frame, respective source movement angles between the penultimate time frame and the previous time frame, and the HOA sound field components based on the corresponding dominant sound sources; and determining indices and directions of the active dominant sound sources of the current time frame based on the smoothed dominant source directions, a frame delayed version of directions of the active dominant sound sources of the previous time frame and a frame delayed version of indices of the active dominant sound sources of the previous time frame, wherein the directional signals of sound sources active in the previous time frame are determined based on mode matching based on the frame delayed version of directions of the active dominant sound sources of the previous time frame and the HOA coefficients of the previous time frame, and wherein the source movement angles between the penultimate time frame and the previous time frame is determined based on the frame delayed version of directions of the active dominant sound sources of the previous time frame and a further frame delayed version thereof. 6. An apparatus for determining directions of uncorrelated sound sources in a Higher Order Ambisonics (HOA) representation of a sound field, comprising: a processor configured to search in a current time frame of HOA coefficients preliminary direction estimates of dominant sound sources, and to determine HOA sound field components based on corresponding dominant sound sources, the processor further configured to determine corresponding directional signals; wherein the processor is further configured to assign the dominant sound sources to corresponding sound sources active in a previous time frame of the HOA coefficients based on a comparison of the preliminary direction estimates of the current time frame and smoothed directions of sound sources active in the previous time frame, wherein the assignment is further based on a correlation of the directional signals of the current time frame and directional signals of sound sources active in the previous time frame, resulting in an assignment function; wherein the processor is further configured to determine smoothed dominant source directions based on the assignment function, the smoothed dominant source directions in the previous time frame, indices of active dominant sound sources in the previous time frame, respective source movement angles between the penultimate time frame and the previous time frame, and the HOA sound field components based on the corresponding dominant sound sources, wherein the processor is further configured to determine indices and directions of active dominant sound sources of the current time frame based on the smoothed dominant source directions, a frame delayed version of directions of the active dominant sound sources of the previous time frame and a frame delayed version of indices of the active dominant sound sources of the previous time frame, wherein the directional signals of sound sources active in the previous time frame are determined based on mode matching based on frame delayed version of directions of the active dominant sound sources of said previous time frame and the HOA coefficients of the previous time frame, and wherein the source movement angles between the penultimate time frame and the previous time frame is determined based on the frame delayed version of directions of the active dominant sound sources of the previous time frame and a further frame delayed version thereof. 7. The method of claim 5 , wherein the determination of the detected dominant directional signals and the corresponding preliminary direction estimates, further includes: determining an HOA sound field component based on a subtraction of the corresponding dominant sound sources from the current time frame of HOA coefficients in order to obtain a corresponding residual HOA representation, wherein the subtraction processing is repeatedly performed for each case of a remaining residual HOA representation for further sound field components, wherein the sound field components are excluded for further direction searches. 8. The method of claim 7 , further comprising determining a representation for a predefined number of discrete test directions which are nearly uniformly distributed on a unit sphere, wherein directional power distribution is analyzed for presence of a dominant sound source, and based on a determination of an absence of a dominant sound source, the direction
Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title
Application of ambisonics in stereophonic audio systems · CPC title
Systems employing more than two channels, e.g. quadraphonic (H04S5/00, H04S7/00 take precedence) · CPC title
Voice signal separating · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.