US9838822B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9838822-B2 |
| Application number | US-201414779326-A |
| Country | US |
| Kind code | B2 |
| Filing date | Mar 17, 2014 |
| Priority date | Mar 22, 2013 |
| Publication date | Dec 5, 2017 |
| Grant date | Dec 5, 2017 |
Official abstract text for this publication.
Recordings from microphones that provide 1st order Ambisonics signals, so-called B-format signals, offer a limited cognition of sound directivity. Sound sources are perceived broader than they actually are, especially for off-center listening positions, and the sound sources are often located to be coming from the closest speaker positions. In a method and apparatus for enhancing the directivity of 1st order Ambisonics signals, additional directivity information is extracted (SFA) from the lower order Ambisonics input signal. The additional directivity information is used to estimate higher order Ambisonics coefficients, which are then combined with the coefficients of the input signal. Thus, the directivity of the Ambisonics signal is enhanced, which leads to an increased accuracy of spatial source localization when the Ambisonics signal is decoded to loudspeaker signals. The resulting output signal has more energy than the input signal.
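The sound field analysis (SFA) step named in the abstract can be illustrated with a minimal sketch. The text on this page does not give the estimator formulas, so the code assumes a common DirAC-style estimator and a traditional B-format convention in which W carries a 1/√2 gain. The function names `sound_field_analysis` and `direct_sound`, and the `sqrt(1 - diffuseness)` gain for the direct sound, are illustrative assumptions rather than the patent's own definitions.

```python
import math

SQRT2 = math.sqrt(2.0)

def sound_field_analysis(frames):
    """Estimate (azimuth, elevation, diffuseness) for one frequency bin.

    frames: list of (W, X, Y, Z) complex values for the same bin over
    consecutive time frames. Assumes W is scaled by 1/sqrt(2) (assumption).
    """
    n = len(frames)
    ix = iy = iz = 0.0
    energy = 0.0
    for w, x, y, z in frames:
        wc = w.conjugate()
        # Intensity-like vector; under this encoding it points toward the source.
        ix += (wc * x).real
        iy += (wc * y).real
        iz += (wc * z).real
        energy += abs(w) ** 2 + 0.5 * (abs(x) ** 2 + abs(y) ** 2 + abs(z) ** 2)
    ix, iy, iz, energy = ix / n, iy / n, iz / n, energy / n
    if energy <= 0.0:
        return 0.0, 0.0, 1.0  # silent bin: treat as fully diffuse
    norm = math.sqrt(ix * ix + iy * iy + iz * iz)
    # Diffuseness is 0 for a single plane wave and 1 when the averaged vector cancels.
    diffuseness = min(1.0, max(0.0, 1.0 - SQRT2 * norm / energy))
    azimuth = math.atan2(iy, ix)
    elevation = math.atan2(iz, math.hypot(ix, iy))
    return azimuth, elevation, diffuseness

def direct_sound(w, diffuseness):
    """Scale the 0th order channel by an assumed direct-sound gain."""
    return math.sqrt(1.0 - diffuseness) * w
```

For a single plane wave encoded at 30 degrees azimuth this sketch returns that azimuth with a diffuseness near zero, while two equal waves from opposite directions average to a diffuseness of one.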
Opening claim text (preview).
The invention claimed is:

1. A method for enhancing directivity of an input signal being a 1st order Ambisonics signal and having coefficients of 0th order and 1st order, the method including: filtering the input signal in an Analysis Filter bank, wherein four frequency domain channels are obtained that are a frequency domain representation of the 1st order Ambisonics signal, and wherein one first frequency domain channel of the frequency domain channels represents 0th order coefficients and three remaining frequency domain channels represent 1st order coefficients; performing a Sound Field Analysis of the four frequency domain channels, whereby source directions and a diffuseness estimate are obtained; filtering in a filter the first frequency domain channel that has 0th order coefficients to determine a direct sound based on the diffuseness estimate; encoding in a Higher Order Ambisonics encoder the direct sound component based on the source directions, wherein the direct sound component is encoded in Ambisonics format with a pre-defined order that has a value of at least two, wherein encoded direct sound in Ambisonics format of the pre-defined order is obtained, and wherein the encoded direct sound in Ambisonics format of the pre-defined order including Ambisonics coefficients of an order higher than 1st order; selecting from the encoded direct sound in the Ambisonics format of the pre-defined order only Ambisonics coefficients of 2nd order or higher order, wherein coefficients of 1st order and 0th order are omitted; and combining in a Combining and Synthesis unit a signal representing the selected Ambisonics coefficients of 2nd order or higher order from the encoded direct sound with the input signal, wherein an enhanced directivity Ambisonics signal of at least 2nd order is obtained.

2. The method according to claim 1, wherein the combining the selected Ambisonics coefficients of 2nd order or higher order from the encoded direct sound with the input signal includes: combining in a frequency domain Combiner unit Ambisonics coefficients of the four frequency domain channels with the selected frequency coefficients of the selected Ambisonics coefficients of 2nd order or higher order from the encoded direct sound, wherein a signal is obtained that is a frequency domain representation of an Ambisonics signal of at least 2nd order, and filtering in a Synthesis Filter Bank the obtained signal to determine a time domain representation of an enhancement Higher Order Ambisonics signal that has coefficients of at least 2nd order.

3. The method according to claim 1, wherein the combining the selected Ambisonics coefficients of 2nd order or higher order from the encoded direct sound with the input signal includes: filtering in a synthesis filter bank the selected Ambisonics coefficients of 2nd order or higher order from the encoded direct sound, wherein a time domain representation of an enhancement Higher Order Ambisonics signal is obtained that includes coefficients of 2nd order or higher order; combining in a time domain combiner Ambisonics coefficients representative of the input signal with the time domain representation of said enhancement Higher Order Ambisonics signal of 2nd order or higher order, wherein a time domain representation of an Ambisonics signal of at least 2nd order is obtained that has enhanced directivity as compared to the input signal.

4. The method according to claim 1, wherein in said encoding the direct sound component in Ambisonics format with pre-defined order, the Higher Order Ambisonics encoder uses B-format.

5. The method according to claim 1, wherein in the encoding the direct sound component in Ambisonics format with pre-defined order, the Higher Order Ambisonics encoder uses an Ambisonics format other than B-format, further including: re-formatting in a HOA format adaptation unit, before said combining, the input signal according to the Ambisonics format other than B-format to obtain re-formatted Ambisonics coefficients of the input signal, and wherein, in said combining, the combiner combines the re-formatted Ambisonics coefficients of the input signal with the time domain representation of the enhancement Higher Order Ambisonics signal of 2nd order or higher order.

6. The method according to claim 1, wherein the performing a Sound Field Analysis of the four frequency domain channels includes: performing an active Intensity analysis of the four frequency domain channels, wherein a value representing active intensity is obtained; performing a diffuseness analysis of the four frequency domain channels, wherein said diffuseness estimate is obtained; and performing a Direction-of-Arrival analysis of the value representing active intensity to obtain the source directions.

7. The method according to claim 1, further including mixing the enhanced Ambisonics signal of at least 2nd order with a further HOA input signal of a higher order or a different Ambisonics format, wherein a HOA signal that includes a mixture of the input signal and said further HOA input signal is obtained.

8. The method according to claim 1, wherein the resulting HOA signal has $O=(N_{order}+1)^2$ components for 3D realizations and $O=(2N_{order}+1)$ components for 2D realizations, wherein $N_{order}$ is the order of the HOA encoder, and the resulting HOA signal has $C_n^m$ coefficients according to $C_n^m: [A_0^0, A_1^{-1}, A_1^0, A_1^1, B_2^{-2}, B_2^{-1}, B_2^0, B_2^1, B_2^2, \ldots\,']$, wherein the $A_i^j$ are coefficients of the input signal and the $B_i^j$ are the selected HOA coefficients from the encoded direct sound.

9. An apparatus for enhancing directivity of an input signal being a 1st order Ambisonics signal and having coefficients of 0th order and 1st order, the apparatus including: an Analysis Filter bank for filtering the input signal, wherein four frequency domain channels are obtained that are a frequency domain representation of the 1st order Ambisonics signal, and wherein one frequency domain channel of the frequency domain channels represents 0th order coefficients and three of the frequency domain channels represent 1st order coefficients; a Sound Field Analysis unit for performing a sound field analysis of the four frequency domain channels, whereby source directions and a diffuseness estimate are obtained; a Filter for filtering the frequency domain channel that has 0th order coefficients to determine a direct sound based on the diffuseness estimate; a Higher Order Ambisonics encoder for encoding the direct sound component based on the source directions, wherein the direct sound component is encoded in Ambisonics format with a pre-defined order that has a value of at least two, wherein encoded direct sound in Ambisonics format of the pre-defined order is obtained, and wherein the encoded direct sound in Ambisonics format of the pre-defined order including Ambisonics coefficients of an order higher than 1st order; a Selector for selecting, from the encoded direct sound in the Ambisonics format of the pre-defined order only Ambisonics coefficients of 2nd order or higher order, wherein coefficients of 1st order and 0th order are omitted; and a Combining and Synthesis unit for combining a signal representing the selected Ambisonics coefficients of 2nd order or higher order from the encoded direct sound with the input signal, wherein an enhanced directivity Ambisonics signal of at least 2nd order is obtained.

10. The apparatus according to claim 9, wherein the Combining and Synthesis unit includes: a frequency domain Combiner unit for combining Ambisonics coefficients of the four frequency domai
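
The component counts and coefficient layout recited in claim 8, together with the select-and-combine steps of claim 1, can be sketched as follows. This is a minimal illustration under stated assumptions: coefficients are stored order by order (0th, then 1st, then 2nd, and so on), as the vector in claim 8 suggests, and the helper names `hoa_component_count` and `combine` are hypothetical, not taken from the patent.

```python
import math

def hoa_component_count(order, three_d=True):
    """Number of HOA components: (N+1)^2 in 3D, 2N+1 in 2D."""
    return (order + 1) ** 2 if three_d else 2 * order + 1

def combine(input_coeffs, encoded_direct):
    """Build the enhanced 3D coefficient vector for one sample or bin.

    input_coeffs: the 4 coefficients of the 1st order input signal (the A terms).
    encoded_direct: (N+1)^2 coefficients of the direct sound encoded at a
    pre-defined order N >= 2, ordered by increasing order (assumption).
    """
    n_low = hoa_component_count(1)  # 4 components of order 0 and 1
    if len(input_coeffs) != n_low:
        raise ValueError("expected 4 first-order input coefficients")
    total = len(encoded_direct)
    order = math.isqrt(total) - 1
    if (order + 1) ** 2 != total or order < 2:
        raise ValueError("encoded direct sound must have (N+1)^2 coefficients, N >= 2")
    # Keep only coefficients of 2nd order or higher (the B terms);
    # the encoder's 0th and 1st order coefficients are omitted.
    selected = list(encoded_direct[n_low:])
    return list(input_coeffs) + selected
```

With a 2nd order encoder the result has nine entries: the four input coefficients followed by the five 2nd order coefficients of the encoded direct sound.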
Aspects of sound capture and related signal processing for recording or reproduction · CPC title
Application of ambisonics in stereophonic audio systems · CPC title
Control circuits for electronic adaptation of the sound field · CPC title
Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title
Speech enhancement, e.g. noise reduction or echo cancellation (reducing echo effects in line transmission systems H04B3/20; echo suppression in hands-free telephones H04M9/08) · CPC title