Methods and apparatuses for encoding and decoding object-based audio signals
US-9449601-B2 · Sep 20, 2016 · US
US9674632B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9674632-B2 |
| Application number | US-201414288293-A |
| Country | US |
| Kind code | B2 |
| Filing date | May 27, 2014 |
| Priority date | May 29, 2013 |
| Publication date | Jun 6, 2017 |
| Grant date | Jun 6, 2017 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A device comprising one or more processors is configured to determine a plurality of segments for each of a plurality of binaural room impulse response filters, wherein each of the plurality of binaural room impulse response filters comprises a residual room response segment and at least one direction-dependent segment for which a filter response depends on a location within a sound field; transform each of at least one direction-dependent segment of the plurality of binaural room impulse response filters to a domain corresponding to a domain of a plurality of hierarchical elements to generate a plurality of transformed binaural room impulse response filters, wherein the plurality of hierarchical elements describe a sound field; and perform a fast convolution of the plurality of transformed binaural room impulse response filters and the plurality of hierarchical elements to render the sound field.
Opening claim text (preview).
What is claimed is: 1. A method of binaural audio rendering performed by an audio playback system, the method comprising: extracting direction-dependent segments of left and right binaural room impulse response (BRIR) filters, wherein: the left BRIR filter comprises a left residual room response segment, the right BRIR filter comprises a right residual room response segment, each of the left and right BRIR filters comprises one of the direction-dependent segments, wherein a filter response for each of the direction-dependent segments depends on a location of a virtual speaker; applying a rendering matrix to transform a left matrix and a right matrix to left and right filter matrices in a spherical harmonic domain respectively, the left matrix and the right matrix including the extracted direction-dependent segments of the left and right BRIR filters; combining the left residual room response segment and the right residual room response segment to produce a left common residual room response segment and a right common residual room response segment; convolving the left filter matrix and spherical harmonic coefficients (SHCs) to produce a left filtered SHC channel, wherein the SHCs describe a sound field; convolving the right filter matrix and the SHCs to produce a right filtered SHC channel; computing a fast convolution of the left common residual room response segment and at least one channel of the SHCs to produce a left residual room signal; computing a fast convolution of the right common residual room response segment and at least one channel of the SHCs to produce a right residual room signal; combining the left residual room signal and the left filtered SHC channel to produce a left binaural output signal; and combining the right residual room signal and the right filtered SHC channel to produce a right binaural output signal. 2. The method of claim 1 , further comprising: after applying the rendering matrix to transform the left matrix to the left filter matrix in the spherical harmonic domain and before convolving the left filter matrix and the SHCs to produce the left filtered SHC channel, modifying the left filter matrix by applying, to the left filter matrix, a first minimum phase reduction and using a first Balanced Model Truncation method to design a first Infinite Impulse Response (IIR) filter to approximate a frequency response of a minimum phase portion of the left filter matrix; and after applying the rendering matrix to transform the right matrix to the right filter matrix in the spherical harmonic domain and before convolving the right filter matrix and the SHCs to produce the right filtered SHC channel, modifying the right filter matrix by applying, to the right filter matrix, a second minimum phase reduction and using a second Balanced Model Truncation method to design a second IIR filter to approximate a frequency response of a minimum phase portion of the right filter matrix. 3. The method of claim 1 , wherein: computing the fast convolution of the left common residual room response segment and at least one channel of the SHCs to produce the left residual room signal comprises convolving the left common residual room response segment only with a highest-order channel of the SHCs to produce the left residual room signal; and computing the fast convolution of the right common residual room response segment and at least one channel of the SHCs to produce the right residual room signal comprises convolving the right common residual room response segment with only the highest-order channel of the SHCs to produce the right residual room signal. 4. The method of claim 1 , the method further comprising: zero-padding the left residual room signal with an onset number of samples; and zero-padding the right residual room signal with the onset number of samples. 5. The method of claim 1 , wherein the left and right BRIR filters are conditioned to remove samples of initial phases of the left and right BRIR filters. 6. A device comprising: a memory; and one or more processors configured to: extract direction-dependent segments of left and right binaural room impulse response (BRIR) filters, wherein: the left BRIR filter comprises a left residual room response segment, the right BRIR filter comprises a right residual room response segment, each of the left and right BRIR filters comprises one of the direction-dependent segments, wherein a filter response for each of the direction-dependent segments depends on a location of a virtual speaker; apply a rendering matrix to transform a left matrix and a right matrix to left and right filter matrices in a spherical harmonic domain respectively, the left matrix and the right matrix including the extracted direction-dependent segments of the left and right BRIR filters; combine the left residual room response segment and the right residual room response segment to produce a left common residual room response segment and a right common residual room response segment; convolve the left filter matrix and spherical harmonic coefficients (SHCs) to produce a left filtered SHC channel, wherein the SHCs describe a sound field; convolve the right filter matrix and the SHCs to produce a right filtered SHC channel; compute a fast convolution of the left common residual room response segment and at least one channel of the SHCs to produce a left residual room signal; compute a fast convolution of the right common residual room response segment and at least one channel of the SHCs to produce a right residual room signal; combine the left residual room signal and the left filtered SHC channel to produce a left binaural output signal; and combine the right residual room signal and the right filtered SHC channel to produce a right binaural output signal. 7. The device of claim 6 , wherein the one or more processors are configured such that: after applying the rendering matrix to transform the left matrix to the left filter matrix in the spherical harmonic domain and before convolving the left filter matrix and the SHCs to produce the left filtered SHC channel, the one or more processors modify the left filter matrix by applying, to the left filter matrix, a first minimum phase reduction and by using a first Balanced Model Truncation method to design a first Infinite Impulse Response (IIR) filter to approximate a frequency response of a minimum phase portion of the left filter matrix; and after applying the rendering matrix to transform the right matrix to the right filter matrix in the spherical harmonic domain and before convolving the right filter matrix and the SHCs to produce the right filtered SHC channel, the one or more processors modify the right filter matrix by applying, to the right filter matrix, a second minimum phase reduction and by using a second Balanced Model Truncation method to design a second IIR filter to approximate a frequency response of a minimum phase portion of the right filter matrix. 8. The device of claim 6 , wherein: to compute the fast convolution of the left common residual room response segment and the at least one channel of the SHCs to produce the left residual room signal, the one or more processors convolve the left common residual room response segment only with a highest-order element of the SHCs to produce the left residual room signal; and to compute the fast convolution of the right common residual room response segment and the at least one channel of the SHCs to produce the right residual room signal, the one or more processors convolve the right common residual room response segment with only the highest-order channel of the SHCs to produce the right residual room signal. 9. The device of claim 6 , wherein t
Application of ambisonics in stereophonic audio systems · CPC title
For headphones · CPC title
Frequency adjustment, e.g. tone control (H04S7/301 takes precedence) · CPC title
Non-adaptive circuits, e.g. manually adjustable or static, for enhancing the sound image or the spatial distribution (control circuits for electronic adaptation of the sound field H04S7/30) · CPC title
Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.