Audio system and method of operation therefor
US-9462387-B2 · Oct 4, 2016 · US
US2016337779A1 · US · A1
| Field | Value |
|---|---|
| Publication number | US-2016337779-A1 |
| Application number | US-201415109557-A |
| Country | US |
| Kind code | A1 |
| Filing date | Dec 23, 2014 |
| Priority date | Jan 3, 2014 |
| Publication date | Nov 17, 2016 |
| Grant date | — |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Methods and systems for designing binaural room impulse responses (BRIRs) for use in headphone virtualizers, and methods and systems for generating a binaural signal in response to a set of channels of a multi-channel audio signal, including by applying a BRIR to each channel of the set, thereby generating filtered signals, and combining the filtered signals to generate the binaural signal, where each BRIR has been designed in accordance with an embodiment of the design method. Other aspects are audio processing units configured to perform any embodiment of the inventive method. In accordance with some embodiments, BRIR design is formulated as a numerical optimization problem based on a simulation model (which generates candidate BRIRs) and at least one objective function (which evaluates each candidate BRIR), and includes identification of a best one of the candidate BRIRs as indicated by performance metrics determined for the candidate BRIRs by each objective function.
Opening claim text (preview).
1 - 11 . (canceled) 12 . A method for generating a binaural signal in response to a set of N channels of a multi-channel audio input signal, where N is a positive integer, said method including steps of: (a) applying N binaural room impulse responses, BRIR 1 , BRIR 2 , . . . , BRIR N , to the set of channels of the audio input signal, thereby generating filtered signals, including by applying the “i”th one of the binaural room impulse responses, BRIR i , to the “i”th channel of the set, for each value of index i in the range from 1 through N; and (b) combining the filtered signals to generate the binaural signal, wherein each said BRIR i , when convolved with the “i”th channel of the set, generates a binaural signal indicative of sound from a source having a direction, x i , and a distance, d i , relative to an intended listener, and at least one of said BRIR i has been designed by a method including steps of: (c) generating candidate binaural room impulse responses (candidate BRIRs) in accordance with a simulation model which simulates a response of an audio source, having a candidate BRIR direction and a candidate BRIR distance relative to an intended listener, where the candidate BRIR direction is at least substantially equal to the direction, x i , and the candidate BRIR distance is at least substantially equal to the distance, d i ; (d) generating performance metrics, including a performance metric for each of the candidate BRIRs, by processing the candidate BRIRs in accordance with at least one objective function; and (e) identifying one of the performance metrics having an extremum value, and identifying, as the BRIR i , one of the candidate BRIRs for which the performance metric has said extremum value; wherein the simulation model is a stochastic model that uses a combination of deterministic and stochastic elements, wherein step (d) includes a step of determining a target BRIR for each said candidate BRIR direction, and wherein the performance metric for each of the candidate BRIRs is indicative of a degree of similarity between said each of the candidate BRIRs and the target BRIR corresponding to the candidate BRIR direction for said each of the candidate BRIRs. 13 . The method of claim 12 , wherein the stochastic elements are driven in part by random variables. 14 . The method of claim 13 , wherein one or more of the random variables are pseudo-random variables. 15 . The method of claim 12 , wherein step (a) includes a step of generating one or more noise sequences. 16 . The method of claim 12 , wherein step (c) includes a step of generating the candidate BRIRs in accordance with predetermined perceptual cues, such that each of the candidate BRIRs, when convolved with the input audio channel, generates a binaural signal indicative of sound which provides said perceptual cues. 17 . The method of claim 12 , wherein step (d) includes a step of comparing a perceptually banded, frequency domain representation of each of the candidate BRIRs with a perceptually banded, frequency domain representation of the target BRIR corresponding to the candidate BRIR direction for said each of the candidate BRIRs. 18 . (canceled) 19 . The method of claim 12 , wherein each of the candidate BRIRs, and thus the BRIR identified in step (c), represents a response of a virtual room. 20 - 27 . (canceled) 28 . A system configured to generate a binaural signal in response to a set of N channels of a multi-channel audio input signal, where N is a positive integer, said system including: a filtering subsystem coupled and configured to apply N binaural room impulse responses, BRIR 1 , BRIR 2 , . . . , BRIR N , to the set of channels of the audio input signal, thereby generating filtered signals, including by applying the “i”th one of the binaural room impulse responses, BRIR i , to the “i”th channel of the set, for each value of index i in the range from 1 through N; and a signal combining subsystem, coupled to the filtering subsystem, and configured to generate the binaural signal by combining the filtered signals, wherein each said BRIR i , when convolved with the “i”th channel of the set, generates a binaural signal indicative of sound from a source having a direction, x i , and a distance, d i , relative to an intended listener, and at least one of said BRIR i has been predetermined by a method including steps of: generating candidate binaural room impulse responses (candidate BRIRs) in accordance with a simulation model which simulates a response of an audio source, having a candidate BRIR direction and a candidate BRIR distance relative to an intended listener, where the candidate BRIR direction is at least substantially equal to the direction, x i , and the candidate BRIR distance is at least substantially equal to the distance, d i ; generating performance metrics, including a performance metric for each of the candidate BRIRs, by processing the candidate BRIRs in accordance with at least one objective function; and identifying one of the performance metrics having an extremum value, and identifying, as the BRIR i , one of the candidate BRIRs for which the performance metric has said extremum value; wherein the simulation model is a stochastic model that uses a combination of deterministic and stochastic elements, wherein each said BRIR i has been designed by a method including a step of determining a target BRIR for each said candidate BRIR direction, and wherein the performance metric for each of the candidate BRIRs is indicative of a degree of similarity between said each of the candidate BRIRs and the target BRIR corresponding to the candidate BRIR direction for said each of the candidate BRIRs. 29 . The system of claim 28 , wherein the stochastic elements are driven in part by random variables. 30 . The system of claim 29 , wherein one or more of the random variables are pseudo-random variables. 31 . The system of claim 28 , wherein the step of generating BRIRs includes a step of generating one or more noise sequences. 32 - 33 . (canceled) 34 . The system of claim 28 , wherein each said BRIR i has been designed by a method including a step of comparing a perceptually banded, frequency domain representation of each of the candidate BRIRs with a perceptually banded, frequency domain representation of the target BRIR corresponding to the candidate BRIR direction for said each of the candidate BRIRs. 35 . The system of claim 34 , wherein the performance metric for said each of the candidate BRIRs is indicative of specific loudness in critical frequency bands of the target BRIR and said each of the candidate BRIRs. 36 . The system of claim 34 , wherein each said perceptually banded, frequency domain representation comprises a left channel having B frequency bands and a right channel having B frequency bands, and the performance metric for said each of the candidate BRIRs is at least substantially equal to: D = 1 B ∑ n = 1 2
Enhancing the perception of the sound image or of the spatial distribution using head related transfer functions [HRTF's] or equivalents thereof, e.g. interaural time difference [ITD] or interaural level difference [ILD] · CPC title
For headphones · CPC title
For headphones · CPC title
Aspects of down-mixing multi-channel audio to configurations with lower numbers of playback channels, e.g. 7.1 -> 5.1 (H04S2400/01 takes precedence) · CPC title
Synergistic effects of band splitting and sub-band processing · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.