Methods and systems for designing and applying numerically optimized binaural room impulse responses

US2016337779A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2016337779-A1
Application numberUS-201415109557-A
CountryUS
Kind codeA1
Filing dateDec 23, 2014
Priority dateJan 3, 2014
Publication dateNov 17, 2016
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods and systems for designing binaural room impulse responses (BRIRs) for use in headphone virtualizers, and methods and systems for generating a binaural signal in response to a set of channels of a multi-channel audio signal, including by applying a BRIR to each channel of the set, thereby generating filtered signals, and combining the filtered signals to generate the binaural signal, where each BRIR has been designed in accordance with an embodiment of the design method. Other aspects are audio processing units configured to perform any embodiment of the inventive method. In accordance with some embodiments, BRIR design is formulated as a numerical optimization problem based on a simulation model (which generates candidate BRIRs) and at least one objective function (which evaluates each candidate BRIR), and includes identification of a best one of the candidate BRIRs as indicated by performance metrics determined for the candidate BRIRs by each objective function.

First claim

Opening claim text (preview).

1 - 11 . (canceled) 12 . A method for generating a binaural signal in response to a set of N channels of a multi-channel audio input signal, where N is a positive integer, said method including steps of: (a) applying N binaural room impulse responses, BRIR 1 , BRIR 2 , . . . , BRIR N , to the set of channels of the audio input signal, thereby generating filtered signals, including by applying the “i”th one of the binaural room impulse responses, BRIR i , to the “i”th channel of the set, for each value of index i in the range from 1 through N; and (b) combining the filtered signals to generate the binaural signal, wherein each said BRIR i , when convolved with the “i”th channel of the set, generates a binaural signal indicative of sound from a source having a direction, x i , and a distance, d i , relative to an intended listener, and at least one of said BRIR i has been designed by a method including steps of: (c) generating candidate binaural room impulse responses (candidate BRIRs) in accordance with a simulation model which simulates a response of an audio source, having a candidate BRIR direction and a candidate BRIR distance relative to an intended listener, where the candidate BRIR direction is at least substantially equal to the direction, x i , and the candidate BRIR distance is at least substantially equal to the distance, d i ; (d) generating performance metrics, including a performance metric for each of the candidate BRIRs, by processing the candidate BRIRs in accordance with at least one objective function; and (e) identifying one of the performance metrics having an extremum value, and identifying, as the BRIR i , one of the candidate BRIRs for which the performance metric has said extremum value; wherein the simulation model is a stochastic model that uses a combination of deterministic and stochastic elements, wherein step (d) includes a step of determining a target BRIR for each said candidate BRIR direction, and wherein the performance metric for each of the candidate BRIRs is indicative of a degree of similarity between said each of the candidate BRIRs and the target BRIR corresponding to the candidate BRIR direction for said each of the candidate BRIRs. 13 . The method of claim 12 , wherein the stochastic elements are driven in part by random variables. 14 . The method of claim 13 , wherein one or more of the random variables are pseudo-random variables. 15 . The method of claim 12 , wherein step (a) includes a step of generating one or more noise sequences. 16 . The method of claim 12 , wherein step (c) includes a step of generating the candidate BRIRs in accordance with predetermined perceptual cues, such that each of the candidate BRIRs, when convolved with the input audio channel, generates a binaural signal indicative of sound which provides said perceptual cues. 17 . The method of claim 12 , wherein step (d) includes a step of comparing a perceptually banded, frequency domain representation of each of the candidate BRIRs with a perceptually banded, frequency domain representation of the target BRIR corresponding to the candidate BRIR direction for said each of the candidate BRIRs. 18 . (canceled) 19 . The method of claim 12 , wherein each of the candidate BRIRs, and thus the BRIR identified in step (c), represents a response of a virtual room. 20 - 27 . (canceled) 28 . A system configured to generate a binaural signal in response to a set of N channels of a multi-channel audio input signal, where N is a positive integer, said system including: a filtering subsystem coupled and configured to apply N binaural room impulse responses, BRIR 1 , BRIR 2 , . . . , BRIR N , to the set of channels of the audio input signal, thereby generating filtered signals, including by applying the “i”th one of the binaural room impulse responses, BRIR i , to the “i”th channel of the set, for each value of index i in the range from 1 through N; and a signal combining subsystem, coupled to the filtering subsystem, and configured to generate the binaural signal by combining the filtered signals, wherein each said BRIR i , when convolved with the “i”th channel of the set, generates a binaural signal indicative of sound from a source having a direction, x i , and a distance, d i , relative to an intended listener, and at least one of said BRIR i has been predetermined by a method including steps of: generating candidate binaural room impulse responses (candidate BRIRs) in accordance with a simulation model which simulates a response of an audio source, having a candidate BRIR direction and a candidate BRIR distance relative to an intended listener, where the candidate BRIR direction is at least substantially equal to the direction, x i , and the candidate BRIR distance is at least substantially equal to the distance, d i ; generating performance metrics, including a performance metric for each of the candidate BRIRs, by processing the candidate BRIRs in accordance with at least one objective function; and identifying one of the performance metrics having an extremum value, and identifying, as the BRIR i , one of the candidate BRIRs for which the performance metric has said extremum value; wherein the simulation model is a stochastic model that uses a combination of deterministic and stochastic elements, wherein each said BRIR i has been designed by a method including a step of determining a target BRIR for each said candidate BRIR direction, and wherein the performance metric for each of the candidate BRIRs is indicative of a degree of similarity between said each of the candidate BRIRs and the target BRIR corresponding to the candidate BRIR direction for said each of the candidate BRIRs. 29 . The system of claim 28 , wherein the stochastic elements are driven in part by random variables. 30 . The system of claim 29 , wherein one or more of the random variables are pseudo-random variables. 31 . The system of claim 28 , wherein the step of generating BRIRs includes a step of generating one or more noise sequences. 32 - 33 . (canceled) 34 . The system of claim 28 , wherein each said BRIR i has been designed by a method including a step of comparing a perceptually banded, frequency domain representation of each of the candidate BRIRs with a perceptually banded, frequency domain representation of the target BRIR corresponding to the candidate BRIR direction for said each of the candidate BRIRs. 35 . The system of claim 34 , wherein the performance metric for said each of the candidate BRIRs is indicative of specific loudness in critical frequency bands of the target BRIR and said each of the candidate BRIRs. 36 . The system of claim 34 , wherein each said perceptually banded, frequency domain representation comprises a left channel having B frequency bands and a right channel having B frequency bands, and the performance metric for said each of the candidate BRIRs is at least substantially equal to: D = 1 B  ∑ n = 1 2  

Assignees

Inventors

Classifications

  • Enhancing the perception of the sound image or of the spatial distribution using head related transfer functions [HRTF's] or equivalents thereof, e.g. interaural time difference [ITD] or interaural level difference [ILD] · CPC title

  • For headphones · CPC title

  • H04S7/304Primary

    For headphones · CPC title

  • Aspects of down-mixing multi-channel audio to configurations with lower numbers of playback channels, e.g. 7.1 -> 5.1 (H04S2400/01 takes precedence) · CPC title

  • Synergistic effects of band splitting and sub-band processing · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2016337779A1 cover?
Methods and systems for designing binaural room impulse responses (BRIRs) for use in headphone virtualizers, and methods and systems for generating a binaural signal in response to a set of channels of a multi-channel audio signal, including by applying a BRIR to each channel of the set, thereby generating filtered signals, and combining the filtered signals to generate the binaural signal, whe…
Who is the assignee on this patent?
Dolby Laboratories Licensing Corp
What technology area does this patent fall under?
Primary CPC classification H04S7/304. Mapped technology areas include Electricity.
When was this patent published?
Publication date Thu Nov 17 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).