Generating Binaural Audio in Response to Multi-Channel Audio Using at Least One Feedback Delay Network
US-2021051435-A1 · Feb 18, 2021 · US
US12185085B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12185085-B2 |
| Application number | US-202117928319-A |
| Country | US |
| Kind code | B2 |
| Filing date | Jun 21, 2021 |
| Priority date | Jun 22, 2020 |
| Publication date | Dec 31, 2024 |
| Grant date | Dec 31, 2024 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
An audio apparatus for generating a diffuse reverberation signal comprises a receiver (501) receiving audio signals representing sound sources and metadata comprising a diffuse reverberation signal to total source relationship indicative of a level of diffuse reverberation sound relative to total emitted sound in the environment. The metadata also for each audio signal comprises a signal level indication and a directivity data indicative of directivity of sound radiation from the sound source represented by the audio signal. A circuit (505, 507) determines a total emitted energy indication based on the signal level indication and the directivity data, and a downmix coefficient based on the total emitted energy and the diffuse reverberation signal to total signal relationship. A downmixer (509) generates a downmix signal by combining signal components for each audio signal generated by applying the downmix coefficient for each audio signal to the audio signal. A reverberator (407) generates the diffuse reverberation signal for the environment from the downmix signal component.
Opening claim text (preview).
The invention claimed is: 1. An audio apparatus comprising: a receiver circuit arranged to receive a plurality of audio signals, and metadata, wherein the plurality of audio signals represent sound sources in an environment, wherein the metadata comprises a diffuse reverberation signal to total signal relationship, directivity data and a signal level indication for each audio signal of the plurality of audio signals, wherein the diffuse reverberation signal to total signal relationship is indicative of a level of diffuse reverberation sound relative to total emitted sound in an environment, wherein the directivity data is indicative of a directivity of sound radiation from the sound source represented by each of the plurality of audio signals; a circuit arranged to determine a total emitted energy indication and a downmix coefficient for each of the plurality of audio signals, wherein the total emitted energy indication is based on the signal level indication and the directivity data, wherein the downmix coefficient is based on the total emitted energy and the diffuse reverberation signal to total signal relationship; a downmixer circuit, wherein the downmixer circuit is arranged to generate a downmix signal by combining signal components of each of the plurality of audio signals generated by applying the corresponding downmix coefficient to each of the plurality of audio signals; and a reverberator circuit, wherein the reverberator circuit is arranged to generate a diffuse reverberation signal from the downmix signal, wherein the diffuse reverberation signal is for the environment. 2. The audio apparatus of claim 1 , wherein the directivity of sound radiation is frequency dependent, wherein the circuit is arranged to determine a frequency dependent total emitted energy and frequency dependent downmix coefficients. 3. The audio apparatus of claim 1 , wherein the diffuse reverberation signal to total signal relationship is frequency dependent, wherein the circuit is arranged to determine frequency dependent downmix coefficients. 4. The audio apparatus of claim 1 , wherein the diffuse reverberation signal to total signal relationship comprises a frequency dependent portion and a non-frequency dependent portion, wherein the circuit is arranged to determine the downmix coefficients using the non-frequency dependent portion, wherein the circuit is arranged to adapt the reverberator circuit using the frequency dependent portion part. 5. The audio apparatus of claim 1 , wherein the circuit is arranged to determine the total emitted energy indication for a first audio signal of the plurality of audio signals in response to a scaling of the signal level indication for the first audio signal, wherein the scaling is by a value determined by integrating a directivity pattern of the sound source represented by the first audio signal, wherein the directivity pattern is determined based on directivity data. 6. The audio apparatus of claim 1 , wherein the signal level indication for a first audio signal of the plurality of audio signals comprises a reference distance, wherein the reference distance indicates a distance from the audio source represented by the first audio signal for a distance reference gain for the first audio signal. 7. The audio apparatus of claim 6 , wherein an integration is performed for a the reference distance from the audio source represented by the first audio signal. 8. The audio apparatus of claim 1 , wherein the diffuse reverberation signal to total signal relationship is indicative of an energy of diffuse reverberation sound relative to an energy of total emitted sound in the environment. 9. The audio apparatus of claim 1 , wherein the diffuse reverberation signal to total signal relationship is indicative of an initial amplitude of diffuse sound relative to an energy of total emitted sound in the environment. 10. The audio apparatus of claim 1 , wherein the downmix coefficient determined for a first audio signal of the plurality of audio signals is independent on a position of a first audio source represented by the first audio signal. 11. The audio apparatus of claim 1 , wherein the downmix coefficient determined for a first audio signal of the plurality of audio signals is independent of a position of a listener. 12. The audio apparatus of claim 1 , wherein the signal level indication for a first audio signal of the plurality of audio signals comprises a gain indication for the first audio signal, wherein the gain indication is indicative of a gain to apply to the first audio signal when rendering sound from a first audio source represented by the first audio signal, wherein the circuit is arranged to determine the downmix coefficient for the first audio signal in response to the gain indication. 13. The audio apparatus of claim 1 , further comprising a direct rendering circuit, wherein the direct rendering circuit is arranged to generate a direct path audio signal for a first audio signal of the plurality of audio signals in response to the signal level indication and the directivity data of the first audio signal. 14. The audio apparatus of claim 1 , wherein the metadata further comprises a delay indication and the diffuse reverberation signal to total signal relationship, wherein the diffuse reverberation signal to total signal relationship is indicative of an energy of a delayed diffuse reverberation sound relative to an energy of total emitted sound in the environment, wherein the delayed diffuse reverberation sound is the diffuse reverberation sound after delay, wherein the delay is substantially the same as the delay indication. 15. A method comprising: receiving a plurality of audio signals, wherein the plurality of audio signals represent sound sources in the environment; receiving metadata, wherein the metadata comprises a diffuse reverberation signal to total signal relationship, directivity data and a signal level indication for each audio signal of the plurality of audio signals, wherein the diffuse reverberation signal to total signal relationship is indicative of a level of diffuse reverberation sound relative to total emitted sound in the environment, wherein the directivity data is indicative of directivity of sound radiation from the sound source represented by each of the plurality of audio signals; determining a total emitted energy indication based on the signal level indication and the directivity data for each of the plurality of audio signals; determining a downmix coefficient based on the total emitted energy and the diffuse reverberation signal to total signal relationship for each of the plurality of audio signals; generating a downmix signal by combining signal components of each of the plurality of audio signals generated by applying the corresponding downmix coefficient to each of the plurality of audio signals; and generating the diffuse reverberation signal for the environment from the downmix signal. 16. A computer program product stored on a non-transitory medium, wherein the computer program when executed on a processor performs the method as claimed in claim 15 . 17. The method of claim 15 , further comprising determining a frequency dependent total emitted energy and frequency dependent downmix coefficients, wherein the directivity of sound radiation is frequency dependent. 18. The method of claim 15 , further comprising determining frequency dependent downmix coefficients, wherein the diffuse reverberation signal to total signal relationship i
Aspects of down-mixing multi-channel audio to configurations with lower numbers of playback channels, e.g. 7.1 -> 5.1 (H04S2400/01 takes precedence) · CPC title
in which the audio signals are in digital form (data reduction aspects thereof based on psychoacoustics G10L19/02) · CPC title
Positioning of individual sound objects, e.g. moving airplane, within a sound field (H04S2420/13 takes precedence) · CPC title
Electronic adaptation of stereophonic sound system to listener position or orientation (H04S7/301 takes precedence) · CPC title
in which the audio signals are in digital form, i.e. employing more than two discrete digital channels (data reduction aspects thereof based on psychoacoustics G10L19/02) · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.