Who is the assignee on this patent?

Dolby Laboratories Licensing Corp, Dolby Int Ab

What technology area does this patent fall under?

Primary CPC classification H04S1/002. Mapped technology areas include Electricity.

When was this patent published?

Publication date Tue Apr 02 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Binaural dialogue enhancement

US11950078B2 · US · B2

Patent metadata
Field	Value
Publication number	US-11950078-B2
Application number	US-202318309099-A
Country	US
Kind code	B2
Filing date	Apr 28, 2023
Priority date	Jan 29, 2016
Publication date	Apr 2, 2024
Grant date	Apr 2, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods for dialogue enhancing audio content, comprising providing a first audio signal presentation of the audio components, providing a second audio signal presentation, receiving a set of dialogue estimation parameters configured to enable estimation of dialogue components from the first audio signal presentation, applying said set of dialogue estimation parameters to said first audio signal presentation, to form a dialogue presentation of the dialogue components; and combining the dialogue presentation with said second audio signal presentation to form a dialogue enhanced audio signal presentation for reproduction on the second audio reproduction system, wherein at least one of said first and second audio signal presentation is a binaural audio signal presentation.

First claim

Opening claim text (preview).

The invention claimed is: 1. A method for dialogue enhancing audio content having one or more audio components, wherein each component is associated with a spatial location, comprising: receiving a first audio signal presentation of the audio components intended for reproduction on a first audio reproduction system; receiving a first set of presentation transform parameters configured to enable transformation of said first audio signal presentation into said second audio signal presentation intended for reproduction on a second audio reproduction system; receiving a second set of presentation transform parameters configured to enable transformation of said first audio signal presentation into an acoustic environment simulation input signal; receiving a set of dialogue estimation parameters configured to enable estimation of dialogue components from the first audio signal presentation; applying the first set of presentation transform parameters to the first audio signal presentation to form the second audio signal presentation; applying the second set of presentation transform parameters to the first audio signal presentation to form the acoustic environment simulation input signal; applying an acoustic environment simulation to the acoustic environment simulation input signal to generate an acoustic environment simulation output signal; applying the set of dialogue estimation parameters to the first audio signal presentation to form a dialogue presentation of the dialogue components; and summing the dialogue presentation with the second audio signal presentation and the acoustic environment simulation output signal to form a dialogue enhanced audio signal presentation for reproduction on the second audio reproduction system; wherein the second audio signal presentation is an anechoic binaural audio signal presentation. 2. The method according to claim 1 , wherein the first audio signal presentation is a stereo or surround audio signal presentation. 3. The method according to claim 1 , further comprising receiving a set of dialogue transform parameters and applying the set of dialogue transform parameters before or after application of said set of dialogue estimation parameters to form a transformed dialogue presentation corresponding to the second audio signal presentation. 4. The method according to claim 1 , wherein said dialogue estimation parameters are configured to also perform a presentation transform, so that the dialogue presentation corresponds to the second audio signal presentation. 5. The method according to claim 1 , further comprising applying a level modification by a factor G to the dialogue presentation. 6. The method according to claim 5 , wherein a first processing is applied when G is less than a given threshold, and a second processing is applied when G is greater than said threshold. 7. The method according to claim 6 , wherein the threshold is equal to zero, wherein G<0 represents attenuation of dialogue and G>0 represents enhancement of dialogue. 8. The method according to claim 1 , wherein the dialogue presentation is a mono presentation, and further comprising: receiving positional data related to said dialogue components, rendering the mono dialogue presentation using the positional data before combining with the second audio signal presentation. 9. The method according to claim 8 , wherein the rendering includes either: selecting head related transfer functions (HRTFs) from a library based on the positional data, and applying the selected HRTFs to the mono dialogue presentation; or amplitude panning. 10. A decoder for dialogue enhancing audio content having one or more audio components, wherein each component is associated with a spatial location, comprising: a core decoder for receiving a first audio signal presentation of the audio components intended for reproduction on a first audio reproduction system, a first set of presentation transform parameters configured to enable transformation of said first audio signal presentation into a second audio signal presentation intended for reproduction on a second audio reproduction system, a second set of presentation transform parameters configured to enable transformation of said first audio signal presentation into an acoustic environment simulation input signal, and a set of dialogue estimation parameters configured to enable estimation of dialogue components from the first audio signal presentation; a first transform unit configured to apply the first set of presentation transform parameters to the first audio signal presentation to form a second audio signal presentation intended for reproduction on a second audio reproduction system; a second transform unit configured to apply the second set of presentation transform parameters to the first audio signal presentation to form the acoustic environment simulation input signal; an acoustic environment simulation unit configured to apply an acoustic environment simulation to the acoustic environment simulation input signal to generate an acoustic environment simulation output signal; a dialogue estimator for applying the set of dialogue estimation parameters to the first audio signal presentation to form a dialogue presentation of the dialogue components; and a summation block for summing the dialogue presentation with the second audio signal presentation and the acoustic environment simulation output signal to form a dialogue enhanced audio signal presentation for reproduction on the second audio reproduction system; wherein the second audio signal presentation is an anechoic binaural audio signal presentation. 11. A non-transitory computer-readable storage medium comprising a set of instructions, wherein, when executed by one or more processors of an audio signal processing device, the instructions cause the one or more processors to perform the method of claim 1 .

Assignees

Inventors

Classifications

H04S1/002Primary
Non-adaptive circuits, e.g. manually adjustable or static, for enhancing the sound image or the spatial distribution (control circuits for electronic adaptation of the sound field H04S7/30) · CPC title
H04R5/04
Circuit arrangements, {e.g. for selective connection of amplifier inputs/outputs to loudspeakers, for loudspeaker detection, or for adaptation of settings to personal preferences or hearing impairments (combinations of amplifiers H03F3/68; stereophonic systems H04S)} · CPC title
H04S3/00Primary
Systems employing more than two channels, e.g. quadraphonic (H04S5/00, H04S7/00 take precedence) · CPC title
H04S7/303
Tracking of listener position or orientation · CPC title
H04S3/008
in which the audio signals are in digital form, i.e. employing more than two discrete digital channels (data reduction aspects thereof based on psychoacoustics G10L19/02) · CPC title

Patent family

Related publications grouped by family.

View patent family 55272356

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11950078B2 cover?: Methods for dialogue enhancing audio content, comprising providing a first audio signal presentation of the audio components, providing a second audio signal presentation, receiving a set of dialogue estimation parameters configured to enable estimation of dialogue components from the first audio signal presentation, applying said set of dialogue estimation parameters to said first audio signal…
Who is the assignee on this patent?: Dolby Laboratories Licensing Corp, Dolby Int Ab
What technology area does this patent fall under?: Primary CPC classification H04S1/002. Mapped technology areas include Electricity.
When was this patent published?: Publication date Tue Apr 02 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).

How to read this patent

Abstract

First claim

Assignees

Inventors

Classifications

Patent family

External sources

Related patents

Audio Decoder and Decoding Method

Decoding method and decoder for dialog enhancement

Hybrid waveform-coded and parametric-coded speech enhancement

Decoder, encoder and method for informed loudness estimation employing by-pass audio object signals in object-based audio coding systems

Frequently asked questions