System and method for optimizing loudness and dynamic range across different playback devices

US9841941B2 · US · B2

Patent metadata
Field: Value
Publication number: US-9841941-B2
Application number: US-201414761888-A
Country: US
Kind code: B2
Filing date: Jan 15, 2014
Priority date: Jan 21, 2013
Publication date: Dec 12, 2017
Grant date: Dec 12, 2017

Abstract

Official abstract text for this publication.

Embodiments are directed to a method and system for receiving, in a bitstream, metadata associated with the audio data, and analyzing the metadata to determine whether loudness parameters for a first group of audio playback devices are available in the bitstream. Responsive to determining that the parameters are present for the first group, the system uses the parameters and audio data to render audio. Responsive to determining that the loudness parameters are not present for the first group, the system analyzes one or more characteristics of the first group, and determines the parameters based on the one or more characteristics.
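The branch described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `LoudnessParams`, `DeviceGroup`, `resolve_params` and `render`, the field names, and all numeric values are hypothetical, and the single static gain used for rendering is an assumed stand-in for whatever loudness processing a real decoder applies.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class LoudnessParams:
    # Hypothetical fields; the abstract only speaks of "loudness parameters".
    target_loudness_db: float
    dynamic_range_db: float


@dataclass
class DeviceGroup:
    # Hypothetical description of a playback-device group's characteristics.
    name: str
    typical_target_loudness_db: float
    typical_dynamic_range_db: float


def resolve_params(metadata: Dict[str, LoudnessParams],
                   group: DeviceGroup) -> Tuple[LoudnessParams, str]:
    """Return the parameters for `group` and where they came from."""
    params = metadata.get(group.name)
    if params is not None:
        # Parameters for this group are present in the bitstream metadata.
        return params, "bitstream"
    # Not present: determine them from the group's characteristics.
    derived = LoudnessParams(group.typical_target_loudness_db,
                             group.typical_dynamic_range_db)
    return derived, "derived"


def render(samples: List[float], measured_loudness_db: float,
           params: LoudnessParams) -> List[float]:
    """Assumed rendering step: one static gain toward the target loudness."""
    gain_db = params.target_loudness_db - measured_loudness_db
    gain = 10.0 ** (gain_db / 20.0)  # decibels to linear amplitude factor
    return [s * gain for s in samples]


if __name__ == "__main__":
    # Illustrative values only; none of them come from the patent.
    metadata = {"tv": LoudnessParams(-24.0, 20.0)}
    for group in (DeviceGroup("tv", -23.0, 18.0),
                  DeviceGroup("mobile", -16.0, 8.0)):
        params, source = resolve_params(metadata, group)
        print(group.name, source, render([0.1, -0.2], -26.0, params))
```

Returning the source label alongside the parameters makes it easy to see which branch was taken for a given device group.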

First claim

Opening claim text (preview).

What is claimed is:

1. A method, comprising: receiving a bitstream including audio data and metadata associated with the audio data; analyzing the metadata to determine whether said metadata is or includes profile metadata indicative of a target profile, where the profile metadata is useful to perform at least one of loudness control, loudness normalization, or dynamic range control on the audio data in accordance with the target profile, and where the target profile determines a target loudness and/or at least one target dynamic range characteristic of a rendered version of the audio data for playback by an audio playback device of a group of audio playback devices; responsive to determining that the metadata is or includes the profile metadata, using the profile metadata and the audio data to render audio including by performing said at least one of loudness control, loudness normalization, or dynamic range control on the audio data in accordance with the target profile; and responsive to determining that the metadata is not and does not include the profile metadata, analyzing one or more characteristics of the group, and generating the profile metadata based on the one or more characteristics.

2. The method of claim 1 wherein the one or more characteristics comprises gain levels for different profiles of the audio data.

3. The method of claim 2 wherein the gain levels define at least one of: an artistic dynamic range profile that controls how the audio data will be played back for a defined program, and a short-term dynamic range profile that provides overload protection for portions of the defined program.

4. The method of claim 1 wherein using the profile metadata and the audio data to render audio comprises transmitting the profile metadata and the audio data to a downstream module that renders the audio for playback.

5. The method of claim 1 wherein using the profile metadata and the audio data to render audio comprises rendering the audio data based on the profile metadata and the audio data.

6. The method of claim 1 further comprising: determining an output device that would render the received audio stream; and determining whether or not the output device belongs to the group of audio playback devices, wherein the step of analyzing the metadata to determine whether said metadata is or includes the profile metadata is executed after the step of determining whether or not the output device belongs to the group of audio playback devices.

7. The method of claim 6 wherein the step of determining that the output device belongs to the group of audio playback devices comprises: receiving an indication from a module connected to the output device indicating an identity of the output device or indicating an identity of a group of devices that include the output device; and determining that the output device belongs to the group of audio playback devices based on the received indication.

8. An apparatus, comprising: an interface configured to receive a bitstream including audio data and metadata associated with the audio data; an analyzer coupled to the interface and configured to analyze the metadata to determine whether said metadata is or includes profile metadata indicative of a target profile, where the profile metadata is useful to perform at least one of loudness control, loudness normalization, or dynamic range control on the audio data in accordance with the target profile, and where the target profile determines a target loudness and/or at least one target dynamic range characteristic of a rendered version of the audio data for playback by an audio playback device of a group of audio playback devices; and a rendering component configured to use the profile metadata and the audio data to render audio, including by performing said at least one of loudness control, loudness normalization, or dynamic range control on the audio data in accordance with the target profile, responsive to the analyzer determining that the metadata is or includes the profile metadata, the rendering component further configured to analyze one or more characteristics of the group and determine the profile metadata based on the one or more characteristics responsive to the analyzer determining that the metadata is not and does not include the profile metadata.

9. The apparatus of claim 8 wherein the rendering component using the profile metadata and the audio data to render audio comprises the rendering component transmitting the profile metadata and the audio data to a downstream module that renders the audio for playback.

10. The apparatus of claim 9 wherein the rendering component using the profile metadata and the audio data to render audio comprises the rendering component rendering the audio data based on the profile metadata and the audio data.

11. The apparatus of claim 10 further comprising a second component configured to determine an output device that would render the received audio stream and determine whether or not the output device belongs to the group of audio playback devices; wherein the analyzer is configured to analyze the metadata in stream to determine whether the metadata is or includes the profile metadata after the second component determines whether or not the output device belongs to the group of audio playback devices.

12. The apparatus of claim 11 further comprising an interface configured to receive an indication from a module connected to the output device indicating an identity of the output device or indicating an identity of a group of devices that include the output device, and to determine that the output device belongs to the group of audio playback devices based on the received indication.

13. A method, comprising: receiving audio data and metadata associated with the audio data; analyzing the metadata to determine whether said metadata is or includes loudness data indicative of a target profile, where the loudness data is useful to perform at least one of loudness control, loudness normalization, or dynamic range control on the audio data in accordance with the target profile, and where the target profile determines a target loudness and/or at least one target dynamic range characteristic of a rendered version of the audio data for playback by an audio playback device of a group of audio devices; responsive to determining that the metadata is or includes the loudness data, transmitting the audio data and the loudness data for use in rendering audio; and responsive to determining that the metadata is not and does not include the loudness data, determining the loudness data, and transmitting the determined loudness data for use in rendering audio.

14. The method of claim 13, wherein the step of determining the loudness data comprises: analyzing characteristics of different profiles of the audio data; determining the loudness data based on the characteristics.

15. The method of claim 14 wherein the characteristics comprise gain levels for the different profiles of the audio data.

16. The method of claim 15 wherein the gain levels define at least one of: an artistic dynamic range profile that controls how the audio data will be played back for a defined program, and a short-term dynamic range profile that provides overload protection for portions of the defined program.

17. The method of claim 13, wherein the loudness data includes loudness parameters for the target profile.

18. An apparatus for decoding audio data, comprising: an interface configured to receive the audio data and metadata associated with the audio data; and
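The ordering recited in claims 6 and 7 (identify the output device from a received indication, decide group membership, and only then analyze the metadata) might be sketched as follows. This is an illustrative reading, not the claimed implementation: `Indication`, `GROUP_MEMBERS`, `belongs_to_group`, `process`, the `"profiles"` metadata key and the device names are all hypothetical. Stopping early when the device is not in the group is also an assumption, since the claims only fix the order of the two steps.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Set


@dataclass(frozen=True)
class Indication:
    # Hypothetical message from a module connected to the output device.
    # It identifies either the device itself or a group that includes it.
    device_id: Optional[str] = None
    group_id: Optional[str] = None


# Hypothetical lookup table mapping group names to known device identities.
GROUP_MEMBERS: Dict[str, Set[str]] = {
    "portable": {"phone-a", "tablet-b"},
    "avr": {"receiver-c"},
}


def belongs_to_group(indication: Indication, group: str) -> bool:
    """Decide group membership from the received indication."""
    if indication.group_id is not None:
        return indication.group_id == group
    return indication.device_id in GROUP_MEMBERS.get(group, set())


def process(indication: Indication, group: str, metadata: dict) -> List[str]:
    """Return the ordered steps taken, to make the sequencing visible."""
    steps: List[str] = []
    member = belongs_to_group(indication, group)
    steps.append(f"membership:{member}")
    if not member:
        # Assumption: nothing more is done for devices outside the group.
        return steps
    # The metadata is analyzed only after the membership determination.
    has_profile = group in metadata.get("profiles", {})
    steps.append("use_profile_metadata" if has_profile
                 else "generate_profile_metadata")
    return steps


if __name__ == "__main__":
    metadata = {"profiles": {"portable": {"target_loudness_db": -16.0}}}
    print(process(Indication(device_id="phone-a"), "portable", metadata))
    print(process(Indication(group_id="avr"), "avr", metadata))
    print(process(Indication(device_id="unknown"), "portable", metadata))
```

Recording the steps in a list keeps the sketch easy to check: the membership entry always precedes any metadata decision.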

Assignees

Inventors

Classifications

  • H03G9/005 · Primary

    of digital or coded signals · CPC title

  • for improving intelligibility · CPC title

  • G06F3/165 · Primary

    Management of the audio stream, e.g. setting of volume, audio stream path · CPC title

  • H03G9/00 · Primary

    Combinations of two or more types of control, e.g. gain control and tone control · CPC title

  • Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech (G10L21/02 takes precedence) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

Who is the assignee on this patent?
Dolby Int Ab, Dolby Laboratories Licensing Corp
What technology area does this patent fall under?
Primary CPC classification H03G9/005. Mapped technology areas include Electricity.
When was this patent published?
Publication date Dec 12, 2017 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).