Audio encoder and decoder with dynamic range compression metadata

US9959878B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9959878-B2
Application numberUS-201615189710-A
CountryUS
Kind codeB2
Filing dateJun 22, 2016
Priority dateJun 19, 2013
Publication dateMay 1, 2018
Grant dateMay 1, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An audio processing unit (APU) is disclosed. The APU includes a buffer memory configured to store at least one frame of an encoded audio bitstream, where the encoded audio bitstream includes audio data and a metadata container. The metadata container includes a header and one or more metadata payloads after the header. The one or more metadata payloads include dynamic range compression (DRC) metadata, and the DRC metadata is or includes profile metadata indicative of whether the DRC metadata includes dynamic range compression (DRC) control values for use in performing dynamic range compression in accordance with at least one compression profile on audio content indicated by at least one block of the audio data.

First claim

Opening claim text (preview).

What is claimed is: 1. An audio processing unit, comprising: a buffer memory which is a non-transitory medium, configured to store at least one frame of an encoded audio bitstream, wherein the encoded audio bitstream includes audio data and a metadata container, wherein the metadata container includes a header and one or more metadata payloads after the header, the one or more metadata payloads include dynamic range compression (DRC) metadata, and the DRC metadata is or includes profile metadata indicative of whether the DRC metadata includes dynamic range compression (DRC) control values for use in performing dynamic range compression in accordance with at least one compression profile on audio content indicated by at least one block of the audio data; a subsystem configured to: determine that the profile metadata indicates that DRC metadata includes DRC control values for use in performing dynamic range compression in accordance with one said compression profile; parse the encoded audio bitstream to extract the DRC control values; and perform dynamic range compression, on at least some of the audio data or on decoded audio data generated by decoding the at least some of the audio data using at least some of the extracted DRC control values on at least some of the audio data or decoded audio data. 2. The audio processing unit of claim 1 , wherein one said compression profile is a profile for dynamic range compression of audio data indicative of speech. 3. The audio processing unit of claim 1 , wherein one said compression profile is a film standard compression profile, a film light compression profile, a music standard compression profile, or a music light compression profile. 4. The audio processing unit of claim 1 , also including: an audio decoder coupled to the buffer memory and configured to decode the audio data thereby generating decoded audio data. 5. The audio processing unit of claim 4 , wherein the subsystem coupled to the parser is also coupled to the audio decoder, and is configured to perform dynamic range compression on at least some of the decoded audio data using at least some of the DRC metadata. 6. An audio decoding method, comprising steps of: receiving an encoded audio bitstream, wherein the encoded audio bitstream is segmented into one or more frames; extracting audio data and a container of metadata from the encoded audio bitstream, wherein the container of metadata includes a header, and one or more metadata payloads after the header, and wherein the one or more metadata payloads include dynamic range compression (DRC) metadata, and the DRC metadata is or includes profile metadata indicative of whether the DRC metadata includes dynamic range compression (DRC) control values for use in performing dynamic range compression in accordance with at least one compression profile on audio content indicated by at least one block of the audio data, and wherein determining that the profile metadata indicates that DRC metadata includes DRC control values for use in performing dynamic range compression in accordance with one said compression profile; parsing the encoded audio bitstream to extract the DRC control values; and performing dynamic range compression, on at least some of the audio data or on decoded audio data generated by decoding said at least some of the audio data, and using at least some of the extracted DRC control values on at least some of the audio data or decoded audio data. 7. The method of claim 6 , wherein one said compression profile is a profile for dynamic range compression of audio data indicative of speech. 8. The method of claim 6 , wherein one said compression profile is a film standard compression profile, a film light compression profile, a music standard compression profile, or a music light compression profile. 9. The method of claim 6 , wherein the audio data are encoded audio data, and also comprising a step of: decoding the encoded audio data to generate decoded audio data. 10. The method of claim 9 , also comprising: performing dynamic range compression on at least some of the decoded audio data using at least some of the DRC metadata. 11. A non-transitory, computer-readable storage medium having instructions stored thereon that when executed by one or more audio processing units, cause the one or more audio processing units to perform operations comprising: receiving an encoded audio bitstream, wherein the encoded audio bitstream is segmented into one or more frames; extracting audio data and a container of metadata from the encoded audio bitstream, wherein the container of metadata includes a header, and one or more metadata payloads after the header, and wherein the one or more metadata payloads include dynamic range compression (DRC) metadata, and the DRC metadata is or includes profile metadata indicative of whether the DRC metadata includes dynamic range compression (DRC) control values for use in performing dynamic range compression in accordance with at least one compression profile on audio content indicated by at least one block of the audio data, and wherein if the profile metadata indicates that DRC metadata includes DRC control values for use in performing dynamic range compression in accordance with one said compression profile, the DRC metadata also includes a set of DRC control values generated in accordance with the compression profile; parsing the encoded audio bitstream to extract the DRC control values; and performing dynamic range compression, on at least some of the audio data or on decoded audio data generated by decoding said at least some of the audio data, and using at least some of the extracted DRC control values on at least some of the audio data or decoded audio data. 12. The storage medium of claim 11 , wherein one said compression profile is a profile for dynamic range compression of audio data indicative of speech. 13. The storage medium of claim 11 , wherein one said compression profile is a film standard compression profile, a film light compression profile, a music standard compression profile, or a music light compression profile.

Assignees

Inventors

Classifications

  • by changing the amplitude · CPC title

  • Pre-filtering or post-filtering · CPC title

  • G10L19/167Primary

    Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes · CPC title

  • Vocoder architecture · CPC title

  • Systems employing more than two channels, e.g. quadraphonic (H04S5/00, H04S7/00 take precedence) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9959878B2 cover?
An audio processing unit (APU) is disclosed. The APU includes a buffer memory configured to store at least one frame of an encoded audio bitstream, where the encoded audio bitstream includes audio data and a metadata container. The metadata container includes a header and one or more metadata payloads after the header. The one or more metadata payloads include dynamic range compression (DRC) me…
Who is the assignee on this patent?
Dolby Laboratories Licensing Corp
What technology area does this patent fall under?
Primary CPC classification G10L19/167. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 01 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).