Audio encoder and decoder with program information or substream structure metadata

US10147436B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10147436-B2
Application numberUS-201615187310-A
CountryUS
Kind codeB2
Filing dateJun 20, 2016
Priority dateJun 19, 2013
Publication dateDec 4, 2018
Grant dateDec 4, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Apparatus and methods for generating an encoded audio bitstream, including by including substream structure metadata (SSM) and/or program information metadata (PIM) and audio data in the bitstream. Other aspects are apparatus and methods for decoding such a bitstream, and an audio processing unit (e.g., an encoder, decoder, or post-processor) configured (e.g., programmed) to perform any embodiment of the method or which includes a buffer memory which stores at least one frame of an audio bitstream generated in accordance with any embodiment of the method.

First claim

Opening claim text (preview).

What is claimed is: 1. An audio processing unit, comprising: a buffer memory; and at least one processing subsystem coupled to the buffer memory, wherein the buffer memory stores at least one frame of an encoded audio bitstream, wherein the encoded audio bitstream is indicative of at least one audio program having at least one independent substream of audio data, said frame including program information metadata in at least one metadata segment of at least one reserved field of the frame and substream structure metadata in said at least one metadata segment of at least one reserved field of the frame, and the audio data in at least one other segment of the frame, wherein the metadata segment includes a metadata segment header; after the metadata segment header, at least one protection value useful for at least one of decryption, authentication, or validation of at least one of the program information metadata or the substream structure metadata or the audio data corresponding to said program information metadata or said substream structure metadata; after the metadata segment header, metadata payload identification and payload configuration values identifying a type and at least one aspect of a configuration of each following metadata payload; after the metadata payload identification and payload configuration values, at least one metadata payload, said metadata payload comprising: a header; and after the header, at least some of the program information metadata or at least some of the substream structure metadata, wherein the program information metadata includes active channel metadata indicative of each non-silent channel and each silent channel and the substream structure metadata includes independent substream metadata indicative of number of independent substreams of the audio program, and dependent substream metadata indicative of whether each independent substream of the audio program has at least one associated dependent sub stream, and wherein the reserved field is selected from the group consisting of a skip field, an additional bitstream information (“addbsi”) field, and an auxiliary data (“auxdata”) field, and wherein the processing subsystem is coupled and configured to: extract program information metadata and substream structure metadata from the metadata payload; responsive to extraction of the program information metadata: extract, from the program information metadata, the active channel metadata indicative of each non-silent channel and each silent channel of the program; decode the audio data based on the active channel metadata; output the decoded audio data to one or more speaker or object channels; responsive to extraction of the substream structure metadata: extract, from the substream structure metadata, the independent substream metadata and the dependent substream metadata; decode the audio data based on the independent substream metadata and the dependent substream metadata; and output the decoded audio data to one or more speaker channels or object channels. 2. The audio processing unit of claim 1 , wherein the program information metadata also includes at least one of: downmix processing state metadata indicative of whether the program was downmixed, and if so, a type of downmixing that was applied to the program; upmix processing state metadata indicative of whether the program was upmixed, and if so, a type of upmixing that was applied to the program; preprocessing state metadata indicative of whether preprocessing was performed on the audio data of the frame, and if so, a type of preprocessing that was performed on said audio data; or spectral extension processing or channel coupling metadata indicative of whether spectral extension processing or channel coupling was applied to the program, and if so, a frequency range that the spectral extension or channel coupling was applied. 3. The audio processing unit of claim 1 , wherein the metadata segment header includes a syncword identifying the start of the metadata segment, and at least one identification value following the syncword, and the header of the metadata payload includes at least one identification value. 4. The audio processing unit of claim 1 , wherein the encoded audio bitstream is an AC-3 bitstream or an E-AC-3 bitstream. 5. The audio processing unit of claim 1 , wherein the buffer memory stores the frame in a non-transitory manner. 6. The audio processing unit of claim 1 , wherein the audio processing unit is an encoder. 7. The audio processing unit of claim 6 , wherein said processing subsystem includes: a decoding subsystem configured to receive an input audio bitstream and to extract input metadata and input audio data from the input audio bitstream; an adaptive processing subsystem coupled and configured to perform adaptive processing on the input audio data using the input metadata, thereby generating processed audio data; and an encoding subsystem coupled and configured to generate the encoded audio bitstream in response to the processed audio data, including by including the program information metadata or the substream structure metadata in said encoded audio bitstream, and to assert the encoded audio bitstream to the buffer memory. 8. The audio processing unit of claim 1 , wherein the audio processing unit is a decoder. 9. The audio processing unit of claim 8 , wherein the processing subsystem is a decoding subsystem coupled to the buffer memory and configured to extract the program information metadata or the substream structure metadata from the encoded audio bitstream. 10. The audio processing unit of claim 1 , including: a subsystem coupled to the buffer memory and configured to extract the program information metadata or the substream structure metadata from the encoded audio bitstream and to extract the audio data from the encoded audio bitstream; and a post-processor, coupled to the subsystem and configured to perform adaptive processing on the audio data using at least one of the program information metadata or the substream structure metadata extracted from the encoded audio bitstream. 11. The audio processing unit of claim 1 , wherein said audio processing unit is a digital signal processor. 12. The audio processing unit of claim 1 , wherein the audio processing unit is a pre-processor configured to extract the program information metadata or the substream structure metadata and the audio data from the encoded audio bitstream, and to perform adaptive processing on the audio data using at least one of the program information metadata or the substream structure metadata extracted from the encoded audio bitstream. 13. A method for decoding an encoded audio bitstream, comprising: receiving an encoded audio bitstream that includes metadata and audio data; and extracting said metadata and said audio data, from the encoded audio bitstream, wherein the metadata includes program information metadata and substream structure metadata, wherein the encoded audio bitstream comprises a sequence of frames and is indicative of at least one audio program having at least one independent sub stream of audio data, the program information metadata is indicative of at least one type or parameter of processing performed prior to encoding the audio data of the audio program, and each channel of the audio program which is an active channel, and the substream structure metadata is indicative of whether each independent substream of the audio program has at least one associated dependent sub stream, each of the frames includes at least one audio data segment, each said audio data segment includes at least some of the audio data, each

Assignees

Inventors

Classifications

  • by changing the amplitude · CPC title

  • Pre-filtering or post-filtering · CPC title

  • Systems employing more than two channels, e.g. quadraphonic (H04S5/00, H04S7/00 take precedence) · CPC title

  • G10L19/167Primary

    Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes · CPC title

  • Audio watermarking, i.e. embedding inaudible data in the audio signal · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10147436B2 cover?
Apparatus and methods for generating an encoded audio bitstream, including by including substream structure metadata (SSM) and/or program information metadata (PIM) and audio data in the bitstream. Other aspects are apparatus and methods for decoding such a bitstream, and an audio processing unit (e.g., an encoder, decoder, or post-processor) configured (e.g., programmed) to perform any embodim…
Who is the assignee on this patent?
Dolby Laboratories Licensing Corp
What technology area does this patent fall under?
Primary CPC classification G10L19/167. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 04 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).