Audio encoder and decoder with program information or substream structure metadata

US2016196830A1 · US · A1

Patent metadata
Field: Value
Publication number: US-2016196830-A1
Application number: US-201414770375-A
Country: US
Kind code: A1
Filing date: Jun 12, 2014
Priority date: Jun 19, 2013
Publication date: Jul 7, 2016
Grant date:

Abstract

Official abstract text for this publication.

Apparatus and methods for generating an encoded audio bitstream, including by including substream structure metadata (SSM) and/or program information metadata (PIM) and audio data in the bitstream. Other aspects are apparatus and methods for decoding such a bitstream, and an audio processing unit (e.g., an encoder, decoder, or post-processor) configured (e.g., programmed) to perform any embodiment of the method or which includes a buffer memory which stores at least one frame of an audio bitstream generated in accordance with any embodiment of the method.

First claim

Opening claim text (preview).

What is claimed is:
1. An audio processing unit, including: a buffer memory; and at least one processing subsystem coupled to the buffer memory, wherein the buffer memory stores at least one frame of an encoded audio bitstream, said frame including program information metadata or substream structure metadata in at least one metadata segment of at least one skip field of the frame and audio data in at least one other segment of the frame, wherein the processing subsystem is coupled and configured to perform at least one of generation of the bitstream, decoding of the bitstream, or adaptive processing of audio data of the bitstream using metadata of the bitstream, or at least one of authentication or validation of at least one of audio data or metadata of the bitstream using metadata of the bitstream, wherein the metadata segment includes at least one metadata payload, said metadata payload comprising: a header; and after the header, at least some of the program information metadata or at least some of the substream structure metadata.
2. The audio processing unit of claim 1, wherein the encoded audio bitstream is indicative of at least one audio program, and the metadata segment includes a program information metadata payload, said program information metadata payload comprising: a program information metadata header; and after the program information metadata header, program information metadata indicative of at least one property or characteristic of audio content of the program, said program information metadata including active channel metadata indicative of each non-silent channel and each silent channel of the program.
3. The audio processing unit of claim 2, wherein the program information metadata also includes at least one of: downmix processing state metadata indicative of whether the program was downmixed, and if so, a type of downmixing that was applied to the program; upmix processing state metadata indicative of whether the program was upmixed, and if so, a type of upmixing that was applied to the program; preprocessing state metadata indicative of whether preprocessing was performed on audio content of the frame, and if so, a type of preprocessing that was performed on said audio content; or spectral extension processing or channel coupling metadata indicative of whether spectral extension processing or channel coupling was applied to the program, and if so, a frequency range that the spectral extension or channel coupling was applied.
4. The audio processing unit of claim 1, wherein the encoded audio bitstream is indicative of at least one audio program having at least one independent substream of audio content, and the metadata segment includes a substream structure metadata payload, said substream structure metadata payload comprising: a substream structure metadata payload header; and after the substream structure metadata payload header, independent substream metadata indicative of number of independent substreams of the program, and dependent substream metadata indicative of whether each independent substream of the program has at least one associated dependent substream.
5. The audio processing unit of claim 1, wherein the metadata segment includes: a metadata segment header; after the metadata segment header, at least one protection value useful for at least one of decryption, authentication, or validation of at least one of the program information metadata or the substream structure metadata or the audio data corresponding to said program information metadata or said substream structure metadata; and after the metadata segment header, metadata payload identification and payload configuration values, wherein the metadata payload follows the metadata payload identification and payload configuration values.
6. The audio processing unit of claim 5, wherein the metadata segment header includes a syncword identifying the start of the metadata segment, and at least one identification value following the syncword, and the header of the metadata payload includes at least one identification value.
7. The audio processing unit of claim 1, wherein the encoded audio bitstream is an AC-3 bitstream or an E-AC-3 bitstream.
8. The audio processing unit of claim 1, wherein the buffer memory stores the frame in a non-transitory manner.
9. The audio processing unit of claim 1, wherein the audio processing unit is an encoder.
10. The audio processing unit of claim 9, wherein said processing subsystem includes: a decoding subsystem configured to receive an input audio bitstream and to extract input metadata and input audio data from the input audio bitstream; an adaptive processing subsystem coupled and configured to perform adaptive processing on the input audio data using the input metadata, thereby generating processed audio data; and an encoding subsystem coupled and configured to generate the encoded audio bitstream in response to the processed audio data, including by including the program information metadata or the substream structure metadata in said encoded audio bitstream, and to assert the encoded audio bitstream to the buffer memory.
11. The audio processing unit of claim 1, wherein the audio processing unit is a decoder.
12. The audio processing unit of claim 11, wherein the processing subsystem is a decoding subsystem coupled to the buffer memory and configured to extract the program information metadata or the substream structure metadata from the encoded audio bitstream.
13. The audio processing unit of claim 1, including: a subsystem coupled to the buffer memory and configured to extract the program information metadata or the substream structure metadata from the encoded audio bitstream and to extract the audio data from the encoded audio bitstream; and a post-processor, coupled to the subsystem and configured to perform adaptive processing on the audio data using at least one of the program information metadata or the substream structure metadata extracted from the encoded audio bitstream.
14. The audio processing unit of claim 1, wherein said audio processing unit is a digital signal processor.
15. The audio processing unit of claim 1, wherein the audio processing unit is a pre-processor configured to extract the program information metadata or the substream structure metadata and the audio data from the encoded audio bitstream, and to perform adaptive processing on the audio data using at least one of the program information metadata or the substream structure metadata extracted from the encoded audio bitstream.
16. A method for decoding an encoded audio bitstream, said method including steps of: receiving an encoded audio bitstream; and extracting metadata and audio data from the encoded audio bitstream, wherein the metadata is or includes program information metadata and substream structure metadata, wherein the encoded audio bitstream comprises a sequence of frames and is indicative of at least one audio program, the program information metadata and the substream structure metadata are indicative of the program, each of the frames includes at least one audio data segment, each said audio data segment includes at least some of the audio data, each frame of at least a subset of the frames includes a metadata segment, and each said metadata segment includes at least some of the program information metadata and at least some of the substream structure metadata.
17. The method of claim 16, wherein the metadata segment includes a program information metadata
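As an illustration only, the ordering of fields recited in claims 1, 5, and 6 (a segment header with a syncword and an identification value, then a protection value, then payload identification and configuration values, then each payload with its own header) could be sketched as follows. This is a minimal sketch under stated assumptions, not the format defined by the patent or by AC-3/E-AC-3: the syncword value, the payload IDs, every field width, the use of a payload size as the "configuration value", and the class names `MetadataSegment` and `MetadataPayload` are all hypothetical.

```python
import struct
from dataclasses import dataclass
from typing import List

# Hypothetical constants: the claims do not fix any of these values or widths.
SYNCWORD = 0x5A5A        # assumed 16-bit syncword marking the segment start
PIM_PAYLOAD_ID = 1       # assumed ID for program information metadata
SSM_PAYLOAD_ID = 2       # assumed ID for substream structure metadata
PROTECTION_SIZE = 4      # assumed fixed size of the protection value, in bytes


@dataclass
class MetadataPayload:
    payload_id: int      # identification value carried in the payload header
    body: bytes          # PIM or SSM bytes that follow the payload header

    def pack(self) -> bytes:
        # Payload = one-byte header (identification value), then the metadata.
        return bytes([self.payload_id]) + self.body


@dataclass
class MetadataSegment:
    segment_id: int      # identification value following the syncword
    protection: bytes    # protection value (for example a digest)
    payloads: List[MetadataPayload]

    def pack(self) -> bytes:
        if len(self.protection) != PROTECTION_SIZE:
            raise ValueError("protection value must be 4 bytes in this sketch")
        out = struct.pack(">HB", SYNCWORD, self.segment_id)   # segment header
        out += self.protection                                # protection value
        out += struct.pack(">B", len(self.payloads))          # payload count
        for payload in self.payloads:
            raw = payload.pack()
            # Payload identification and configuration (here: size) values,
            # followed by the payload itself.
            out += struct.pack(">BH", payload.payload_id, len(raw)) + raw
        return out

    @classmethod
    def unpack(cls, data: bytes) -> "MetadataSegment":
        sync, segment_id = struct.unpack_from(">HB", data, 0)
        if sync != SYNCWORD:
            raise ValueError("syncword not found at start of segment")
        offset = 3
        protection = data[offset:offset + PROTECTION_SIZE]
        offset += PROTECTION_SIZE
        (count,) = struct.unpack_from(">B", data, offset)
        offset += 1
        payloads = []
        for _ in range(count):
            payload_id, size = struct.unpack_from(">BH", data, offset)
            offset += 3
            raw = data[offset:offset + size]
            offset += size
            if raw[0] != payload_id:
                raise ValueError("payload header does not match payload ID")
            payloads.append(MetadataPayload(raw[0], raw[1:]))
        return cls(segment_id, protection, payloads)


# Usage: build a segment with one PIM and one SSM payload and round-trip it.
segment = MetadataSegment(
    segment_id=7,
    protection=b"\x00\x01\x02\x03",
    payloads=[
        MetadataPayload(PIM_PAYLOAD_ID, b"\xaa"),
        MetadataPayload(SSM_PAYLOAD_ID, b"\x01\x02"),
    ],
)
restored = MetadataSegment.unpack(segment.pack())
```

Putting the payload size ahead of each payload is a design choice of this sketch: it lets a reader skip payload types it does not recognize. The claims only require that the payload follow its identification and configuration values.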

Assignees

Inventors

Classifications

  • by changing the amplitude · CPC title

  • Pre-filtering or post-filtering · CPC title

  • Vocoder architecture · CPC title

  • Systems employing more than two channels, e.g. quadraphonic (H04S5/00, H04S7/00 take precedence) · CPC title

  • G10L19/167 · Primary

    Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2016196830A1 cover?
Apparatus and methods for generating an encoded audio bitstream, including by including substream structure metadata (SSM) and/or program information metadata (PIM) and audio data in the bitstream. Other aspects are apparatus and methods for decoding such a bitstream, and an audio processing unit (e.g., an encoder, decoder, or post-processor) configured (e.g., programmed) to perform any embodim…
Who is the assignee on this patent?
Dolby Lab Licensing Corp
What technology area does this patent fall under?
Primary CPC classification G10L19/167. Mapped technology areas include Physics.
When was this patent published?
Publication date: Jul 7, 2016 (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).