Methods for parametric multi-channel encoding

US10360919B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10360919-B2
Application numberUS-201715646482-A
CountryUS
Kind codeB2
Filing dateJul 11, 2017
Priority dateFeb 21, 2013
Publication dateJul 23, 2019
Grant dateJul 23, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present document relates to audio coding systems. In particular, the present document relates to efficient methods and systems for parametric multi-channel audio coding. An audio encoding system configured to generate a bitstream indicative of a downmix signal and spatial metadata for generating a multi-channel upmix signal from the downmix signal is described. The system comprises a downmix processing unit configured to generate the downmix signal from a multi-channel input signal; wherein the downmix signal comprises m channels and wherein the multi-channel input signal comprises n channels; n, m being integers with m<n. Furthermore, the system comprises a parameter processing unit configured to determine the spatial metadata from the multi-channel input signal. In addition, the system comprises a configuration unit configured to determine one or more control settings for the parameter processing unit based on one or more external settings; wherein the one or more external settings comprise a target data-rate for the bitstream and wherein the one or more control settings comprise a maximum data-rate for the spatial metadata.

First claim

Opening claim text (preview).

The invention claimed is: 1. A method comprising: obtaining, by an audio decoder of a playback equipment, an encoded bitstream generated by an audio encoding system; extracting, from the encoded bitstream by the audio decoder, an audio signal; extracting, from the encoded bitstream by the audio decoder, a first set of dynamic range control (DRC) values configured for controlling a dynamic range of the audio signal during playback by the playback equipment, wherein the first set of DRC values are generated and encoded into the encoded bitstream by the audio encoding system; extracting, from the encoded bitstream, a second set of DRC values configured for preventing the audio signal from clipping during playback by the playback equipment, wherein the second set of DRC values are generated and encoded into the encoded bitstream by the audio encoding system, wherein each DRC value in the second set of DRC values represents a clipping protection gain indicating an attenuation to be applied to a corresponding frame of the audio signal to prevent clipping; extracting, from the encoded bitstream, first metadata indicating how to apply the first and second sets of DRC values to the audio signal; and applying the first and second sets of DRC values to the audio signal during playback by the playback equipment according to the first metadata; rendering the audio signal with the playback equipment. 2. The method of claim 1 , wherein the audio signal is a m-channel downmix audio signal, the method further comprises: applying the second set of DRC values to the m-channel downmix audio signal; extracting, from the encoded bitstream, spatial metadata; and upmixing the m-channel downmix audio signal into an n-channel audio signal using the spatial metadata, where m and n are positive integers and m is less than n. 3. The method of claim 1 , wherein the first set of DRC values are configured to dynamically compress the audio signal. 4. An apparatus comprising: one or more processors; memory storing instructions, which, when executed by the one or more processors, causes the one or more processors to perform operations comprising: obtaining, by an audio decoder of a playback equipment, an encoded bitstream generated by an audio encoding system; extracting, from the encoded bitstream by the audio decoder, an audio signal; extracting, from the encoded bitstream by the audio decoder, a first set of dynamic range control (DRC) values configured for controlling a dynamic range of the audio signal during playback by the playback equipment, wherein the first set of DRC values are generated and encoded into the encoded bitstream by the audio encoding system; extracting, from the encoded bitstream, a second set of DRC values configured for preventing the audio signal from clipping during playback by the playback equipment, wherein the second set of DRC values are generated and encoded into the encoded bitstream by the audio encoding system, wherein each DRC value in the second set of DRC values represents a clipping protection gain indicating an attenuation to be applied to a corresponding frame of the audio signal to prevent clipping; extracting, from the encoded bitstream, first metadata indicating how to apply the first and second sets of DRC values to the audio signal; and applying the first and second sets of DRC values to the audio signal during playback by the playback equipment according to the first metadata; rendering the audio signal with the playback equipment. 5. The apparatus of claim 4 , wherein the audio signal is a m-channel downmix audio signal, the operations further comprising: applying the second set of DRC values to the m-channel downmix audio signal; extracting, from the encoded bitstream, spatial metadata; and upmixing the m-channel downmix audio signal into an n-channel audio signal using the spatial metadata, where m and n are positive integers and m is less than n. 6. The apparatus of claim 4 , wherein the first set of DRC values are configured to dynamically compress the audio signal.

Assignees

Inventors

Classifications

  • Application of parametric coding in stereophonic audio systems · CPC title

  • Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes · CPC title

  • in which the audio signals are in digital form, i.e. employing more than two discrete digital channels (data reduction aspects thereof based on psychoacoustics G10L19/02) · CPC title

  • Multi-channel, i.e. more than two input channels, sound reproduction with two speakers wherein the multi-channel information is substantially preserved · CPC title

  • Aspects of down-mixing multi-channel audio to configurations with lower numbers of playback channels, e.g. 7.1 -> 5.1 (H04S2400/01 takes precedence) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10360919B2 cover?
The present document relates to audio coding systems. In particular, the present document relates to efficient methods and systems for parametric multi-channel audio coding. An audio encoding system configured to generate a bitstream indicative of a downmix signal and spatial metadata for generating a multi-channel upmix signal from the downmix signal is described. The system comprises a downmi…
Who is the assignee on this patent?
Dolby Int Ab
What technology area does this patent fall under?
Primary CPC classification G10L19/008. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 23 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).