Advanced stereo coding based on a combination of adaptively selectable left/right or mid/side stereo coding and of parametric stereo coding

US11017785B2 · US · B2

Patent metadata
Publication number: US-11017785-B2
Application number: US-201916369728-A
Country: US
Kind code: B2
Filing date: Mar 29, 2019
Priority date: Mar 17, 2009
Publication date: May 25, 2021
Grant date: May 25, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The application relates to audio encoder and decoder systems. An embodiment of the encoder system comprises a downmix stage for generating a downmix signal and a residual signal based on a stereo signal. In addition, the encoder system comprises a parameter determining stage for determining parametric stereo parameters such as an inter-channel intensity difference and an inter-channel cross-correlation. Preferably, the parametric stereo parameters are time- and frequency-variant. Moreover, the encoder system comprises a transform stage. The transform stage generates a pseudo left/right stereo signal by performing a transform based on the downmix signal and the residual signal. The pseudo stereo signal is processed by a perceptual stereo encoder. For stereo encoding, left/right encoding or mid/side encoding is selectable. Preferably, the selection between left/right stereo encoding and mid/side stereo encoding is time- and frequency-variant.
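
The encoder stages named in the abstract can be illustrated with a minimal Python sketch for a single time/frequency tile. The intensity difference and cross-correlation follow their usual textbook definitions. Everything else is an assumption made for illustration and is not taken from the patent text: the function names, the mid-signal downmix, the single prediction gain `alpha` that stands in for the parametric stereo model, and the energy rule for choosing between left/right and mid/side coding.

```python
import math

EPS = 1e-12  # guards against division by zero in silent tiles


def energy(x):
    return sum(v * v for v in x)


def stereo_parameters(left, right):
    """Inter-channel intensity difference (dB) and cross-correlation for one tile."""
    e_l, e_r = energy(left), energy(right)
    cross = sum(l * r for l, r in zip(left, right))
    iid_db = 10.0 * math.log10((e_l + EPS) / (e_r + EPS))
    icc = cross / math.sqrt((e_l + EPS) * (e_r + EPS))
    return iid_db, icc


def downmix_and_residual(left, right):
    """Assumed downmix stage: mid signal, plus the part of the side signal
    that a single prediction gain `alpha` cannot explain (the residual)."""
    dmx = [(l + r) / 2 for l, r in zip(left, right)]
    side = [(l - r) / 2 for l, r in zip(left, right)]
    alpha = sum(s * d for s, d in zip(side, dmx)) / (energy(dmx) + EPS)
    res = [s - alpha * d for s, d in zip(side, dmx)]
    return dmx, res, alpha


def pseudo_left_right(dmx, res):
    """Transform stage: sum and difference of downmix and residual."""
    return ([d + r for d, r in zip(dmx, res)],
            [d - r for d, r in zip(dmx, res)])


def prefer_mid_side(first, second):
    """Hypothetical per-band rule: pick mid/side when its weaker channel is
    quieter than the weaker left/right channel."""
    mid = [(a + b) / 2 for a, b in zip(first, second)]
    side = [(a - b) / 2 for a, b in zip(first, second)]
    return min(energy(mid), energy(side)) < min(energy(first), energy(second))
```

Under these assumptions, mid/side coding of the pseudo left/right pair yields the downmix as the mid channel and the residual as the side channel. For a tile where the right channel is a scaled copy of the left, the residual is close to zero and the rule above picks mid/side. A real perceptual encoder would typically base the decision on a psychoacoustic bit-demand estimate rather than raw energies.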

First claim

Opening claim text (preview).

The invention claimed is:

1. An audio signal processing device for encoding a stereo signal to a bitstream signal, the audio signal processing device comprising one or more components that: generate an intermediate stereo signal and stereo SBR parameters in response to the stereo signal; generate a downmix signal, a residual signal, and one or more parametric stereo parameters based on the intermediate stereo signal, wherein the residual signal indicates an error associated with representing the intermediate signal by the downmix signal and the one or more parametric stereo parameters; generate, in a frequency-variant or frequency-invariant manner, a first signal and a second signal based on either: a sum of the downmix signal and the residual signal and a difference of the downmix signal and the residual signal; or the downmix signal and the residual signal; generating an encoded stereo signal by perceptual encoding the first signal and the second signal; and generating the bitstream signal by combining the stereo SBR parameters, the parametric stereo parameters, and the encoded stereo signal.

2. The audio signal processing device of claim 1, wherein perceptual encoding comprises: generating, in a frequency-variant or frequency-invariant manner, the encoded stereo signal by performing either: left/right perceptual encoding of the first signal and the second signal; or mid/side perceptual encoding of the first signal and the second signal.

3. The audio signal processing device of claim 2, wherein perceptual encoding comprises selecting, in a frequency-variant or frequency-invariant manner and based on the first signal and the second signal, between either: left/right perceptual encoding of the first signal and the second signal; or mid/side perceptual encoding of the first signal and the second signal.

4. The audio signal processing device of claim 2, wherein left/right perceptual encoding of the first signal and the second signal is performed for some frequency bands, and mid/side perceptual encoding of the first signal and the second signal is performed for other frequency bands.

5. A audio signal processing device for decoding a bitstream signal including stereo SBR parameters and one or more parametric stereo parameters to a stereo signal, the audio signal processing device comprising one or more components that: generate a first signal and a second signal by perceptual decoding the bitstream signal; generate, in a frequency-variant or frequency-invariant manner, a downmix signal and a residual signal based on either: a sum of the first signal and of the second signal and a difference of the first signal and of the second signal; or the first signal and the second signal; generate an intermediate stereo signal by performing an upmix operation in response to the downmix signal, the residual signal, and the parametric stereo parameters, wherein the residual signal indicates an error associated with representing the first signal and the second signal by the downmix signal and the parametric stereo parameters; and generate the stereo signal by performing a stereo SBR decoding operation in response to the intermediate stereo signal and the stereo SBR parameters.

6. The audio signal processing device of claim 5, wherein perceptual decoding the bitstream signal comprises: generating, in a frequency-variant or frequency-invariant manner, the first signal and the second signal by performing either: left/right perceptual decoding of the bitstream signal; or mid/side perceptual decoding of the bitstream signal.

7. The audio signal processing device of claim 6, wherein left/right perceptual decoding of the bitstream signal is performed for some frequency bands, and mid/side perceptual decoding of the bitstream signal is performed for other frequency bands.

8. The audio signal processing device of claim 5, wherein the parametric stereo parameters comprise: a frequency-variant or a frequency-invariant parameter indicating an inter-channel intensity difference; and a frequency-variant or a frequency-invariant parameter indicating an inter-channel cross-correlation.

9. A method, performed by an audio signal processing device, for decoding a bitstream signal including stereo SBR parameters and one or more parametric stereo parameters to a stereo signal, the method comprising: generating a first signal and a second signal by perceptual decoding the bitstream signal; generating, in a frequency-variant or frequency-invariant manner, a downmix signal and a residual signal based on either: a sum of the first signal and of the second signal and based on a difference of the first signal and of the second signal; or the first signal and the second signal; generating an intermediate stereo signal by performing an upmix operation in response to the downmix signal, the residual signal, and the parametric stereo parameters, wherein the residual signal indicates an error associated with representing the first signal and the second signal by the downmix signal and the parametric stereo parameters; and generating the stereo signal by performing a stereo SBR decoding operation in response to the intermediate stereo signal and the stereo SBR parameters; wherein the method is performed, at least in part, by one or more components of the audio signal processing device.

10. A non-transitory computer readable storage medium comprising a sequence of instructions, wherein, when executed by an audio signal processing device, the sequence of instructions causes the audio signal processing device to perform the method of claim 9.
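
The decoding steps of claims 5 to 9 can be sketched for one frequency band as follows. This is a simplified illustration, not the claimed implementation. The function names are hypothetical, the single gain `alpha` is an assumed stand-in for the parametric stereo parameters, the downmix is assumed to be the mid signal, and the sketch always takes the sum/difference branch. The pass-through alternative, the perceptual codec itself, and stereo SBR decoding are not modeled.

```python
def decode_band(ch0, ch1, coded_as_ms, alpha):
    """Reconstruct (left, right) for one frequency band.

    ch0, ch1    -- the two perceptually decoded channels of the band
    coded_as_ms -- True if the band was mid/side coded, False for left/right
    alpha       -- hypothetical prediction gain standing in for the
                   parametric stereo parameters
    """
    # Step 1: undo mid/side coding where it was used, giving the
    # "first" and "second" signals of the claims.
    if coded_as_ms:
        first = [m + s for m, s in zip(ch0, ch1)]
        second = [m - s for m, s in zip(ch0, ch1)]
    else:
        first, second = list(ch0), list(ch1)

    # Step 2: sum and difference recover the downmix and the residual.
    dmx = [(a + b) / 2 for a, b in zip(first, second)]
    res = [(a - b) / 2 for a, b in zip(first, second)]

    # Step 3: upmix. The side signal is the predicted part plus the residual.
    side = [alpha * d + r for d, r in zip(dmx, res)]
    left = [d + s for d, s in zip(dmx, side)]
    right = [d - s for d, s in zip(dmx, side)]
    return left, right


def encode_band(left, right, alpha):
    """Matching encoder-side transform, included only to check the round trip."""
    dmx = [(l + r) / 2 for l, r in zip(left, right)]
    side = [(l - r) / 2 for l, r in zip(left, right)]
    res = [s - alpha * d for s, d in zip(side, dmx)]
    first = [d + r for d, r in zip(dmx, res)]
    second = [d - r for d, r in zip(dmx, res)]
    return first, second
```

Without quantization, `decode_band(*encode_band(left, right, alpha), False, alpha)` returns the input pair up to floating-point rounding, for any value of `alpha`. The same holds when the band is passed as mid/side channels with `coded_as_ms=True`, which is why the left/right or mid/side choice can vary from band to band without changing the later stages.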

Assignees

Dolby Int Ab

Inventors

Classifications

  • G10L19/008 (Primary)

    Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing

  • of the matrix type, i.e. in which input signals are combined algebraically, e.g. after having been phase shifted with respect to each other

  • Vocoders using multiple modes

  • G10L19/002 (Primary)

    Dynamic bit allocation (for perceptual audio coders G10L19/032)

  • Aspects of down-mixing multi-channel audio to configurations with lower numbers of playback channels, e.g. 7.1 -> 5.1 (H04S2400/01 takes precedence)


Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11017785B2 cover?
The application relates to audio encoder and decoder systems. An embodiment of the encoder system comprises a downmix stage for generating a downmix signal and a residual signal based on a stereo signal. In addition, the encoder system comprises a parameter determining stage for determining parametric stereo parameters such as an inter-channel intensity difference and an inter-channel cross-cor…
Who is the assignee on this patent?
Dolby Int Ab
What technology area does this patent fall under?
Primary CPC classification G10L19/008. Mapped technology areas include Physics.
When was this patent published?
Publication date May 25, 2021 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).