Multi-object audio encoding and decoding apparatus supporting post down-mix signal

US9685167B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9685167-B2
Application numberUS-200913054662-A
CountryUS
Kind codeB2
Filing dateJul 16, 2009
Priority dateJul 16, 2008
Publication dateJun 20, 2017
Grant dateJun 20, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A multi-object audio encoding and decoding apparatus supporting a post downmix signal may be provided. The multi-object audio encoding apparatus may include: an object information extraction and downmix generation unit to generate object information and a downmix signal from input object signals; a parameter determination unit to determine a downmix information parameter using the extracted downmix signal and the post downmix signal; and a bitstream generation unit to combine the object information and the downmix information parameter, and to generate an object bitstream.

First claim

Opening claim text (preview).

The invention claimed is: 1. A multi-object audio encoding apparatus, comprising: at least one hardware processor to: generate object information using input object signals and extract a downmix signal from the input object signals; determine a Post Downmix Gain (PDG) to compensate for a difference between the extracted downmix signal and a post downmix signal supplied from a source that is external to the multi-object audio encoding apparatus, the PDG being useable to adjust for the post downmix signal according to a relationship between the extracted downmix signal and the post downmix signal; and generate an object bitstream including the PDG and the object information, wherein the difference between the downmix signal and the post downmix signal is compensated by applying a mixing matrix including the PDG included with the object bitstream generated at the multi-object audio encoding apparatus, wherein the mixing matrix is determined based on either mono downmix or stereo downmix. 2. The multi-object audio encoding apparatus of claim 1 , wherein the object information comprises spatial cue parameters predicted from the input object signals. 3. The multi-object audio encoding apparatus of claim 1 , wherein the at least one processor is configured operate as: a power offset calculator that scales the post downmix signal as a predetermined value to enable an average power of the post downmix signal in a particular frame to be identical to an average power of the downmix signal; and a parameter extractor that extracts the PDG from the scaled post downmix signal in a predetermined frame. 4. The multi-object audio encoding apparatus of claim 1 , wherein the at least one processor calculates a Downmix Channel Level Difference (DCLD) and a Downmix Gain (DMG) indicating a mixing amount of the input object signals. 5. The multi-object audio encoding apparatus of claim 1 , wherein the at least one processor generates a residual signal corresponding to the difference between the downmix signal and the post downmix signal, and transmits the object bitstream including the residual signal, the difference between the downmix signal and the post downmix signal being compensated for by applying the PDG. 6. The multi-object audio encoding apparatus of claim 5 , wherein the residual signal is generated with respect to a frequency band that affects a sound quality of the input object signals, and transmitted through the object bitstream. 7. A multi-object audio decoding apparatus which decodes a multi-object audio, comprising: at least one hardware processor to: extracting a Post Downmix Gain (PDG) and object information from an object bitstream; decoding a downmix signal using the object information and generates an object signal; and compensating a difference between the downmix signal and a post downmix signal supplied from a source that is external to the multi-object audio decoding apparatus, based on the PDG, the PDG being useable to adjust for the post downmix signal according to a relationship between the decoded downmix signal and the post downmix signal, wherein the difference between the downmix signal and the post downmix signal is compensated by applying a mixing matrix including PDG transmitted to include the object bitstream to the multi-object audio decoding apparatus, wherein the mixing matrix is determined based on either mono downmix or stereo downmix. 8. The multi-object audio decoding apparatus of claim 7 , wherein the object information comprises spatial cue parameters predicted from input object signals. 9. The multi-object audio decoding apparatus of claim 8 , user control information is applied to the object signal generated from the decoding to generate a reproducible output signal. 10. The multi-object audio decoding apparatus of claim 8 , wherein the at least one processor is configured operate as: a power offset compensator that scales the post downmix signal using a power offset value extracted from the PDG as a downmix information parameter; and a downmix signal adjustor that converts the scaled post downmix signal into the downmix signal using the PDG. 11. The multi-object audio decoding apparatus of claim 10 , wherein a residual signal is referenced to the post downmix signal, which is compensated for by using the PDG, and the post downmix signal is adjusted to be similar to the downmix signal, and the residual signal is the difference between the downmix signal and the post downmix signal, the difference between the downmix signal and the post downmix signal being compensated for by applying the PDG.

Assignees

Inventors

Classifications

  • Lossless audio signal coding; Perfect reconstruction of coded audio signal by transmission of coding error (G10L19/24 takes precedence) · CPC title

  • G10L19/20Primary

    using sound class specific coding, hybrid encoders or object based coding · CPC title

  • Audio watermarking, i.e. embedding inaudible data in the audio signal · CPC title

  • G10L19/008Primary

    Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title

  • Scalar quantisation · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9685167B2 cover?
A multi-object audio encoding and decoding apparatus supporting a post downmix signal may be provided. The multi-object audio encoding apparatus may include: an object information extraction and downmix generation unit to generate object information and a downmix signal from input object signals; a parameter determination unit to determine a downmix information parameter using the extracted dow…
Who is the assignee on this patent?
Seo Jeongil, Beack Seungkwon, Kang Kyeongok, and 6 more
What technology area does this patent fall under?
Primary CPC classification G10L19/20. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jun 20 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).