Reconstruction of Audio Scenes from a Downmix
US-2016111099-A1 · Apr 21, 2016 · US
US10575111B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-10575111-B2 |
| Application number | US-201916354890-A |
| Country | US |
| Kind code | B2 |
| Filing date | Mar 15, 2019 |
| Priority date | Sep 5, 2013 |
| Publication date | Feb 25, 2020 |
| Grant date | Feb 25, 2020 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
An audio encoding apparatus and method that encodes hybrid contents including an object sound, a background sound, and metadata, and an audio decoding apparatus and method that decodes the encoded hybrid contents are provided. The audio encoding apparatus may include a mixing unit to generate an intermediate channel signal by mixing a background sound and an object sound, a matrix information encoding unit to encode matrix information used for the mixing, an audio encoding unit to encode the intermediate channel signal, and a metadata encoding unit to encode metadata including control information of the object sound.
Opening claim text (preview).
What is claimed is: 1. An audio decoding method performed by a processor, comprising: decoding an encoded intermediate channel signal included in a bitstream, and an object sound or a background sound to be used for unmixing of the decoded intermediate channel signal; decoding matrix information used for the unmixing the decoded intermediate channel signal; unmixing the decoded intermediate channel signal using the matrix information and outputs the object sound and the background sound; and decoding metadata including control information of the object sound and outputs the decoded metadata, wherein a number of channels of the intermediate channel signal has the same number of channels as a number of channels of the background sound, wherein the encoded intermediate channel signal is obtained by encoding the intermediate channel signal using an encoder, wherein a layout of a speaker system is rendered using the metadata based on audio reproduction environments. 2. The method of claim 1 , wherein the object sound is a controllable audio and a dynamic audio scene associated with the background sound is formed based on the object sound. 3. The method of claim 1 , wherein the intermediate channel signal is determined based on a channel gain of the background sound, and a gain of the object sound mixed with the background sound. 4. The method of claim 1 , wherein the intermediate channel is unmixed by using the object sound to output the background sound and the object sound or wherein the intermediate channel is unmixed by using the background sound to output the object sound and the background sound. 5. The method of claim 1 , further comprising: determining metadata to be used for rendering based on audio reproduction environment information; and rendering the background sound and the object sound based on the metadata. 6. An audio decoding method performed by a processor, comprising: decoding an encoded intermediate channel signal related to a layout of a speaker system, and a metadata, extracting a background sound, an object sound from the decoded intermediate channel signal, rendering the object sound and the background sound based on the metadata, wherein a number of channels of the intermediate channel signal has the same number of channels as a number of channels of the background sound, and wherein the encoded intermediate channel signal is obtained by encoding the intermediate channel signal using an encoder. 7. The method of claim 6 , wherein a layout of a speaker system is rendered using the metadata based on audio reproduction environments. 8. The method of claim 6 , wherein the object sound is a controllable audio and a dynamic audio scene associated with the background sound is formed based on the object sound. 9. The method of claim 6 , wherein the encoded intermediate channel signal is determined based on a channel gain of the background sound, and a gain of the object sound mixed with the background sound. 10. The method of claim 6 , wherein a target channel signal is outputted for expressing an audio scene by rendering the object sound and the background sound.
Application of parametric coding in stereophonic audio systems · CPC title
in which the audio signals are in digital form, i.e. employing more than two discrete digital channels (data reduction aspects thereof based on psychoacoustics G10L19/02) · CPC title
Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title
Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis (in musical instruments G10H) · CPC title
Digital recording or reproducing · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.