Method and apparatus for generating side information bitstream of multi-object audio signal

US9299352B2 · US · B2

Patent metadata
Publication number: US-9299352-B2
Application number: US-93301909-A
Country: US
Kind code: B2
Filing date: Mar 30, 2009
Priority date: Mar 31, 2008
Publication date: Mar 29, 2016
Grant date: Mar 29, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Provided is a method and apparatus for generating a side information bitstream of a multi-object audio signal. The apparatus for generating a side information bitstream of a multi-object audio signal includes a spatial cue information input unit configured to receive spatial cue information generated in an encoder of the multi-object audio signal, a preset information input unit configured to receive preset information for the multi-object audio signal, and a side information bitstream generator configured to generate the side information bitstream based on the spatial cue information and the preset information. The side information bitstream includes a header region and a frame region, and the preset information is included in the frame region.
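
The layout described in the abstract (a header region followed by a frame region that carries the preset information) can be illustrated with a minimal Python sketch. The byte layout, the `SIDE` marker, the length prefixes, and the names `Frame` and `generate_side_info` are assumptions made for illustration only; the abstract does not specify any of them, and the spatial cue and preset payloads are treated as opaque bytes.

```python
import struct
from dataclasses import dataclass
from typing import List

MAGIC = b"SIDE"  # hypothetical marker, not defined by the patent


@dataclass
class Frame:
    spatial_cues: bytes   # opaque spatial cue payload produced by the encoder
    presets: List[bytes]  # opaque preset payloads that apply to this frame


def generate_side_info(frames: List[Frame]) -> bytes:
    """Serialize frames into a header region followed by a frame region."""
    # Header region (assumed layout): marker plus a 16-bit frame count.
    out = bytearray(MAGIC + struct.pack(">H", len(frames)))
    # Frame region: every frame carries its spatial cues and its own presets,
    # so the preset information lives in the frame region, not in the header.
    for frame in frames:
        out += struct.pack(">H", len(frame.spatial_cues)) + frame.spatial_cues
        out += struct.pack(">B", len(frame.presets))
        for preset in frame.presets:
            out += struct.pack(">H", len(preset)) + preset
    return bytes(out)


bitstream = generate_side_info([Frame(b"\x01\x02", [b"AB"])])
```

Keeping the presets next to each frame's cues, rather than once in the header, is what would let the preset change from frame to frame in a layout like this one.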

First claim

Opening claim text (preview).

What is claimed is:

1. An apparatus for generating a side information bitstream of a multi-object audio signal, comprising: a spatial cue information input unit configured to receive spatial cue information generated in an encoder of the multi-object audio signal; a preset information input unit configured to receive preset information for the multi-object audio signal; and a side information bitstream generator configured to generate the side information bitstream based on the spatial cue information and the preset information, wherein the side information bitstream includes a frame region, wherein the frame region includes the preset information for rendering a multi-object audio signal corresponding to a frame wherein the preset information includes (i) a layout of a playback system for a mono system, a stereo system and multi-channel system, (ii) an audio object ID, (iii) object location, (iv) object level and (v) an azimuth degree and an elevation degree of the object, wherein the preset information is used to define audio scene for rendering a multi-object audio signal.

2. The apparatus of claim 1, wherein the frame region includes one or more frames and at least one of the frames includes one or more preset information.

3. The apparatus of claim 1, wherein at least one of the preset information is used to render a multi-object audio signal corresponding to the frame region.

4. An apparatus for analyzing a side information bitstream of a multi-object audio signal, comprising: a side information bitstream input unit configured to receive the side information bitstream; a spatial cue information extractor configured to extract spatial cue information based on the side information bitstream; and a preset information extractor configured to extract preset information from a frame region of the side information bitstream, wherein the side information bitstream includes the frame region, wherein the preset information includes: (i) a layout of a playback system for a mono system, a stereo system and multi-channel system, (ii) an audio object ID, (iii) object location, (iv) object level and (v) an azimuth degree and an elevation degree of the object, wherein the preset information is used to define audio scene for rendering a multi-object audio signal.

5. The apparatus of claim 4, wherein the frame region includes one or more frames and at least one of the frames includes one or more preset information.

6. The apparatus of claim 4, wherein at least one of the preset information is used to render a multi-object audio signal corresponding to the frame region.

7. An apparatus for encoding a multi-object audio signal, comprising: an encoder configured to down-mix an audio signal formed of a plurality of objects and generate spatial cue information for the audio signal formed of the plurality of objects; and a side information bitstream generator configured to generate a side information bitstream based on preset information for the spatial cue information and the audio signal, wherein the side information bitstream includes a frame region, wherein the frame region includes the preset information for rendering a multi-object audio signal corresponding to a frame, wherein the preset information includes (i) a layout of a playback system for a mono system, a stereo system and multi-channel system, (ii) an audio object ID, (iii) object location, (iv) object level and (v) an azimuth degree and an elevation degree of the object, wherein the preset information is used to define audio scene for rendering a multi-object audio signal.

8. An apparatus for decoding a multi-object audio signal, comprising: a side information bitstream analyzer configured to receive a side information bitstream and extract spatial cue information and preset information included in a frame region of the side information bitstream, wherein the side information bitstream includes the frame region; a decoder configured to restore an audio signal formed of a plurality of audio objects based on the spatial cue information from an input down-mixed audio signal; and a renderer configured to render an audio signal formed of the plurality of objects into an audio signal formed of a plurality of channels based on the preset information, wherein the frame region includes the preset information for rendering a multi-object audio signal corresponding to a frame, wherein the preset information includes (i) a layout of a playback system for a mono system, a stereo system and multi-channel system, (ii) an audio object ID, (iii) object location, (iv) object level and (v) an azimuth degree and an elevation degree of the object, wherein the preset information is used to define audio scene for rendering a multi-object audio signal.

9. A method for generating a side information bitstream of a multi-object audio signal, comprising: receiving spatial cue information generated in an encoder of the multi-object audio signal; receiving preset information of the multi-object audio signal; and generating the side information bitstream based on the spatial cue information and the preset information, wherein the side information bitstream includes a frame region, wherein the frame region includes the preset information for rendering a multi-object audio signal corresponding to a frame, wherein the preset information includes (i) a layout of a playback system for a mono system, a stereo system and multi-channel system, (ii) an audio object ID, (iii) object location, (iv) object level and (v) an azimuth degree and an elevation degree of the object, wherein the preset information is used to define audio scene for rendering a multi-object audio signal.

10. The method of claim 9, wherein the frame region includes one or more frames and at least one of the frames includes one or more preset information.

11. The method of claim 9, wherein at least one of the preset information is used to render a multi-object audio signal corresponding to the frame region.

12. A method for analyzing a side information bitstream of a multi-object audio signal, comprising: receiving the side information bitstream; and extracting preset information from a frame region of the side information bitstream, wherein the side information bitstream includes the frame region, wherein the frame region includes the preset information for rendering a multi-object audio signal corresponding to a frame, wherein the preset information includes (i) a layout of a playback system for a mono system, a stereo system and multi-channel system, (ii) an audio object ID, (iii) object location, (iv) object level and (v) an azimuth degree and an elevation degree of the object, wherein the preset information is used to define audio scene for rendering a multi-object audio signal.

13. The method of claim 12, wherein the frame region includes one or more frames and at least one of the frames includes one or more preset information.

14. The method of claim 12, wherein at least one of the preset information is used to render a multi-object audio signal corresponding to the frame region.

15. A method for encoding a multi-object audio signal, comprising: down-mixing an audio signal formed of a plurality of objects and generating spatial cue information for the audio signal formed of a plurality of objects; and generating a side information bitstream based on preset information for the spatial cue information and the audio signal, wherein the side information bitstream includes a frame region, wherein the frame region includes the preset information for rendering a multi-object audio

Classifications

  • G10L19/008 (Primary)

    Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title

  • Positioning of individual sound objects, e.g. moving airplane, within a sound field (H04S2420/13 takes precedence) · CPC title

  • Aspects of down-mixing multi-channel audio to configurations with lower numbers of playback channels, e.g. 7.1 -> 5.1 (H04S2400/01 takes precedence) · CPC title

  • Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation · CPC title

  • H04S7/308 (Primary)

    Electronic adaptation dependent on speaker or headphone connection · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9299352B2 cover?
Provided is a method and apparatus for generating a side information bitstream of a multi-object audio signal. The apparatus for generating a side information bitstream of a multi-object audio signal includes a spatial cue information input unit configured to receive spatial cue information generated in an encoder of the multi-object audio signal, a preset information input unit configured to r…
Who is the assignee on this patent?
Seo Jeong-Il, Beack Seung-Kwon, Lee Tae-Jin, and 7 more
What technology area does this patent fall under?
Primary CPC classification G10L19/008. Mapped technology areas include Physics.
When was this patent published?
Publication date: Mar 29, 2016 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).