US9299352B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9299352-B2 |
| Application number | US-93301909-A |
| Country | US |
| Kind code | B2 |
| Filing date | Mar 30, 2009 |
| Priority date | Mar 31, 2008 |
| Publication date | Mar 29, 2016 |
| Grant date | Mar 29, 2016 |
Official abstract text for this publication.
Provided is a method and apparatus for generating a side information bitstream of a multi-object audio signal. The apparatus for generating a side information bitstream of a multi-object audio signal includes a spatial cue information input unit configured to receive spatial cue information generated in an encoder of the multi-object audio signal, a preset information input unit configured to receive preset information for the multi-object audio signal, and a side information bitstream generator configured to generate the side information bitstream based on the spatial cue information and the preset information. The side information bitstream includes a header region and a frame region, and the preset information is included in the frame region.
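The layout described in the abstract can be pictured as a small data model. The following is a minimal sketch, not the patent's actual bitstream syntax: the class names, field types, the `num_frames` header entry, and the functions `generate_side_info` and `extract_presets` are all hypothetical, and no bit-level packing is attempted because this record does not give it. The sketch only shows that preset information (playback layout, object ID, location, level, azimuth and elevation) is stored per frame alongside the spatial cue information.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Dict, List, Tuple


class PlaybackLayout(Enum):
    """Playback system layouts named in the claims."""
    MONO = "mono"
    STEREO = "stereo"
    MULTI_CHANNEL = "multi-channel"


@dataclass
class ObjectPreset:
    """Per-object preset fields (types are assumptions)."""
    object_id: int
    location: Tuple[float, float, float]  # hypothetical representation
    level: float
    azimuth_deg: float
    elevation_deg: float


@dataclass
class PresetInfo:
    """One preset: a playback layout plus per-object settings."""
    layout: PlaybackLayout
    objects: List[ObjectPreset]


@dataclass
class Frame:
    """One frame of the frame region: spatial cues plus zero or more presets."""
    spatial_cues: List[float]
    presets: List[PresetInfo] = field(default_factory=list)


@dataclass
class SideInfoBitstream:
    """Header region plus frame region (header contents are hypothetical)."""
    header: Dict[str, int]
    frames: List[Frame]


def generate_side_info(
    cues_per_frame: List[List[float]],
    presets_per_frame: List[List[PresetInfo]],
) -> SideInfoBitstream:
    """Combine spatial cues and presets frame by frame."""
    if len(cues_per_frame) != len(presets_per_frame):
        raise ValueError("need one preset list per frame of spatial cues")
    frames = [
        Frame(spatial_cues=list(cues), presets=list(presets))
        for cues, presets in zip(cues_per_frame, presets_per_frame)
    ]
    return SideInfoBitstream(header={"num_frames": len(frames)}, frames=frames)


def extract_presets(bitstream: SideInfoBitstream, frame_index: int) -> List[PresetInfo]:
    """Return the presets stored in one frame of the frame region."""
    return bitstream.frames[frame_index].presets


if __name__ == "__main__":
    vocal = ObjectPreset(object_id=1, location=(0.0, 1.0, 0.0), level=0.8,
                         azimuth_deg=30.0, elevation_deg=0.0)
    preset = PresetInfo(layout=PlaybackLayout.STEREO, objects=[vocal])
    stream = generate_side_info([[0.5, 0.25], [0.4, 0.3]], [[preset], []])
    print(stream.header["num_frames"], len(extract_presets(stream, 0)))
```

Keeping the presets inside each frame, rather than only in the header, lets a renderer pick up a different audio scene definition for each frame, which is the property the claims emphasize.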
Opening claim text (preview).
What is claimed is:
1. An apparatus for generating a side information bitstream of a multi-object audio signal, comprising: a spatial cue information input unit configured to receive spatial cue information generated in an encoder of the multi-object audio signal; a preset information input unit configured to receive preset information for the multi-object audio signal; and a side information bitstream generator configured to generate the side information bitstream based on the spatial cue information and the preset information, wherein the side information bitstream includes a frame region, wherein the frame region includes the preset information for rendering a multi-object audio signal corresponding to a frame wherein the preset information includes (i) a layout of a playback system for a mono system, a stereo system and multi-channel system, (ii) an audio object ID, (iii) object location, (iv) object level and (v) an azimuth degree and an elevation degree of the object, wherein the preset information is used to define audio scene for rendering a multi-object audio signal.
2. The apparatus of claim 1, wherein the frame region includes one or more frames and at least one of the frames includes one or more preset information.
3. The apparatus of claim 1, wherein at least one of the preset information is used to render a multi-object audio signal corresponding to the frame region.
4. An apparatus for analyzing a side information bitstream of a multi-object audio signal, comprising: a side information bitstream input unit configured to receive the side information bitstream; a spatial cue information extractor configured to extract spatial cue information based on the side information bitstream; and a preset information extractor configured to extract preset information from a frame region of the side information bitstream, wherein the side information bitstream includes the frame region, wherein the preset information includes: (i) a layout of a playback system for a mono system, a stereo system and multi-channel system, (ii) an audio object ID, (iii) object location, (iv) object level and (v) an azimuth degree and an elevation degree of the object, wherein the preset information is used to define audio scene for rendering a multi-object audio signal.
5. The apparatus of claim 4, wherein the frame region includes one or more frames and at least one of the frames includes one or more preset information.
6. The apparatus of claim 4, wherein at least one of the preset information is used to render a multi-object audio signal corresponding to the frame region.
7. An apparatus for encoding a multi-object audio signal, comprising: an encoder configured to down-mix an audio signal formed of a plurality of objects and generate spatial cue information for the audio signal formed of the plurality of objects; and a side information bitstream generator configured to generate a side information bitstream based on preset information for the spatial cue information and the audio signal, wherein the side information bitstream includes a frame region, wherein the frame region includes the preset information for rendering a multi-object audio signal corresponding to a frame, wherein the preset information includes (i) a layout of a playback system for a mono system, a stereo system and multi-channel system, (ii) an audio object ID, (iii) object location, (iv) object level and (v) an azimuth degree and an elevation degree of the object, wherein the preset information is used to define audio scene for rendering a multi-object audio signal.
8. An apparatus for decoding a multi-object audio signal, comprising: a side information bitstream analyzer configured to receive a side information bitstream and extract spatial cue information and preset information included in a frame region of the side information bitstream, wherein the side information bitstream includes the frame region; a decoder configured to restore an audio signal formed of a plurality of audio objects based on the spatial cue information from an input down-mixed audio signal; and a renderer configured to render an audio signal formed of the plurality of objects into an audio signal formed of a plurality of channels based on the preset information, wherein the frame region includes the preset information for rendering a multi-object audio signal corresponding to a frame, wherein the preset information includes (i) a layout of a playback system for a mono system, a stereo system and multi-channel system, (ii) an audio object ID, (iii) object location, (iv) object level and (v) an azimuth degree and an elevation degree of the object, wherein the preset information is used to define audio scene for rendering a multi-object audio signal.
9. A method for generating a side information bitstream of a multi-object audio signal, comprising: receiving spatial cue information generated in an encoder of the multi-object audio signal; receiving preset information of the multi-object audio signal; and generating the side information bitstream based on the spatial cue information and the preset information, wherein the side information bitstream includes a frame region, wherein the frame region includes the preset information for rendering a multi-object audio signal corresponding to a frame, wherein the preset information includes (i) a layout of a playback system for a mono system, a stereo system and multi-channel system, (ii) an audio object ID, (iii) object location, (iv) object level and (v) an azimuth degree and an elevation degree of the object, wherein the preset information is used to define audio scene for rendering a multi-object audio signal.
10. The method of claim 9, wherein the frame region includes one or more frames and at least one of the frames includes one or more preset information.
11. The method of claim 9, wherein at least one of the preset information is used to render a multi-object audio signal corresponding to the frame region.
12. A method for analyzing a side information bitstream of a multi-object audio signal, comprising: receiving the side information bitstream; and extracting preset information from a frame region of the side information bitstream, wherein the side information bitstream includes the frame region, wherein the frame region includes the preset information for rendering a multi-object audio signal corresponding to a frame, wherein the preset information includes (i) a layout of a playback system for a mono system, a stereo system and multi-channel system, (ii) an audio object ID, (iii) object location, (iv) object level and (v) an azimuth degree and an elevation degree of the object, wherein the preset information is used to define audio scene for rendering a multi-object audio signal.
13. The method of claim 12, wherein the frame region includes one or more frames and at least one of the frames includes one or more preset information.
14. The method of claim 12, wherein at least one of the preset information is used to render a multi-object audio signal corresponding to the frame region.
15. A method for encoding a multi-object audio signal, comprising: down-mixing an audio signal formed of a plurality of objects and generating spatial cue information for the audio signal formed of a plurality of objects; and generating a side information bitstream based on preset information for the spatial cue information and the audio signal, wherein the side information bitstream includes a frame region, wherein the frame region includes the preset information for rendering a multi-object audio
Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title
Positioning of individual sound objects, e.g. moving airplane, within a sound field (H04S2420/13 takes precedence) · CPC title
Aspects of down-mixing multi-channel audio to configurations with lower numbers of playback channels, e.g. 7.1 -> 5.1 (H04S2400/01 takes precedence) · CPC title
Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation · CPC title
Electronic adaptation dependent on speaker or headphone connection · CPC title