Compression of decomposed representations of a sound field
US-9502044-B2 · Nov 22, 2016 · US
US10075802B1 · US · B1
| Field | Value |
|---|---|
| Publication number | US-10075802-B1 |
| Application number | US-201715671988-A |
| Country | US |
| Kind code | B1 |
| Filing date | Aug 8, 2017 |
| Priority date | Aug 8, 2017 |
| Publication date | Sep 11, 2018 |
| Grant date | Sep 11, 2018 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
In general, techniques are described by which to perform bitrate allocation with respect to higher order ambisonic (HOA) audio data. A device comprising a memory and a processor may be configured to perform various aspects of the bitrate allocation techniques. The memory may be configured to store a spatially compressed version of the HOA audio data. The processor may be coupled to the memory, and configured to perform bitrate allocation, based on an analysis of transport channels representative of the spatially compressed version of the HOA audio data, and prior to performing gain control with respect to the transport channels or after performing inverse gain control with respect to the transport channels, to allocate a number of bits to each of the transport channels. The processor may also be configured to generate a bitstream that specifies each of the transport channels using the respective allocated number of bits.
Opening claim text (preview).
The invention claimed is: 1. A device configured to compress higher-order ambisonic (HOA) audio data representative of a soundfield, the device comprising: a memory configured to store a spatially compressed version of the HOA audio data; and one or more processors coupled to the memory, and configured to: perform bitrate allocation, based on an analysis of transport channels representative of the spatially compressed version of the HOA audio data, and prior to performing gain control with respect to the transport channels or after performing inverse gain control with respect to the transport channels, to allocate a number of bits to each of the transport channels; and generate a bitstream that specifies each of the transport channels using the respective allocated number of bits. 2. The device of claim 1 , wherein the one or more processors are further configured to: render the transport channels from a spherical harmonic domain to spatial domain channels; and perform the analysis with respect to the spatial domain channels. 3. The device of claim 1 , wherein the one or more processors are further configured to: render the transport channels from a spherical harmonic domain to uniformly distributed spatial domain channels; and perform the analysis with respect to the uniformly distributed spatial domain channels. 4. The device of claim 1 , wherein the analysis comprises an energy-based analysis of the transport channels. 5. The device of claim 1 , wherein the analysis comprises a perceptual-based analysis of the transport channels. 6. The device of claim 1 , wherein the analysis comprises a directional-based weighting analysis of the transport channels. 7. The device of claim 1 , wherein the analysis comprises a directional-based weighting analysis and a perceptual-based analysis of the transport channels. 8. The device of claim 1 , wherein the one or more processors are further configured to perform the inverse gain control with respect to the transport channels to remove gain normalization applied to the transport channels prior to performing the analysis of the transport channels. 9. The device of claim 1 , further comprising a microphone coupled to the one or more processors, and configured to capture signals representative of the HOA audio data. 10. The device of claim 9 , wherein the one or more processors are further configured to perform spatial compression with respect to the HOA audio data to generate the spatially compressed version of the HOA audio data. 11. The device of claim 9 , wherein the one or more processors are configured to perform a linear invertible decomposition with respect to the HOA audio data so as to generate the spatially compressed version of the HOA audio data. 12. The device of claim 1 , wherein the spatially compressed version of the HOA audio data includes a predominant audio signal defined in a spherical harmonic domain, and a corresponding spatial component defining a direction, a shape, and a width of the predominant audio signal, the spatial component also defined in the spherical harmonic domain. 13. The device of claim 1 , wherein the device comprises a robot. 14. The device of claim 1 , wherein the device comprises an automobile. 15. A method of compressing higher-order ambisonic (HOA) audio data representative of a soundfield, the method comprising: performing bitrate allocation, based on an analysis of transport channels representative of a spatially compressed version of the HOA audio data, and prior to performing gain control with respect to the transport channels or after performing inverse gain control with respect to the transport channels, to allocate a number of bits to each of the transport channels; and generating a bitstream that specifies each of the transport channels using the respective allocated number of bits. 16. The method of claim 15 , further comprising: rendering the transport channels from a spherical harmonic domain to spatial domain channels; and performing the analysis with respect to the spatial domain channels. 17. The method of claim 15 , further comprising: rendering the transport channels from a spherical harmonic domain to uniformly distributed spatial domain channels; and performing the analysis with respect to the uniformly distributed spatial domain channels. 18. The method of claim 15 , wherein the analysis comprises an energy-based analysis of the transport channels. 19. The method of claim 15 , wherein the analysis comprises a perceptual-based analysis of the transport channels. 20. The method of claim 15 , wherein the analysis comprises a directional-based weighting analysis of the transport channels. 21. The method of claim 15 , wherein the analysis comprises a directional-based weighting analysis and a perceptual-based analysis of the transport channels. 22. The method of claim 15 , further comprising performing the inverse gain control with respect to the transport channels to remove gain normalization applied to the transport channels prior to performing the analysis of the transport channels. 23. The method of claim 15 , further comprising capturing, by a microphone, signals representative of the HOA audio data. 24. The method of claim 23 , further comprising performing spatial compression with respect to the HOA audio data to generate the spatially compressed version of the HOA audio data. 25. The method of claim 23 , further comprising performing a linear invertible decomposition with respect to the HOA audio data so as to generate the spatially compressed version of the HOA audio data. 26. The method of claim 15 , wherein the spatially compressed version of the HOA audio data includes a predominant audio signal defined in a spherical harmonic domain, and a corresponding spatial component defining a direction, a shape, and a width of the predominant audio signal, the spatial component also defined in the spherical harmonic domain. 27. The device of claim 15 , wherein performing the bitrate allocation comprises performing, by one or more processors of a device, the bitrate allocation, wherein generating the bitstream comprises generating, by the one or more processors, the bitstream, and wherein the device comprises a mobile communication handset. 28. The device of claim 15 , wherein performing the bitrate allocation comprises performing, by one or more processors of a device, the bitrate allocation, wherein generating the bitstream comprises generating, by the one or more processors, the bitstream, and wherein the device comprises a robot. 29. A device configured to compress higher-order ambisonic (HOA) audio data representative of a soundfield, the device comprising: means for performing bitrate allocation, based on an analysis of transport channels representative of a spatially compressed version of the HOA audio data, and prior to performing gain control with respect to the transport channels or after performing inverse gain control with respect to the transport channels, to allocate a number of bits to each of the transport channels; and means for generating a bitstream that specifies each of the transport channels using the respective allocated number of bits. 30. A non-transitory computer-readable storage medium having stored thereon instructions that, when executed, cause one or more processo
Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title
the extracted parameters being power information · CPC title
in which the audio signals are in digital form, i.e. employing more than two discrete digital channels (data reduction aspects thereof based on psychoacoustics G10L19/02) · CPC title
Electronic adaptation of stereophonic sound system to listener position or orientation (H04S7/301 takes precedence) · CPC title
Aspects of volume control, not necessarily automatic, in stereophonic sound systems · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.