Bitrate allocation for higher order ambisonic audio data

US10075802B1 · US · B1

Patent metadata
FieldValue
Publication numberUS-10075802-B1
Application numberUS-201715671988-A
CountryUS
Kind codeB1
Filing dateAug 8, 2017
Priority dateAug 8, 2017
Publication dateSep 11, 2018
Grant dateSep 11, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

In general, techniques are described by which to perform bitrate allocation with respect to higher order ambisonic (HOA) audio data. A device comprising a memory and a processor may be configured to perform various aspects of the bitrate allocation techniques. The memory may be configured to store a spatially compressed version of the HOA audio data. The processor may be coupled to the memory, and configured to perform bitrate allocation, based on an analysis of transport channels representative of the spatially compressed version of the HOA audio data, and prior to performing gain control with respect to the transport channels or after performing inverse gain control with respect to the transport channels, to allocate a number of bits to each of the transport channels. The processor may also be configured to generate a bitstream that specifies each of the transport channels using the respective allocated number of bits.

First claim

Opening claim text (preview).

The invention claimed is: 1. A device configured to compress higher-order ambisonic (HOA) audio data representative of a soundfield, the device comprising: a memory configured to store a spatially compressed version of the HOA audio data; and one or more processors coupled to the memory, and configured to: perform bitrate allocation, based on an analysis of transport channels representative of the spatially compressed version of the HOA audio data, and prior to performing gain control with respect to the transport channels or after performing inverse gain control with respect to the transport channels, to allocate a number of bits to each of the transport channels; and generate a bitstream that specifies each of the transport channels using the respective allocated number of bits. 2. The device of claim 1 , wherein the one or more processors are further configured to: render the transport channels from a spherical harmonic domain to spatial domain channels; and perform the analysis with respect to the spatial domain channels. 3. The device of claim 1 , wherein the one or more processors are further configured to: render the transport channels from a spherical harmonic domain to uniformly distributed spatial domain channels; and perform the analysis with respect to the uniformly distributed spatial domain channels. 4. The device of claim 1 , wherein the analysis comprises an energy-based analysis of the transport channels. 5. The device of claim 1 , wherein the analysis comprises a perceptual-based analysis of the transport channels. 6. The device of claim 1 , wherein the analysis comprises a directional-based weighting analysis of the transport channels. 7. The device of claim 1 , wherein the analysis comprises a directional-based weighting analysis and a perceptual-based analysis of the transport channels. 8. The device of claim 1 , wherein the one or more processors are further configured to perform the inverse gain control with respect to the transport channels to remove gain normalization applied to the transport channels prior to performing the analysis of the transport channels. 9. The device of claim 1 , further comprising a microphone coupled to the one or more processors, and configured to capture signals representative of the HOA audio data. 10. The device of claim 9 , wherein the one or more processors are further configured to perform spatial compression with respect to the HOA audio data to generate the spatially compressed version of the HOA audio data. 11. The device of claim 9 , wherein the one or more processors are configured to perform a linear invertible decomposition with respect to the HOA audio data so as to generate the spatially compressed version of the HOA audio data. 12. The device of claim 1 , wherein the spatially compressed version of the HOA audio data includes a predominant audio signal defined in a spherical harmonic domain, and a corresponding spatial component defining a direction, a shape, and a width of the predominant audio signal, the spatial component also defined in the spherical harmonic domain. 13. The device of claim 1 , wherein the device comprises a robot. 14. The device of claim 1 , wherein the device comprises an automobile. 15. A method of compressing higher-order ambisonic (HOA) audio data representative of a soundfield, the method comprising: performing bitrate allocation, based on an analysis of transport channels representative of a spatially compressed version of the HOA audio data, and prior to performing gain control with respect to the transport channels or after performing inverse gain control with respect to the transport channels, to allocate a number of bits to each of the transport channels; and generating a bitstream that specifies each of the transport channels using the respective allocated number of bits. 16. The method of claim 15 , further comprising: rendering the transport channels from a spherical harmonic domain to spatial domain channels; and performing the analysis with respect to the spatial domain channels. 17. The method of claim 15 , further comprising: rendering the transport channels from a spherical harmonic domain to uniformly distributed spatial domain channels; and performing the analysis with respect to the uniformly distributed spatial domain channels. 18. The method of claim 15 , wherein the analysis comprises an energy-based analysis of the transport channels. 19. The method of claim 15 , wherein the analysis comprises a perceptual-based analysis of the transport channels. 20. The method of claim 15 , wherein the analysis comprises a directional-based weighting analysis of the transport channels. 21. The method of claim 15 , wherein the analysis comprises a directional-based weighting analysis and a perceptual-based analysis of the transport channels. 22. The method of claim 15 , further comprising performing the inverse gain control with respect to the transport channels to remove gain normalization applied to the transport channels prior to performing the analysis of the transport channels. 23. The method of claim 15 , further comprising capturing, by a microphone, signals representative of the HOA audio data. 24. The method of claim 23 , further comprising performing spatial compression with respect to the HOA audio data to generate the spatially compressed version of the HOA audio data. 25. The method of claim 23 , further comprising performing a linear invertible decomposition with respect to the HOA audio data so as to generate the spatially compressed version of the HOA audio data. 26. The method of claim 15 , wherein the spatially compressed version of the HOA audio data includes a predominant audio signal defined in a spherical harmonic domain, and a corresponding spatial component defining a direction, a shape, and a width of the predominant audio signal, the spatial component also defined in the spherical harmonic domain. 27. The device of claim 15 , wherein performing the bitrate allocation comprises performing, by one or more processors of a device, the bitrate allocation, wherein generating the bitstream comprises generating, by the one or more processors, the bitstream, and wherein the device comprises a mobile communication handset. 28. The device of claim 15 , wherein performing the bitrate allocation comprises performing, by one or more processors of a device, the bitrate allocation, wherein generating the bitstream comprises generating, by the one or more processors, the bitstream, and wherein the device comprises a robot. 29. A device configured to compress higher-order ambisonic (HOA) audio data representative of a soundfield, the device comprising: means for performing bitrate allocation, based on an analysis of transport channels representative of a spatially compressed version of the HOA audio data, and prior to performing gain control with respect to the transport channels or after performing inverse gain control with respect to the transport channels, to allocate a number of bits to each of the transport channels; and means for generating a bitstream that specifies each of the transport channels using the respective allocated number of bits. 30. A non-transitory computer-readable storage medium having stored thereon instructions that, when executed, cause one or more processo

Assignees

Inventors

Classifications

  • Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title

  • the extracted parameters being power information · CPC title

  • in which the audio signals are in digital form, i.e. employing more than two discrete digital channels (data reduction aspects thereof based on psychoacoustics G10L19/02) · CPC title

  • H04S7/302Primary

    Electronic adaptation of stereophonic sound system to listener position or orientation (H04S7/301 takes precedence) · CPC title

  • Aspects of volume control, not necessarily automatic, in stereophonic sound systems · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10075802B1 cover?
In general, techniques are described by which to perform bitrate allocation with respect to higher order ambisonic (HOA) audio data. A device comprising a memory and a processor may be configured to perform various aspects of the bitrate allocation techniques. The memory may be configured to store a spatially compressed version of the HOA audio data. The processor may be coupled to the memory, …
Who is the assignee on this patent?
Qualcomm Inc
What technology area does this patent fall under?
Primary CPC classification H04S7/302. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Sep 11 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).