Method and apparatus for compressing and decompressing a higher order ambisonics signal representation

US9454971B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9454971-B2
Application numberUS-201314400039-A
CountryUS
Kind codeB2
Filing dateMay 6, 2013
Priority dateMay 14, 2012
Publication dateSep 27, 2016
Grant dateSep 27, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Higher Order Ambisonics (HOA) represents a complete sound field in the vicinity of a sweet spot, independent of loudspeaker set-up. The high spatial resolution requires a high number of HOA coefficients. In the invention, dominant sound directions are estimated and the HOA signal representation is decomposed into dominant directional signals in time domain and related direction information, and an ambient component in HOA domain, followed by compression of the ambient component by reducing its order. The reduced-order ambient component is transformed to the spatial domain, and is perceptually coded together with the directional signals. At receiver side, the encoded directional signals and the order-reduced encoded ambient component are perceptually decompressed, the perceptually decompressed ambient signals are transformed to an HOA domain representation of reduced order, followed by order extension. The total HOA representation is recomposed from the directional signals, the corresponding direction information, and the original-order ambient HOA component.

First claim

Opening claim text (preview).

The invention claimed is: 1. A method for compressing a Higher Order Ambisonics HOA signal representation, said method comprising: estimating dominant directions; decomposing or decoding the HOA signal representation into a number of dominant directional signals in time domain and related direction information, and a residual ambient component in HOA domain, wherein said residual ambient component represents the difference between said HOA signal representation and a representation of said dominant directional signals; compressing said residual ambient component by reducing its order as compared to its original order; transforming said residual ambient HOA component of reduced order to the spatial domain; perceptually encoding said dominant directional signals and said transformed residual ambient HOA component. 2. The method according to claim 1 , wherein incoming vectors of HOA coefficients are framed into non-overlapping frames, and wherein a frame duration can be 25 ms. 3. The method according to claim 1 , wherein said dominant directions estimating is dependent on long overlapping groups of frames, such that for each current frame the content of adjacent frames is taken into consideration. 4. The method according to claim 1 , wherein said dominant directional signals and said transformed ambient HOA component are jointly perceptually compressed. 5. The method according to claim 1 , wherein said decomposing of the HOA signal representation into a number of dominant directional signals in time domain with related direction information and a residual ambient component in HOA domain is used for a signal-adaptive DirAC-like rendering of the HOA representation, wherein DirAC means Directional Audio Coding according to Pulkki. 6. The method according to claim 1 , wherein said dominant direction estimation is dependent on a directional power distribution of the energetically dominant HOA components. 7. A method for decompressing a Higher Order Ambisonics HOA signal representation that was compressed by: estimating dominant directions; decomposing or decoding the HOA signal representation into a number of dominant directional signals in time domain and related direction information, and a residual ambient component in HOA domain, wherein said residual ambient component represents the difference between said HOA signal representation and a representation of said dominant directional signals; compressing said residual ambient component by reducing its order as compared to its original order; transforming said residual ambient HOA component of reduced order to the spatial domain; perceptually encoding said dominant directional signals and said transformed residual ambient HOA component, said method comprising: perceptually decoding said perceptually encoded dominant directional signals and said perceptually encoded transformed residual ambient HOA component; inverse transforming said perceptually decoded transformed residual ambient HOA component so as to get an HOA domain representation; performing an order extension of said inverse transformed residual ambient HOA component so as to establish an original-order ambient HOA component; composing said perceptually decoded dominant directional signals, said direction information and said original-order extended ambient HOA component so as to get an HOA signal representation. 8. An apparatus for compressing a Higher Order Ambisonics HOA signal representation, said apparatus comprising: means adapted to estimate dominant directions; means adapted to decompose or decode the HOA signal representation into a number of dominant directional signals in time domain and related direction information, and a residual ambient component in HOA domain, wherein said residual ambient component represents the difference between said HOA signal representation and a representation of said dominant directional signals; means adapted to compress said residual ambient component by reducing its order as compared to its original order; means adapted to transform said residual ambient HOA component of reduced order to the spatial domain; means adapted to perceptually encode said dominant directional signals and said transformed residual ambient HOA component. 9. The apparatus according to claim 8 , wherein incoming vectors of HOA coefficients are framed into non-overlapping frames, and wherein a frame duration can be: 25 ms. 10. The apparatus according to claim 8 , wherein said dominant directions estimating is dependent on long overlapping groups of frames, such that for each current frame the content of adjacent frames is taken into consideration. 11. The apparatus according to claim 8 , wherein said dominant directional signals and said transformed ambient HOA component are jointly perceptually compressed. 12. The apparatus according to claim 8 , wherein said decomposing of the HOA signal representation into a number of dominant directional signals in time domain with related direction information and a residual ambient component in HOA domain is used for a signal-adaptive DirAC-like rendering of the HOA representation, wherein DirAC means Directional Audio Coding according to Pulkki. 13. The apparatus according to claim 8 , wherein said dominant direction estimation is dependent on a directional power distribution of the energetically dominant HOA components. 14. An apparatus for decompressing a Higher Order Ambisonics HOA signal representation that was compressed by: estimating dominant directions; decomposing or decoding the HOA signal representation into a number of dominant directional signals in time domain and related direction information, and a residual ambient component in HOA domain, wherein said residual ambient component represents the difference between said HOA signal representation and a representation of said dominant directional signals; compressing said residual ambient component by reducing its order as compared to its original order; transforming said residual ambient HOA component of reduced order to the spatial domain; perceptually encoding said dominant directional signals and said transformed residual ambient HOA component, said apparatus comprising a decoder configured to: perceptually decode said perceptually encoded dominant directional signals and said perceptually encoded transformed residual ambient HOA component; inverse transform said perceptually decoded transformed residual ambient HOA component so as to get an HOA domain representation; perform an order extension of said inverse transformed residual ambient HOA component so as to establish an original-order ambient HOA component; compose said perceptually decoded dominant directional signals, said direction information and said original-order extended ambient HOA component so as to get an HOA signal representation. 15. An apparatus for compressing a Higher Order Ambisonics HOA signal representation, said apparatus comprising an encoder configured to: estimate dominant directions; decompose or decode the HOA signal representation into a number of dominant directional signals in time domain and related direction information, and a residual ambient component in HOA domain, wherein said residual ambient component represents the difference between said HOA signal representation and a representation of said dominant directional signals; compress said residual ambient component by reducing its order as compared to its original order; transform said residual ambient HOA component of reduced order to the spatial domain; perceptually encode said dominant directional signals and said transf

Assignees

Inventors

Classifications

  • G10L19/008Primary

    Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title

  • Application of ambisonics in stereophonic audio systems · CPC title

  • H04S3/008Primary

    in which the audio signals are in digital form, i.e. employing more than two discrete digital channels (data reduction aspects thereof based on psychoacoustics G10L19/02) · CPC title

  • using sound class specific coding, hybrid encoders or object based coding · CPC title

  • using three or more audio channels, e.g. triphonic or quadraphonic · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9454971B2 cover?
Higher Order Ambisonics (HOA) represents a complete sound field in the vicinity of a sweet spot, independent of loudspeaker set-up. The high spatial resolution requires a high number of HOA coefficients. In the invention, dominant sound directions are estimated and the HOA signal representation is decomposed into dominant directional signals in time domain and related direction information, and…
Who is the assignee on this patent?
Dolby Laboratories Licensing Corp
What technology area does this patent fall under?
Primary CPC classification G10L19/008. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Sep 27 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).