Method and apparatus for playback of a higher-order ambisonics audio signal
US-9451363-B2 · Sep 20, 2016 · US
US11527254B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11527254-B2 |
| Application number | US-202016950768-A |
| Country | US |
| Kind code | B2 |
| Filing date | Nov 17, 2020 |
| Priority date | Mar 26, 2014 |
| Publication date | Dec 13, 2022 |
| Grant date | Dec 13, 2022 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
An apparatus for generating loudspeaker signals includes an object metadata processor configured to receive metadata, to calculate a second position of the audio object depending on the first position of the audio object and on a size of a screen if the audio object is indicated in the metadata as being screen-related, to feed the first position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being not screen-related, and to feed the second position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being screen-related. The apparatus further includes an object renderer configured to receive an audio object and to generate the loudspeaker signals depending on the audio object and on position information.
Opening claim text (preview).
The invention claimed is: 1. An apparatus for generating loudspeaker signals, comprising: an object metadata processor, and an object renderer, wherein the object renderer is configured to receive an audio object, wherein the object metadata processor is configured to receive metadata comprising a first position of the audio object, wherein the object metadata processor is configured to calculate a second position of the audio object depending on the first position of the audio object and depending on a size of a screen if the audio object is screen-related, wherein the object renderer is configured to generate the loudspeaker signals depending on the audio object and depending on position information, wherein the object metadata processor is configured to feed the first position of the audio object as the position information into the object renderer if the audio object is not screen-related, and wherein the object metadata processor is configured to feed the second position of the audio object as the position information into the object renderer if the audio object is screen-related. 2. The apparatus according to claim 1 , wherein the object metadata processor is configured to not calculate the second position of the audio object if the audio object is not screen-related. 3. The apparatus according to claim 1 , wherein the object renderer is configured to not determine whether the position information is the first position of the audio object or the second position of the audio object. 4. The apparatus according to claim 1 , wherein the object renderer is configured to generate the loudspeaker signals further depending on the number of the loudspeakers of a playback environment. 5. The apparatus according to claim 4 , wherein the object renderer is configured to generate the loudspeaker signals further depending on a loudspeaker position of each of the loudspeakers of the playback environment. 6. The apparatus according to claim 1 , wherein the object metadata processor is configured to calculate the second position of the audio object depending on the first position of the audio object and depending on the size of the screen if the audio object is screen-related, wherein the first position indicates the first position in a three-dimensional space, and wherein the second position indicates the second position in the three-dimensional space. 7. The apparatus according to claim 6 , wherein the object metadata processor is configured to calculate the second position of the audio object depending on the first position of the audio object and depending on the size of the screen if the audio object is screen-related, wherein the first position indicates a first azimuth, a first elevation and a first distance, and wherein the second position indicates a second azimuth, a second elevation and a second distance. 8. The apparatus according to claim 1 , wherein the object metadata processor is configured to receive the metadata comprising an indication if the audio object is screen-related, said indication indicating whether the audio object is an on-screen object, and wherein the object metadata processor is configured to calculate the second position of the audio object depending on the first position of the audio object and depending on the size of the screen, such that the second position takes a first value on a screen area of the screen if the indication indicates that the audio object is an on-screen object. 9. The apparatus according to claim 8 , wherein the object metadata processor is configured to calculate the second position of the audio object depending on the first position of the audio object and depending on the size of the screen, such that the second position takes a second value, which is either on the screen area or not on the screen area if the second indication indicates that the audio object is not an on-screen object. 10. The apparatus according to claim 1 , wherein the object metadata processor is configured to receive the metadata comprising an indication if the audio object is screen-related, said second indication indicating whether the audio object is an on-screen object, wherein the object metadata processor is configured to calculate the second position of the audio object depending on the first position of the audio object, depending on the size of the screen, and depending on a first mapping curve as the mapping curve if the indication indicates that the audio object is an on-screen object, wherein the first mapping curve defines a mapping of original object positions in a first value interval to remapped object positions in a second value interval, and wherein the object metadata processor is configured to calculate the second position of the audio object depending on the first position of the audio object, depending on the size of the screen, and depending on a second mapping curve as the mapping curve if the indication indicates that the audio object is not an on-screen object, wherein the second mapping curve defines a mapping of original object positions in the first value interval to remapped object positions in a third value interval, and wherein said second value interval is comprised by the third value interval, and wherein said second value interval is smaller than said third value interval. 11. The apparatus according to claim 10 , wherein each of the first value interval and the second value interval and the third value interval is a value interval of azimuth angles, or wherein each of the first value interval and the second value interval and the third value interval is a value interval of elevation angles. 12. The apparatus according to claim 1 , wherein the object metadata processor is configured to calculate the second position of the audio object depending on at least one of a first linear mapping function and a second linear mapping function, wherein the first linear mapping function is defined to map a first azimuth value to a second azimuth value, wherein the second linear mapping function is defined to map a first elevation value to a second elevation value, wherein φ left nominal indicates a left azimuth screen edge reference, wherein φ right nominal indicates a right azimuth screen edge reference, wherein θ top nominal indicates a top elevation screen edge reference, wherein θ bottom nominal indicates a bottom elevation screen edge reference, wherein φ left repro indicates a left azimuth screen edge of the screen, wherein φ right repro indicates a right azimuth screen edge of the screen, wherein θ top repro indicates a top elevation screen edge of the screen, wherein θ bottom repro indicates a bottom elevation screen edge of the screen, wherein φ indicates the first azimuth value, wherein φ′ indicates the second azimuth value, wherein θ indicates the first elevation value, wherein θ′ indicates the second elevation value, wherein the second azimuth value φ′ results from a first mapping of the first azimuth value φ according to the first linear mapping function according to φ ′ = { φ right repro
in which the audio signals are in digital form, i.e. employing more than two discrete digital channels (data reduction aspects thereof based on psychoacoustics G10L19/02) · CPC title
Positioning of individual sound objects, e.g. moving airplane, within a sound field (H04S2420/13 takes precedence) · CPC title
using sound class specific coding, hybrid encoders or object based coding · CPC title
Indicating arrangements; Control arrangements, e.g. balance control · CPC title
involving special audio data, e.g. different tracks for different languages · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.