Apparatus and method for screen related audio object remapping

US12380903B2 · US · B2

Patent metadata
Publication number: US-12380903-B2
Application number: US-202418416775-A
Country: US
Kind code: B2
Filing date: Jan 18, 2024
Priority date: Mar 26, 2014
Publication date: Aug 5, 2025
Grant date: Aug 5, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An apparatus for generating loudspeaker signals includes an object metadata processor configured to receive metadata, to calculate a second position of the audio object depending on the first position of the audio object and on a size of a screen if the audio object is indicated in the metadata as being screen-related, to feed the first position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being not screen-related, and to feed the second position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being screen-related. The apparatus further includes an object renderer configured to receive an audio object and to generate the loudspeaker signals depending on the audio object and on position information.
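The routing described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the names `ObjectMetadata`, `select_position`, `toy_remap`, and `REFERENCE_SCREEN`, the `(azimuth, elevation, distance)` tuple layout, and the representation of screen size as an angular width and height are all assumptions made for the example. `toy_remap` is a deliberately simple placeholder and is not the mapping defined in the patent.

```python
from dataclasses import dataclass
from typing import Callable, Tuple

# Hypothetical layouts chosen for illustration only.
Position = Tuple[float, float, float]   # (azimuth_deg, elevation_deg, distance)
ScreenSize = Tuple[float, float]        # (width_deg, height_deg)

# Assumed reference screen used by the placeholder mapping below.
REFERENCE_SCREEN: ScreenSize = (60.0, 40.0)


@dataclass(frozen=True)
class ObjectMetadata:
    position: Position      # the "first position" carried in the metadata
    screen_related: bool    # the indication on whether the object is screen-related


def toy_remap(position: Position, screen: ScreenSize) -> Position:
    """Placeholder remapping: scale angles by screen size relative to the reference.

    This is NOT the patent's mapping; it only stands in for "a second position
    that depends on the first position and on the size of the screen".
    """
    azimuth, elevation, distance = position
    return (
        azimuth * screen[0] / REFERENCE_SCREEN[0],
        elevation * screen[1] / REFERENCE_SCREEN[1],
        distance,
    )


def select_position(
    meta: ObjectMetadata,
    screen: ScreenSize,
    remap: Callable[[Position, ScreenSize], Position] = toy_remap,
) -> Position:
    """Return the position information that would be fed to the renderer."""
    if not meta.screen_related:
        # Not screen-related: pass the first position through unchanged.
        return meta.position
    # Screen-related: compute and pass on the second position.
    return remap(meta.position, screen)


if __name__ == "__main__":
    small_screen = (30.0, 20.0)
    print(select_position(ObjectMetadata((30.0, 10.0, 1.0), False), small_screen))
    print(select_position(ObjectMetadata((30.0, 10.0, 1.0), True), small_screen))
```

In this sketch the renderer would receive only the returned tuple, so it has no way of telling whether it was given the first or the second position.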

First claim

Opening claim text (preview).

The invention claimed is:

1. An apparatus for generating loudspeaker signals, comprising: an object metadata processor, and an object renderer, wherein, metadata comprises an indication on whether an audio object is screen-related, and further comprises a first position of the audio object, wherein the object metadata processor is configured to calculate a second position of the audio object depending on the first position of the audio object and depending on a size of a screen if the audio object is indicated in the metadata as being screen-related, wherein the object renderer is configured to generate the loudspeaker signals depending on the audio object and depending on position information, wherein the object metadata processor is configured to feed the first position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being not screen-related, and wherein the object metadata processor is configured to feed the second position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being screen-related.

2. The apparatus according to claim 1, wherein the object metadata processor is configured to not calculate the second position of the audio object if the audio object is indicated in the metadata as being not screen-related.

3. The apparatus according to claim 1, wherein the object renderer is configured to not determine whether the position information is the first position of the audio object or the second position of the audio object.

4. The apparatus according to claim 1, wherein the object renderer is configured to generate the loudspeaker signals further depending on the number of the loudspeakers of a playback environment.

5. The apparatus according to claim 4, wherein the object renderer is configured to generate the loudspeaker signals further depending on a loudspeaker position of each of the loudspeakers of the playback environment.

6. The apparatus according to claim 1, wherein the object metadata processor is configured to calculate the second position of the audio object depending on the first position of the audio object and depending on the size of the screen if the audio object is indicated in the metadata as being screen-related, wherein the first position indicates the first position in a three-dimensional space, and wherein the second position indicates the second position in the three-dimensional space.

7. The apparatus according to claim 6, wherein the object metadata processor is configured to calculate the second position of the audio object depending on the first position of the audio object and depending on the size of the screen if the audio object is indicated in the metadata as being screen-related, wherein the first position indicates a first azimuth, a first elevation and a first distance, and wherein the second position indicates a second azimuth, a second elevation and a second distance.

8. The apparatus according to claim 1, wherein the object metadata processor is configured to receive the metadata, comprising the indication on whether the audio object is screen-related as a first indication, and further comprising a second indication if the audio object is screen-related, said second indication indicating whether the audio object is an on-screen object, and wherein the object metadata processor is configured to calculate the second position of the audio object depending on the first position of the audio object and depending on the size of the screen, such that the second position takes a first value on a screen area of the screen if the second indication indicates that the audio object is an on-screen object.

9. The apparatus according to claim 8, wherein the object metadata processor is configured to calculate the second position of the audio object depending on the first position of the audio object and depending on the size of the screen, such that the second position takes a second value, which is either on the screen area or not on the screen area if the second indication indicates that the audio object is not an on-screen object.

10. The apparatus according to claim 1, wherein the object metadata processor is configured to receive the metadata, comprising the indication on whether the audio object is screen-related as a first indication, and further comprising a second indication if the audio object is screen-related, said second indication indicating whether the audio object is an on-screen object, wherein the object metadata processor is configured to calculate the second position of the audio object depending on the first position of the audio object, depending on the size of the screen, and depending on a first mapping curve as the mapping curve if the second indication indicates that the audio object is an on-screen object, wherein the first mapping curve defines a mapping of original object positions in a first value interval to remapped object positions in a second value interval, and wherein the object metadata processor is configured to calculate the second position of the audio object depending on the first position of the audio object, depending on the size of the screen, and depending on a second mapping curve as the mapping curve if the second indication indicates that the audio object is not an on-screen object, wherein the second mapping curve defines a mapping of original object positions in the first value interval to remapped object positions in a third value interval, and wherein said second value interval is comprised by the third value interval, and wherein said second value interval is smaller than said third value interval.

11. The apparatus according to claim 10, wherein each of the first value interval and the second value interval and the third value interval is a value interval of azimuth angles, or wherein each of the first value interval and the second value interval and the third value interval is a value interval of elevation angles.

12. The apparatus according to claim 1, wherein the object metadata processor is configured to calculate the second position of the audio object depending on at least one of a first linear mapping function and a second linear mapping function, wherein the first linear mapping function is defined to map a first azimuth value to a second azimuth value, wherein the second linear mapping function is defined to map a first elevation value to a second elevation value, wherein $\varphi_{\text{left}}^{\text{nominal}}$ indicates a left azimuth screen edge reference, wherein $\varphi_{\text{right}}^{\text{nominal}}$ indicates a right azimuth screen edge reference, wherein $\theta_{\text{top}}^{\text{nominal}}$ indicates a top elevation screen edge reference, wherein $\theta_{\text{bottom}}^{\text{nominal}}$ indicates a bottom elevation screen edge reference, wherein $\varphi_{\text{left}}^{\text{repro}}$ indicates a left azimuth screen edge of the screen, wherein $\varphi_{\text{right}}^{\text{repro}}$ indicates a right azimuth screen edge of the screen, wherein $\theta_{\text{top}}^{\text{repro}}$ indicates a top elevation screen edge of the screen, wherein $\theta_{\text{bottom}}^{\text{repro}}$ indicates a bottom elevation screen edge of the screen, wherein $\varphi$ indicates the first azimuth value, wherein $\varphi'$ indicates the second azimuth value, wherein $\theta$ indicates the first elevation value, wherein $\theta'$ indicates the second elevation value, wherein the second azimuth value $\varphi'$ results from a first mapping of the first azimuth value $\varphi$ according to the first linear mapping function according to $\varphi' = \{$
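Claim 12 refers to linear mapping functions built from nominal (reference) and reproduction screen edges, but the preview cuts off before the formula itself, so the claimed expression is not reproduced here. As a hedged illustration of the general idea only, the sketch below shows one plausible piecewise-linear remapping in which each nominal edge is sent to the corresponding reproduction edge and the rest of the angular range is stretched or compressed linearly. The function name `remap_angle`, its parameters, the three-segment shape, and the default range of -180 to 180 degrees are assumptions for this example, not taken from the claim text.

```python
def remap_angle(
    angle: float,
    nominal_lo: float,
    nominal_hi: float,
    repro_lo: float,
    repro_hi: float,
    range_lo: float = -180.0,
    range_hi: float = 180.0,
) -> float:
    """Piecewise-linear remap of an angle in degrees (illustrative assumption).

    Segments:
      [range_lo, nominal_lo]   -> [range_lo, repro_lo]
      [nominal_lo, nominal_hi] -> [repro_lo, repro_hi]
      [nominal_hi, range_hi]   -> [repro_hi, range_hi]
    """
    if not (range_lo < nominal_lo < nominal_hi < range_hi):
        raise ValueError("nominal edges must lie strictly inside the range")
    if not (range_lo <= angle <= range_hi):
        raise ValueError("angle outside the supported range")

    def lerp(x: float, x0: float, x1: float, y0: float, y1: float) -> float:
        # Straight line through (x0, y0) and (x1, y1), evaluated at x.
        return y0 + (x - x0) * (y1 - y0) / (x1 - x0)

    if angle < nominal_lo:
        return lerp(angle, range_lo, nominal_lo, range_lo, repro_lo)
    if angle <= nominal_hi:
        return lerp(angle, nominal_lo, nominal_hi, repro_lo, repro_hi)
    return lerp(angle, nominal_hi, range_hi, repro_hi, range_hi)


if __name__ == "__main__":
    # Reference screen spanning -30..30 degrees, playback screen -15..15 degrees.
    for azimuth in (-180.0, -30.0, 0.0, 30.0, 105.0, 180.0):
        print(azimuth, remap_angle(azimuth, -30.0, 30.0, -15.0, 15.0))
```

The same function could be applied to elevation by passing `range_lo=-90.0` and `range_hi=90.0` together with bottom and top edges. Which of the lower and upper edges corresponds to "left" or "right" depends on the azimuth sign convention, which this sketch leaves open.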

Assignees

Fraunhofer Ges Forschung

Inventors

Classifications

  • Electronic adaptation dependent on speaker or headphone connection · CPC title

  • Processing of audio elementary streams · CPC title

  • Indicating arrangements; Control arrangements, e.g. balance control · CPC title

  • in which the audio signals are in digital form, i.e. employing more than two discrete digital channels (data reduction aspects thereof based on psychoacoustics G10L19/02) · CPC title

  • Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12380903B2 cover?
An apparatus for generating loudspeaker signals includes an object metadata processor configured to receive metadata, to calculate a second position of the audio object depending on the first position of the audio object and on a size of a screen if the audio object is indicated in the metadata as being screen-related, to feed the first position of the audio object as the position information i…
Who is the assignee on this patent?
Fraunhofer Ges Forschung
What technology area does this patent fall under?
Primary CPC classification G10L19/008. Mapped technology areas include Physics.
When was this patent published?
Publication date Aug 5, 2025 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).