Device and rendering environment tracking

US12382235B2 · US · B2

Patent metadata
Field: Value
Publication number: US-12382235-B2
Application number: US-202318161645-A
Country: US
Kind code: B2
Filing date: Jan 30, 2023
Priority date: Feb 1, 2022
Publication date: Aug 5, 2025
Grant date: Aug 5, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Images of an actual rendering environment are acquired through image sensors operating in conjunction with a media consumption system. The acquired images of the actual rendering environment are used to predict audio characteristics of objects present in the actual rendering environment. Spatial audio rendered, to a user in the actual rendering environment, by audio speakers operating in conjunction with the media consumption system is adjusted or modified based at least in part on the audio characteristics of the objects present in the actual rendering environment.
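
The flow described in the abstract (objects seen in images, predicted audio characteristics, adjusted rendering) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the `DetectedObject` type, the use of an absorption coefficient as the predicted audio characteristic, the use of Sabine's reverberation formula (RT60 = 0.161 · V / A, metric units) to summarize the room, and the simple rule for how much synthetic reverberation to mix in.

```python
from dataclasses import dataclass


@dataclass
class DetectedObject:
    """Hypothetical output of an image-based predictor for one object."""
    label: str
    surface_area_m2: float
    absorption: float  # assumed predicted absorption coefficient, 0..1


def sabine_rt60(room_volume_m3, objects):
    """Estimate reverberation time (seconds) with Sabine's formula."""
    total_absorption = sum(o.surface_area_m2 * o.absorption for o in objects)
    if total_absorption <= 0:
        raise ValueError("total absorption must be positive")
    return 0.161 * room_volume_m3 / total_absorption


def added_reverb_mix(actual_rt60, target_rt60):
    """Illustrative rule: fraction (0..1) of synthetic reverb to mix in.

    Returns 0 when the actual room is already at least as reverberant
    as the target (recorded) environment.
    """
    if target_rt60 <= 0:
        raise ValueError("target RT60 must be positive")
    return max(0.0, min(1.0, 1.0 - actual_rt60 / target_rt60))


# Example scene: values are made up for illustration.
scene = [
    DetectedObject("sofa", surface_area_m2=6.0, absorption=0.5),
    DetectedObject("walls", surface_area_m2=50.0, absorption=0.1),
]
rt60 = sabine_rt60(room_volume_m3=40.0, objects=scene)
mix = added_reverb_mix(actual_rt60=rt60, target_rt60=2 * rt60)
```

In this sketch a room that is drier than the target environment receives extra synthetic reverberation, while a room that is already as reverberant as the target receives none. A real system would work per frequency band and per object rather than with one broadband number.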

First claim

Opening claim text (preview).

What is claimed is:

1. A method comprising: acquiring a plurality of images of an actual rendering environment through one or more image sensors operating in conjunction with a media consumption system; using the plurality of images of the actual rendering environment to predict audio characteristics of objects present in the actual rendering environment; adjusting spatial audio rendered, to a user in the actual rendering environment, by a plurality of audio speakers operating in conjunction with the media consumption system based at least in part on the audio characteristics of the objects present in the actual rendering environment; wherein the spatial audio is derived from received audio data by the media consumption system; wherein the spatial audio as adjusted reproduces a recorded rendering environment for which the received audio data is intended to be rendered; wherein the recorded rendering environment comprises a reference audio speaker configuration that is different from an actual audio speaker configuration deployed in the actual rendering environment.

2. The method of claim 1, wherein the plurality of images of the actual rendering environment is used to identify a make and a model of headphones, among the objects present in the actual rendering environment, worn by the user.

3. The method of claim 1, wherein the plurality of images of the actual rendering environment is used to identify a make and a model of a non-headphone audio speaker among the objects present in the actual rendering environment.

4. The method of claim 1, wherein the plurality of images of the actual rendering environment is used to predict audio characteristics of a non-audio-speaker object among the objects present in the actual rendering environment.

5. The method of claim 1, wherein the plurality of images of the actual rendering environment is acquired over a time duration; wherein the media consumption system selects default audio characteristics of at least a subset of objects, among the objects present in the actual rendering environment, in rendering audio to the user when the time duration begins.

6. The method of claim 1, wherein the audio characteristics are predicted by an audio characteristics predictive model trained with training images of a plurality of objects of different types in a plurality of different rendering environments and ground truths indicating respective audio characteristics for the plurality of objects of different types in the plurality of different rendering environments.

7. The method of claim 1, wherein a subset of audio characteristics among the audio characteristics comprises one of: a reverberation audio characteristic or an echo audio characteristic, of an object present in the actual rendering environment.

8. The method of claim 1, wherein the recorded rendering environment is specified in audio metadata in a media data signal that carries the received audio data to the media consumption system.

9. The method of claim 1, wherein the audio characteristics are used to select specific personalized equalization operational parameters for rendering audio to the user in the actual rendering environment.

10. The method of claim 1, wherein a new audio portion is created and rendered by a first audio speaker to reduce an audio leakage caused by a second audio speaker in an ear of the user.

11. An apparatus performing the method of claim 1.

12. A non-transitory computer readable storage medium, storing software instructions, which when executed by one or more processors cause performance of the method recited in claim 1.

13. A computing device comprising one or more processors and one or more storage media, storing a set of instructions, which when executed by one or more processors cause performance of the method recited in claim 1.
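
Claim 1 requires that the reference audio speaker configuration of the recorded rendering environment differ from the actual speaker configuration. One way to picture what reproducing a reference speaker on different hardware could involve is the sketch below. It is purely illustrative and not the claimed method: it uses ordinary constant-power panning between the two actual speakers that bracket the reference speaker's azimuth, and the function name `remap_reference_channel` and the azimuth-only speaker model are assumptions introduced here.

```python
import math


def remap_reference_channel(ref_azimuth_deg, actual_azimuths_deg):
    """Gains, keyed by actual speaker azimuth, that approximate a
    reference speaker at ref_azimuth_deg via constant-power panning."""
    speakers = sorted(set(actual_azimuths_deg))
    if not speakers:
        raise ValueError("at least one actual speaker is required")
    gains = {az: 0.0 for az in speakers}

    # Outside the arc covered by actual speakers: use the nearest edge speaker.
    if ref_azimuth_deg <= speakers[0]:
        gains[speakers[0]] = 1.0
        return gains
    if ref_azimuth_deg >= speakers[-1]:
        gains[speakers[-1]] = 1.0
        return gains

    # Inside the arc: pan between the two bracketing speakers so that
    # the squared gains sum to 1 (constant acoustic power).
    for lo, hi in zip(speakers, speakers[1:]):
        if lo <= ref_azimuth_deg <= hi:
            t = (ref_azimuth_deg - lo) / (hi - lo)
            gains[lo] = math.cos(t * math.pi / 2)
            gains[hi] = math.sin(t * math.pi / 2)
            break
    return gains


# A centre reference channel (0 degrees) rendered on a stereo pair at +/-30 degrees.
centre_on_stereo = remap_reference_channel(0, [-30, 30])
# A surround reference channel (110 degrees) with no rear speakers available.
surround_on_stereo = remap_reference_channel(110, [-30, 30])
```

Here the centre channel receives equal gains of about 0.707 on each stereo speaker, and the surround channel falls back entirely to the nearest speaker at 30 degrees. The adjustment described in the claims additionally depends on the predicted audio characteristics of objects in the room, which this sketch does not model.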

Assignees

Inventors

Classifications

  • Electronic adaptation of stereophonic audio signals to reverberation of the listening space (H04S7/301 takes precedence) · CPC title

  • H04S7/304 · Primary

    For headphones · CPC title

  • Aspects of volume control, not necessarily automatic, in stereophonic sound systems · CPC title

  • Public address systems (circuits for preventing acoustic reaction H04R3/02; circuits for distributing signals to loudspeakers H04R3/12; {monitoring or testing arrangements for public address systems H04R29/007}; amplifiers H03F) · CPC title

  • Enhancing the perception of the sound image or of the spatial distribution using head related transfer functions [HRTF's] or equivalents thereof, e.g. interaural time difference [ITD] or interaural level difference [ILD] · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12382235B2 cover?
Images of an actual rendering environment are acquired through image sensors operating in conjunction with a media consumption system. The acquired images of the actual rendering environment are used to predict audio characteristics of objects present in the actual rendering environment. Spatial audio rendered, to a user in the actual rendering environment, by audio speakers operating in conjun…
Who is the assignee on this patent?
Dolby Laboratories Licensing Corp
What technology area does this patent fall under?
Primary CPC classification H04S7/304. Mapped technology areas include Electricity.
When was this patent published?
Publication date Aug 5, 2025 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).