Methods and systems for augmenting audio content

US11735203B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11735203-B2
Application numberUS-202217896785-A
CountryUS
Kind codeB2
Filing dateAug 26, 2022
Priority dateOct 28, 2020
Publication dateAug 22, 2023
Grant dateAug 22, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The audio content (e.g., an audio track, an audio file, an audio signal, etc.) of a content item (e.g., multimedia content, a movie, streaming content, etc.) may be modified to augment and/or include one or more auditory events, such as a sound, a plurality of sounds, a sound effect(s), a voice(s), and/or music.

First claim

Opening claim text (preview).

What is claimed: 1. One or more non-transitory computer-readable media storing processor-executable instructions that, when executed by at least one processor, cause the at least one processor to: determine, for a portion of a content item, one or more media elements and one or more auditory events; determine, based on the one or more media elements, one or more candidate auditory events; determine, based on the one or more auditory events and the one or more candidate auditory events, a target auditory event in audio content associated with the portion of the content item; and modify, based on the target auditory event, the audio content. 2. The non-transitory computer-readable media of claim 1 , wherein the one or more media elements comprises one or more textual elements, and wherein the processor-executable instructions, when executed by the at least one processor, further cause the at least one processor to determine the one or more media elements based on one or more of natural language processing, optical character recognition, and an output from a machine learning model. 3. The non-transitory computer-readable media of claim 1 , wherein the one or more media elements comprises one or more visual elements, and wherein the processor-executable instructions, when executed by the at least one processor, further cause the at least one processor to determine the one or more media elements based on one or more of object recognition, and an output from a machine learning model. 4. The non-transitory computer-readable media of claim 1 , wherein the processor-executable instructions, when executed by the at least one processor, further cause the at least one processor to determine the one or more auditory events based on one or more of audio signal analysis, speech recognition, and an output from a machine learning model. 5. The non-transitory computer-readable media of claim 1 , wherein the processor-executable instructions, when executed by the at least one processor, further cause the at least one processor to modify the audio content to include the target auditory event. 6. The non-transitory computer-readable media of claim 1 , wherein the processor-executable instructions, that when executed by the at least one processor, cause the at least one processor to modify the audio content, further cause the at least one processor to: determine audio data associated with the target auditory event; and update, based on the audio data, a waveform associated with the audio content. 7. The non-transitory computer-readable media of claim 1 , wherein the processor-executable instructions, that when executed by the at least one processor, cause the at least one processor to determine the one or more candidate auditory events, further cause the at least one processor to: determine that the one or more media elements are associated with one or more auditory events in an auditory event repository; and determine, based on the one or more media elements being associated with the one or more auditory events in the auditory event repository, the one or more candidate auditory events. 8. The non-transitory computer-readable media of claim 1 , wherein the target auditory event comprises a candidate auditory event of the one or more candidate auditory events that is missing from the one or more auditory events. 9. The non-transitory computer-readable media of claim 1 , wherein the target auditory event comprises an attenuated auditory event of the one or more auditory events or an accentuated auditory event of the one or more auditory events, wherein the processor-executable instructions, that when executed by the at least one processor, cause the at least one processor to modify the audio content, further cause the at least one processor to increase an audio level associated with the attenuated auditory event or decrease an audio level associated with the accentuated auditory event. 10. The non-transitory computer-readable media of claim 1 , wherein the target auditory event comprises an unintelligible auditory event of the one or more auditory events, wherein the processor-executable instructions, that when executed by the at least one processor, cause the at least one processor to modify the audio content, further cause the at least one processor to: determine an audio file associated with the unintelligible auditory event; and update, based on the audio file, a waveform associated with the audio content. 11. One or more non-transitory computer-readable media storing processor-executable instructions that, when executed by at least one processor, cause the at least one processor to: determine, for a portion of a content item, a distribution of visual elements, and a distribution of auditory events; determine, based on the distribution of visual elements, one or more candidate auditory events; determine, based on the distribution of auditory events and the one or more candidate auditory events, a target auditory event in audio content associated with the portion of the content item; and modify, based on the target auditory event, the audio content. 12. The non-transitory computer-readable media of claim 11 , wherein the target auditory event comprises a candidate auditory event of the one or more candidate auditory events that is missing from the distribution of auditory events. 13. The non-transitory computer-readable media of claim 11 , wherein the processor-executable instructions, when executed by the at least one processor, further cause the at least one processor to modify the audio content to include the target auditory event. 14. The non-transitory computer-readable media of claim 11 , wherein the target auditory event comprises an attenuated auditory event of the distribution of auditory events or an accentuated auditory event of the distribution of auditory events, wherein the processor-executable instructions, that when executed by the at least one processor, cause the at least one processor to modify the audio content, further cause the at least one processor to increase an audio level associated with the attenuated auditory event or decrease an audio level associated with the accentuated auditory event. 15. The non-transitory computer-readable media of claim 11 , wherein the target auditory event comprises an unintelligible auditory event of the distribution of auditory events, wherein the processor-executable instructions, that when executed by the at least one processor, cause the at least one processor to modify the audio content, further cause the at least one processor to: determine an audio file associated with the unintelligible auditory event; and update, based on the audio file, a waveform associated with the audio content. 16. The non-transitory computer-readable media of claim 11 , wherein the processor-executable instructions, that when executed by the at least one processor, cause the at least one processor to determine the one or more candidate auditory events, further cause the at least one processor to: determine that one or more visual elements of the distribution of visual elements are associated with one or more auditory events in an auditory event repository; and determine, based on the one or more visual elements being associated with the one or more auditory events in the auditory event repository, the one or more candidate auditory events. 17. The non-transitory computer-readable media of claim 11 , wherein the processor-executable instructions, when executed by the at least one processor, further cause the at least one processor to determine, for

Assignees

Inventors

Classifications

  • Details of processing therefor · CPC title

  • Extraction of image or video features · CPC title

  • Scenes; Scene-specific elements (control of digital cameras H04N23/60) · CPC title

  • Classification techniques · CPC title

  • for comparison or discrimination · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11735203B2 cover?
The audio content (e.g., an audio track, an audio file, an audio signal, etc.) of a content item (e.g., multimedia content, a movie, streaming content, etc.) may be modified to augment and/or include one or more auditory events, such as a sound, a plurality of sounds, a sound effect(s), a voice(s), and/or music.
Who is the assignee on this patent?
Comcast Cable Comm Llc
What technology area does this patent fall under?
Primary CPC classification G10L21/0324. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 22 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).