System and method for efficient scene continuity in visual and multimedia using generative artificial intelligence

US12499515B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12499515-B2
Application numberUS-202418736517-A
CountryUS
Kind codeB2
Filing dateJun 6, 2024
Priority dateJun 6, 2024
Publication dateDec 16, 2025
Grant dateDec 16, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system and method for generating multimedia artifacts with managed scene continuity in visual and multimedia using an AI-based and scene continuity aware media generation platform. The system receives a user or AI agent specification or simulation result(s), selects or trains generative models based on the specification, preprocesses relevant data, and generates scene narrative or frame-specific, sequence specific or broader continuity aware content using the selected or trained model(s). The generated content may be further enhanced using frame interpolation and view synthesis techniques to create smooth transitions or novel viewpoints or to aid in more efficient transmission or viewing or persistence of resultant content. The system enables efficient and customizable generation of high-quality scene continuity aware content for various applications in visual and multimedia production using neuro-symbolic and simulation enhanced compression, representation and generation processes.

First claim

Opening claim text (preview).

What is claimed is: 1 . A computing system for managing scene continuity in generated or augmented visual media, the computing system comprising: one or more hardware processors configured for: receiving a content specification request associated with a scene for continuity-aware content management; selecting one or more generative models based on the content specification, wherein the selecting comprises analyzing scene-specific continuity requirements including object positioning, lighting conditions, and camera perspective across multiple scenes; preprocessing data based on the content specification to prepare the data for the selected generative models, wherein the preprocessing comprises: cleaning multi-modal visual data while maintaining temporal and spatial relationships between frames; identifying and tracking specific objects across multiple scenes to ensure consistent object representation; mapping lighting patterns and camera angles across scene transitions; and transforming the data into a format that preserves visual and narrative continuity markers; selecting, training, fine-tuning, or augmenting the selected generative models using the preprocessed data, wherein the training comprises: implementing an adversarial network architecture with a generator network and a discriminator network; training the discriminator to identify visual discontinuities between scenes; and optimizing the generator to produce content that maintains consistent representation of characters, environments, narrative elements, and visual assets across multiple clips or scenes; generating scene continuity-aware content using the selected generative models based on the content specification request, wherein the generating comprises: frame interpolation and view synthesis to create transitions between scenes that maintain continuity of subject appearance, lighting, and scene geometry; perspective reconfiguration based on user-defined camera motion and angle inputs; generative scene extension through outpainting or recomposition to fill occluded or non-visible spatial regions beyond original frame boundaries; and synthesizing synchronized audio, including ambient environmental sounds and character dialogue, based on the visual content and content specification; and outputting the generated scene continuity-aware content artifacts or representations through a three-dimensional rendering engine that applies consistent shading, texture mapping, and perspective transformations across scene boundaries. 2 . The computing system of claim 1 , wherein the user content specification comprises one or more design elements, user preference configuration documents, or templates associated with scene continuity generation. 3 . The computing system of claim 1 , wherein selecting one or more generative models comprises selecting a generator network and a discriminator network. 4 . The computing system of claim 3 , wherein training the selected generative models comprises training the generator network and the discriminator network adversarially using the preprocessed data. 5 . The computing system of claim 1 , wherein generating scene continuity content comprises: engineering one or more prompts for the trained generative models based on the user content specification; and submitting one or more prompts as input to the trained generative models to generate the scene continuity content. 6 . The computing system of claim 5 , wherein one or more prompts include desired camera angles, temporal positions, transitions, or other scene-specific attributes. 7 . The computing system of claim 1 , further comprising: selecting a frame interpolation and view synthesis subsystem based on the content specification; and applying the frame interpolation and view synthesis module to the generated scene continuity content to create smooth transitions and novel viewpoints. 8 . The computing system of claim 1 , wherein training the selected generative models comprises: initializing the selected generative models with predefined architectures and hyperparameters; iteratively updating the model parameters using optimization algorithms; and monitoring and evaluating the training progress using metrics and validation techniques. 9 . The computing system of claim 1 , wherein outputting the generated scene continuity content comprises: applying post-processing techniques to enhance the visual quality and realism of the generated content; and providing the generated content in a format compatible with the user specification or downstream applications. 10 . The computing system of claim 1 , wherein generating scene continuity-aware content further comprises generating synchronized audio including ambient environmental sounds or character dialogue aligned with the generated visual frames. 11 . A computer-implemented method, the computer-implemented method comprising: receiving a content specification request associated with a scene for continuity-aware content management; selecting one or more generative models based on the content specification, wherein the selecting comprises analyzing scene-specific continuity requirements including object positioning, lighting conditions, and camera perspective across multiple scenes; preprocessing data based on the content specification to prepare the data for the selected or generative models, wherein the preprocessing comprises: cleaning multi-modal visual data while maintaining temporal and spatial relationships between frames; identifying and tracking specific objects across multiple scenes to ensure consistent object representation; mapping lighting patterns and camera angles across scene transitions; and transforming the data into a format that preserves visual and narrative continuity markers; selecting, training, fine-tuning, or augmenting the selected generative models using the preprocessed data, wherein the training comprises: implementing an adversarial network architecture with a generator network and a discriminator network; training the discriminator to identify visual discontinuities between scenes; and optimizing the generator to produce content that maintains consistent representation of characters, environments, narrative elements, and visual assets across multiple clips or scenes; generating scene continuity-aware content using the selected generative models based on the content specification request, wherein the generating comprises: frame interpolation and view synthesis to create transitions between scenes that maintain continuity of subject appearance, lighting, and scene geometry; perspective reconfiguration based on user-defined camera motion and angle inputs; generative scene extension through outpainting or recomposition to fill occluded or non-visible spatial regions beyond original frame boundaries; and synthesizing synchronized audio, including ambient environmental sounds and character dialogue, based on the visual content and content specification; and outputting the generated scene continuity—aware content artifacts or representations through a three-dimensional rendering engine that applies consistent shading, texture mapping, and perspective transformations across scene boundaries. 12 . The computer-implemented method of claim 11 , wherein the content specification comprises one or more design elements, user preference configuration documents, or templates associated with scene continuity generation. 13 . The computer-implemented method of claim 11 , wherein selecting one or more generative models comprises selecting a generator network and a discriminat

Assignees

Inventors

Classifications

  • G06T5/60Primary

    using machine learning, e.g. neural networks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12499515B2 cover?
A system and method for generating multimedia artifacts with managed scene continuity in visual and multimedia using an AI-based and scene continuity aware media generation platform. The system receives a user or AI agent specification or simulation result(s), selects or trains generative models based on the specification, preprocesses relevant data, and generates scene narrative or frame-speci…
Who is the assignee on this patent?
Qomplx Llc
What technology area does this patent fall under?
Primary CPC classification G06T5/60. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 16 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 7 related publications on this page (citations in our corpus or others sharing the same primary CPC).