System for reducing transaction failure
US-12175472-B2 · Dec 24, 2024 · US
US2026030837A1 · US · A1
| Field | Value |
|---|---|
| Publication number | US-2026030837-A1 |
| Application number | US-202418780747-A |
| Country | US |
| Kind code | A1 |
| Filing date | Jul 23, 2024 |
| Priority date | Jul 23, 2024 |
| Publication date | Jan 29, 2026 |
| Grant date | — |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
An apparatus comprises at least one processing device configured to extract a set of features from a user prompt using a natural language processing model, to initialize a three-dimensional scene reconstruction model utilizing a set of parameters determined based at least in part on the set of features extracted from the user prompt, and to generate, utilizing the three-dimensional scene reconstruction model, a set of two-dimensional images of a given scene from two or more different viewpoint perspectives. The at least one processing device is also configured to apply an image diffusion model to the generated set of two-dimensional images to generate a refined set of two-dimensional images, to modify the three-dimensional scene reconstruction model based at least in part on the refined set of two-dimensional images, and to utilize the modified three-dimensional scene reconstruction model to generate a three-dimensional representation of the given scene.
Opening claim text (preview).
What is claimed is: 1 . An apparatus comprising: at least one processing device comprising a processor coupled to a memory; the at least one processing device being configured: to extract a set of features from a user prompt using a natural language processing model; to initialize a three-dimensional scene reconstruction model utilizing a set of parameters determined based at least in part on the set of features extracted from the user prompt; to generate, utilizing the three-dimensional scene reconstruction model, a set of two-dimensional images of a given scene from two or more different viewpoint perspectives; to apply an image diffusion model to the generated set of two-dimensional images to generate a refined set of two-dimensional images; to modify the three-dimensional scene reconstruction model based at least in part on the refined set of two-dimensional images; and to utilize the modified three-dimensional scene reconstruction model to generate a three-dimensional representation of the given scene. 2 . The apparatus of claim 1 wherein the three-dimensional scene reconstruction model comprises a Neural Radiance Field (NeRF) model configured to take as input a three-dimensional position vector and a two-dimensional viewing direction and output a color and density at each of two or more points of the given scene. 3 . The apparatus of claim 2 wherein initializing the three-dimensional scene reconstruction model comprises initializing weights of a neural network that represents a neural radiance field. 4 . The apparatus of claim 1 wherein generating the set of two-dimensional images of the given scene from two or more different viewpoint perspectives comprises: selecting the two or more different viewpoint perspectives to capture a range of perspectives of the given scene; for each of the two or more different viewpoint perspectives, performing ray tracing through the given scene for a plurality of rays, where a color and density of each of the plurality of rays is computed using the three-dimensional scene reconstruction model; and synthesizing the set of two-dimensional images of the given scene using the plurality of rays. 5 . The apparatus of claim 1 wherein the image diffusion model comprises a denoising diffusion probabilistic model (DDPM). 6 . The apparatus of claim 1 wherein applying the image diffusion model to the generated set of two-dimensional images comprises applying a noise-reduction process to the generated set of two-dimensional images by: inputting the generated set of two-dimensional images to the image diffusion model; predicting noise added at each timestep based at least in part on an output of the image diffusion model; and removing the predicted noise from the generated set of two-dimensional images to generate the refined set of two-dimensional images. 7 . The apparatus of claim 1 wherein modifying the three-dimensional scene reconstruction model based at least in part on the refined set of two-dimensional images comprises: estimating probability densities for pixels of the refined set of two-dimensional images; and adjusting the set of parameters of the three-dimensional scene reconstruction model based at least in part on the estimated probability densities. 8 . The apparatus of claim 7 wherein estimating the probability densities for the pixels of the refined set of two-dimensional images utilizes a density estimation model that takes the refined set of two-dimensional images and the user prompt as input and computes probability density likelihoods of the pixels of the refined set of two-dimensional images. 9 . The apparatus of claim 7 wherein adjusting the set of parameters of the three-dimensional scene reconstruction model comprises utilizing a gradient descent algorithm that utilizes a loss function comprising a negative log-likelihood of the estimated probability densities for the pixels of the refined set of two-dimensional images. 10 . The apparatus of claim 1 wherein the user prompt comprises a natural language description of a design of a product, and wherein utilizing the modified three-dimensional scene reconstruction model to generate the three-dimensional representation of the given scene comprises generating a three-dimensional representation of a prototype of the product. 11 . The apparatus of claim 1 wherein the user prompt comprises a natural language description of a virtual showroom of one or more products, and wherein utilizing the modified three-dimensional scene reconstruction model to generate the three-dimensional representation of the given scene comprises generating a three-dimensional representation of the one or more products for the virtual showroom. 12 . The apparatus of claim 1 wherein the user prompt comprises a natural language description specifying one or more customizations of a product, and wherein utilizing the modified three-dimensional scene reconstruction model to generate the three-dimensional representation of the given scene comprises generating a three-dimensional representation of a customized version of the product based at least in part on the specified one or more customizations. 13 . The apparatus of claim 1 wherein the user prompt comprises a natural language description of one or more features of a product, and wherein utilizing the modified three-dimensional scene reconstruction model to generate the three-dimensional representation of the given scene comprises generating a three-dimensional representation of a training simulation for the one or more features of the product. 14 . The apparatus of claim 1 wherein the user prompt comprises a natural language description of a configuration of an information technology infrastructure environment, and utilizing the modified three-dimensional scene reconstruction model to generate the three-dimensional representation of the given scene comprises generating a three-dimensional representation of the configuration of the information technology infrastructure environment. 15 . A computer program product comprising a non-transitory processor-readable storage medium having stored therein program code of one or more software programs, wherein the program code when executed by at least one processing device causes the at least one processing device: to extract a set of features from a user prompt using a natural language processing model; to initialize a three-dimensional scene reconstruction model utilizing a set of parameters determined based at least in part on the set of features extracted from the user prompt; to generate, utilizing the three-dimensional scene reconstruction model, a set of two-dimensional images of a given scene from two or more different viewpoint perspectives; to apply an image diffusion model to the generated set of two-dimensional images to generate a refined set of two-dimensional images; to modify the three-dimensional scene reconstruction model based at least in part on the refined set of two-dimensional images; and to utilize the modified three-dimensional scene reconstruction model to generate a three-dimensional representation of the given scene. 16 . The computer program product of claim 15 wherein the three-dimensional scene reconstruction model comprises a Neural Radiance Field (NeRF) model configured to take as input a three-dimensional position vector and a two-dimensional viewing direction and output a color and density at each of two or more points of the given scene. 17 . The computer program product of claim 15 wherein modifying the three-dimensio
Related publications grouped by family.
Answers are generated from the same data shown on this page.