Learnable global bases for generating three-dimensional representations from single-view data collections

US2026099998A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2026099998-A1
Application numberUS-202519351193-A
CountryUS
Kind codeA1
Filing dateOct 6, 2025
Priority dateOct 8, 2024
Publication dateApr 9, 2026
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Generating a three-dimensional representation from a single-view includes receiving a single-view image, generating a plurality of coefficients, generating a 3D representation from a plurality of basis elements and the plurality of coefficients, processing the 3D representation and the single-view image to generate a plurality of optimized coefficients, generating an optimized 3D representation from the plurality of coefficients and the plurality of optimized basis elements, and rendering the optimized 3D representation to generate a volume rendering, and reconstructing a 3D scene from the volume rendering.

First claim

Opening claim text (preview).

What is claimed is: 1 . A computer-implemented method for reconstructing 3D scenes, the method comprising: receiving a single-view image; generating a plurality of coefficients; generating an optimized 3D representation from a plurality of optimized basis elements and the plurality of coefficients; rendering the optimized 3D representation to generate a volume rendering; and reconstructing a 3D scene from the volume rendering. 2 . The computer-implemented method of claim 1 , wherein the single-view image is a 2D image. 3 . The computer-implemented method of claim 1 , wherein the plurality of optimized basis elements are voxels or triplanes. 4 . The computer-implemented method of claim 1 , wherein generating the optimized 3D representation comprises generating a linear combination of the plurality of optimized basis elements using the plurality of coefficients. 5 . The computer-implemented method of claim 1 , wherein generating the volume rendering comprises ray casting or shear warping. 6 . The computer-implemented method of claim 1 , wherein generating the plurality of coefficients comprises using a machine learning model. 7 . The computer-implemented method of claim 6 , wherein the machine learning model comprises a vision transformer. 8 . The computer-implemented method of claim 1 , wherein generating the plurality of coefficients comprises: processing the single-view image using a machine learning model to generate a partial observation map, a depth map, and a probability distribution map; sampling the depth map to generate a dense set of 3D points; and performing Monte Carlo integration on 3D points in the dense set of 3D points based on the probability distribution map to generate a plurality of coefficients. 9 . The computer-implemented method of claim 8 , wherein the machine learning model comprises a U-Net model or a convolutional network. 10 . The computer-implemented method of claim 1 , wherein generating the plurality of optimized bases elements comprises: generating a 3D representation from a plurality of basis elements and the plurality of coefficients; rendering the 3D representation to generate a plurality of volume renderings; and minimizing a batch reconstruction loss between the plurality of volume renderings and a plurality of single-view images to generate the plurality of optimized bases elements. 11 . The computer-implemented method of claim 10 , wherein the batch reconstruction loss comprises one or more of an L1 loss, a mean squared error, or an LPIPS metric. 12 . The computer-implemented method of claim 1 , wherein generating the plurality of optimized bases elements comprises: generating a 3D representation from a plurality of basis elements and the plurality of coefficients; rendering the 3D representation to generate a plurality of volume renderings; and minimizing a batch reconstruction loss between the plurality of volume renderings and a partial observation map and the plurality of volume renderings and a plurality of single-view images to generate the plurality of optimized bases elements. 13 . The computer-implemented method of claim 12 , wherein the batch reconstruction loss comprises one or more of an L1 loss, a mean squared error, or an LPIPS metric. 14 . One or more non-transitory computer-readable media storing instructions that, when executed by at least one processor, cause the at least one processor to perform the steps of: receiving a single-view image; generating a plurality of coefficients; generating an optimized 3D representation from a plurality of optimized basis elements and the plurality of coefficients; rendering the optimized 3D representation to generate a volume rendering; and reconstructing a 3D scene from the volume rendering. 15 . The one or more non-transitory computer-readable media of claim 14 , wherein generating the optimized 3D representation comprises generating a linear combination of the plurality of optimized basis elements using the plurality of coefficients. 16 . The one or more non-transitory computer-readable media of claim 14 , wherein generating the plurality of coefficients comprises using a machine learning model. 17 . The one or more non-transitory computer-readable media of claim 14 , wherein generating the plurality of coefficients comprises: processing the single-view image using a machine learning model to generate a partial observation map, a depth map, and a probability distribution map; sampling the depth map to generate a dense set of 3D points; and performing Monte Carlo integration on 3D points in the dense set of 3D points based on the probability distribution map to generate a plurality of coefficients. 18 . The one or more non-transitory computer-readable media of claim 14 , wherein generating the plurality of optimized bases elements comprises: generating a 3D representation from a plurality of basis elements and the plurality of coefficients; rendering the 3D representation to generate a plurality of volume renderings; and minimizing a batch reconstruction loss between the plurality of volume renderings and a plurality of single-view images to generate the plurality of optimized bases elements. 19 . The one or more non-transitory computer-readable media of claim 14 , wherein generating the plurality of optimized bases elements comprises: generating a 3D representation from a plurality of basis elements and the plurality of coefficients; rendering the 3D representation to generate a plurality of volume renderings; minimizing a batch reconstruction loss between the plurality of volume renderings and a partial observation map and the plurality of volume renderings and a plurality of single-view images to generate the plurality of optimized bases elements. 20 . A system, comprising: one or more memories storing instructions; and one or more processors that are coupled to the one or more memories and, when executing the instructions, are configured to perform steps comprising: receiving a single-view image; generating a plurality of coefficients; generating an optimized 3D representation from a plurality of optimized basis elements and the plurality of coefficients; rendering the optimized 3D representation to generate a volume rendering; and reconstructing a 3D scene from the volume rendering.

Assignees

Inventors

Classifications

  • Volume rendering · CPC title

  • Particle system, point based geometry or rendering · CPC title

  • G06T17/00Primary

    Three-dimensional [3D] modelling for computer graphics · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2026099998A1 cover?
Generating a three-dimensional representation from a single-view includes receiving a single-view image, generating a plurality of coefficients, generating a 3D representation from a plurality of basis elements and the plurality of coefficients, processing the 3D representation and the single-view image to generate a plurality of optimized coefficients, generating an optimized 3D representation…
Who is the assignee on this patent?
Nvidia Corp
What technology area does this patent fall under?
Primary CPC classification G06T17/00. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Apr 09 2026 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).