Methods and systems for generating three dimensional (3D) models of objects

US12154226B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12154226-B2
Application numberUS-202217948863-A
CountryUS
Kind codeB2
Filing dateSep 20, 2022
Priority dateMay 24, 2021
Publication dateNov 26, 2024
Grant dateNov 26, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for generating a three-dimensional (3D) model of an object includes receiving a two-dimensional (2D) view of at least one object as an input, measuring geometrical shape coordinates of the at least one object from the input, identifying texture parameters of the at least one object from the input, predicting geometrical shape coordinates and texture parameters of occluded portions of the at least one object in the 2D view by processing the measured geometrical shape coordinates of the at least one object, the identified texture parameters of the at least one object, and the occluded portions of the at least one object, and generating a 3D model of the at least one object by mapping the measured geometrical shape coordinates and the identified texture parameters to the predicted geometrical shape coordinates and the predicted texture parameters of the occluded portions of the at least one object.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for generating a three-dimensional (3D) model of an object, the method comprising: receiving, by an electronic device, a two-dimensional (2D) view of at least one object as an input; measuring, by the electronic device, geometrical shape coordinates of the at least one object from the input; identifying, by the electronic device, texture parameters of the at least one object from the input; predicting, by the electronic device, geometrical shape coordinates and texture parameters of occluded portions of the at least one object in the 2D view by processing the measured geometrical shape coordinates of the at least one object, the identified texture parameters of the at least one object, and the occluded portions of the at least one object; and generating, by the electronic device, a 3D model of the at least one object by mapping the measured geometrical shape coordinates and the identified texture parameters to the predicted geometrical shape coordinates and the predicted texture parameters of the occluded portions of the at least one object, wherein the predicting of the geometrical shape coordinates and the texture parameters of the occluded portions of the at least one object in the 2D view comprises: processing the measured geometrical shape coordinates of the at least one object, the identified texture parameters of the at least one object, and the occluded portions of the at least one object by using a first module of a third neural network to predict normal maps for the at least one object; processing the measured geometrical shape coordinates of the at least one object, the identified texture parameters of the at least one object, and the occluded portions of the at least one object by using a second module of the third neural network to predict occluded portions normal maps for the at least one object from the predicted normal maps; and processing the measured geometrical shape coordinates of the at least one object, the identified texture parameters of the at least one object, and the occluded portions of the at least one object by using a third module of the third neural network to predict a texture map of the occluded portions of the at least one object using a training input, wherein the normal maps and the occluded portions normal maps indicate the geometrical shape coordinates of the occluded potions of the at least one object at multiple scales, and the texture map indicates the texture parameters of the occluded potions of the at least one object. 2. The method of claim 1 , wherein the first, second and third modules of the third neural network include separate convolutional neural networks or generative adversarial networks, wherein the first module is trained to predict the normal maps from the input, wherein the second module is trained to predict the occluded portions normal maps from the predicted normal maps, and wherein the third module is trained to predict the texture parameters of the occluded portions using the input. 3. The method of claim 1 , wherein the third neural network is trained in a plurality of epochs, and wherein the training of the third neural network at each epoch of the plurality of epochs comprises: sampling surface points and texture RGB values from a training input, wherein the training input includes the batches of 2D images; identifying misclassified surface points from previous epochs and providing weightage to the misclassified surface points; training the third neural network based on the identified misclassified surface points to predict the geometrical shape coordinates and the texture parameters of the occluded portions of each object present in the training input; deriving a cost function to calculate an accuracy of the third neural network in predicting the geometrical shape coordinates and the texture parameters of the occluded portions of each object present in the training input, wherein the cost function is a weighted sum of losses computed with respect to the geometrical shape coordinates and the texture parameters; and back propagating the cost function to the third neural network to train the third neural network iteratively. 4. The method of claim 1 , wherein a fourth neural network is used to detect the occluded portions of the at least one object, and wherein the fourth neural network includes a Conditional Generative Adversarial Network (CGAN). 5. An electronic device comprising: a processor; and a memory storing instructions executable by the processor, wherein the processor is coupled to the memory and configured to: receive a two-dimensional (2D) view of at least one object as an input; measure geometrical shape coordinates of the at least one object from the input; identify texture parameters of the at least one object from the input; predict geometrical shape coordinates and texture parameters of occluded portions of the at least one object in the 2D view by processing the measured geometrical shape coordinates of the at least one object, the identified texture parameters of the at least one object, and the occluded portions of the at least one object; and generate a three-dimensional (3D) model of the at least one object by mapping the measured geometrical shape coordinates and the identified texture parameters to the predicted geometrical shape coordinates and the predicted texture parameters of the occluded portions of the at least one object, wherein the processor is further configured to: process the measured geometrical shape coordinates of the at least one object, the identified texture parameters of the at least one object, and the occluded portions of the at least one object by using a first module of a third neural network to predict normal maps for the at least one object; process the measured geometrical shape coordinates of the at least one object, the identified texture parameters of the at least one object, and the occluded portions of the at least one object by using a second module of the third neural network to predict occluded portions normal maps for the at least one object from the predicted normal maps; and process the measured geometrical shape coordinates of the at least one object, the identified texture parameters of the at least one object, and the occluded portions of the at least one object by using a third module of the third neural network to predict a texture map of the occluded portions of the at least one object using a training input, wherein the normal maps and the occluded portions normal maps indicate the geometrical shape coordinates of the occluded potions of the at least one object at multiple scales, and the texture map indicates the texture parameters of the occluded potions of the at least one object. 6. An electronic device comprising: a processor; and a memory storing instructions executable by the processor, wherein the processor is coupled to the memory and configured to: receive a two-dimensional (2D) view of at least one object as an input; measure geometrical shape coordinates of the at least one object from the input; identify texture parameters of the at least one object from the input; predict geometrical shape coordinates and texture parameters of occluded portions of the at least one object in the 2D view by processing the measured geometrical shape coordinates of the at least one object, the identified texture parameters of the at least one object, and the occluded portions of the at least one object; and generate a three-dimensional (3D) model of the at least one object by mapping the measured geometrical shape coordinates and the identified texture parameters to the predicted geometrical shape coordinates and the predicted texture parameters of the occluded portions of the at least one objec

Assignees

Inventors

Classifications

  • Adversarial learning · CPC title

  • Generative networks · CPC title

  • Supervised learning · CPC title

  • Convolutional networks [CNN, ConvNet] · CPC title

  • Auto-encoder networks; Encoder-decoder networks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12154226B2 cover?
A method for generating a three-dimensional (3D) model of an object includes receiving a two-dimensional (2D) view of at least one object as an input, measuring geometrical shape coordinates of the at least one object from the input, identifying texture parameters of the at least one object from the input, predicting geometrical shape coordinates and texture parameters of occluded portions of t…
Who is the assignee on this patent?
Samsung Electronics Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06T15/04. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Nov 26 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).