Efficient three-dimensional object detection from point clouds
US-2022156483-A1 · May 19, 2022 · US
US12387430B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12387430-B2 |
| Application number | US-202017111271-A |
| Country | US |
| Kind code | B2 |
| Filing date | Dec 3, 2020 |
| Priority date | Dec 3, 2020 |
| Publication date | Aug 12, 2025 |
| Grant date | Aug 12, 2025 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Apparatuses, systems, and techniques are presented to generate images. In at least one embodiment, one or more neural networks are used to generate one or more images based, at least in part, upon one or more semantic features projected from a three-dimensional environment.
Opening claim text (preview).
What is claimed is: 1. One or more processors, comprising: circuitry to use one or more neural networks to generate one or more 2D images from one or more voxels representative of one or more objects and encoded semantic information corresponding to the one or more voxels. 2. The one or more processors of claim 1 , wherein the circuitry is further to: identify the one or more voxels from a plurality of geometric objects, of one or more object types, used to build a 3D environment. 3. The one or more processors of claim 1 , wherein the one or more objects include one or more geometric objects that are blocks having semantic feature data based at least on information associated with one or more voxels corresponding to one or more comers of the blocks, respective instances of the semantic feature data including at least a respective object type and position data within a 3D environment. 4. The one or more processors of claim 1 , wherein the circuitry is further to determine a set of semantic features visible from a field of view of a virtual camera, and project the set of semantic features into a 2D representation. 5. The one or more processors of claim 1 , wherein the the circuitry is further to encode a plurality of semantic features corresponding to the one or more voxels before determining a set of semantic features to be used in a 2D representation. 6. The one or more processors of claim 1 , wherein the the circuitry is further to generate a semantic segmentation mask from a set of semantic features identified in the one or more voxels, and wherein the one or more neural networks include a generative adversarial network (GAN) for generating the one or more 2D images using the semantic segmentation mask. 7. A system comprising: one or more processors to use one or more neural networks to generate one or more 2D images from one or more voxels representative of one or more objects part, on and encoded semantic information corresponding to the one or more voxels. 8. The system of claim 7 , wherein the one or more processors are further to identify the one or more voxels from a plurality of geometric objects, of one or more object types, used to build a 3D environment. 9. The system of claim 7 , wherein the one or more objects include one or more geometric objects are blocks having semantic feature data based, at least in part, on information associated with one or more voxels corresponding to one or more corners of the blocks, respective instances of the semantic feature data including at least a respective object type and position data within a 3D environment. 10. The system of claim 7 , wherein the one or more processors are further to use an encoder of the one or more neural networks to determine a set of semantic features visible from a field of view of a virtual camera, and use a generator of the one or more neural networks to project the set of semantic features into a 2D semantic feature representation to be used to generate the one or more 2D images. 11. The system of claim 7 , wherein the one or more processors are further to encode a plurality of semantic features before determining a set of semantic features used in the one or more 2D images. 12. The system of claim 7 , wherein the one or more processors are further to generate a semantic segmentation mask from a set of semantic features, and wherein the one or more neural networks include a generative adversarial network (GAN) for generating the one or more 2D objects images using the semantic segmentation mask. 13. A method comprising: using one or more neural networks to generate one or more 2D images from one or more voxels representative of one or more objects and encoded semantic information corresponding to the one or more voxels and provided as input to the one or more neural networks. 14. The method of claim 13 , further comprising: identifying the one or more voxels from a plurality of geometric objects, of one or more object types, used to build a 3D environment. 15. The method of claim 13 , wherein the one or more one or more objects are blocks having semantic feature data based, at least in part, on information associated with one or more voxels corresponding to one or more corners of the blocks, respective instances of the semantic feature data including at least a respective object type and position data within a 3D environment. 16. The method of claim 13 , further comprising: determining a set of semantic features visible from a field of view of a virtual camera, and project the set of semantic features into the one or more 2D images. 17. The method of claim 13 , further comprising: encoding a plurality of semantic features before determining a set of semantic features to be used in the one or more 2D images. 18. The method of claim 13 , further comprising: generating a semantic segmentation mask from a set of semantic features, wherein the one or more neural networks include a generative adversarial network (GAN) for generating the one or more 2D images using the semantic segmentation mask. 19. A non-transitory computer-readable medium having stored thereon a set of instructions, which if performed by one or more processors, cause the one or more processors to at least: use one or more neural networks to generate one or more 2D images from one or more voxels representative of one or more objects and encoded semantic information corresponding to the one or more voxels. 20. The non-transitory computer-readable medium of claim 19 , wherein the instructions if performed further cause the one or more processors to: identify one or more semantic features from a plurality of geometric objects, of one or more object types, used to build a 3D environment. 21. The non-transitory computer-readable medium of claim 19 , wherein the one or more objects are blocks having semantic feature data based, at least in part, on information associated with one or more voxels corresponding to one or more comers of the blocks, respective instances of the semantic feature data including at least a respective object type and position data within a 3D environment. 22. The non-transitory computer-readable medium of claim 19 , wherein the instructions if performed further cause the one or more processors to: determine a set of semantic features visible from a field of view of a virtual camera, and project the set of semantic features into the one or more 2D images. 23. The non-transitory computer-readable medium of claim 19 , wherein the instructions if performed further cause the one or more processors to: encode a plurality of semantic features before determining a set of semantic features to be used in the one or more 2D images. 24. The non-transitory computer-readable medium of claim 19 , wherein the instructions if performed further cause the one or more processors to: generate a semantic segmentation mask from a set of semantic features, and wherein the one or more neural networks include a generative adversarial network (GAN) for generating the one or more 2D images using the semantic segmentation mask. 25. An image generation system, comprising: one or more processors to use one or more neural networks to generate one or more 2D images from one or more voxels representative of one or more objects and encoded semantic information corresponding to the one or more voxels; and memory for storing network parameters for the one or more neural networks.
by matching two-dimensional images to three-dimensional objects · CPC title
Edge-based segmentation · CPC title
Artificial neural networks [ANN] · CPC title
Aligning objects, relative positioning of parts · CPC title
Geometric effects · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.