Methods and systems for instance-wise segmentation of a 3D point cloud based on segmented 2D images

US12518478B2 · US · B2

Patent metadata
Publication number: US-12518478-B2
Application number: US-202318237348-A
Country: US
Kind code: B2
Filing date: Aug 23, 2023
Priority date: Aug 23, 2023
Publication date: Jan 6, 2026
Grant date: Jan 6, 2026

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An illustrative point cloud segmentation system generates an instance-wise semantic mask for a particular source image of a set of source images that has been used to construct a 3D point cloud representing a scene that includes one or more objects. The point cloud segmentation system maps a set of 3D points from the 3D point cloud to corresponding 2D points of the particular source image, then labels, based on contours defined by the instance-wise semantic mask to demarcate the one or more objects, the mapped set of 3D points in accordance with where the corresponding 2D point for each mapped 3D point is positioned with respect to the contours. Based on the labeling of the mapped 3D points, the point cloud segmentation system produces a segmented 3D point cloud including an instance-wise segmentation of the one or more objects at the scene. Corresponding methods and systems are also disclosed.
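The map-then-label step described in the abstract can be illustrated with a minimal sketch. It is not the patent's implementation: it assumes a simple pinhole camera with the 3D points already expressed in the camera frame, and it assumes each object's contour is available as a 2D polygon. The names `project`, `inside`, and `label_points`, the object names, and all parameter values are hypothetical.

```python
from typing import Dict, Optional, Sequence, Tuple

Point3D = Tuple[float, float, float]
Point2D = Tuple[float, float]
Polygon = Sequence[Point2D]


def project(point: Point3D, focal: float, cx: float, cy: float) -> Optional[Point2D]:
    """Pinhole projection (assumed camera model); None if the point is behind the camera."""
    x, y, z = point
    if z <= 0:
        return None
    return (focal * x / z + cx, focal * y / z + cy)


def inside(pt: Point2D, polygon: Polygon) -> bool:
    """Ray-casting point-in-polygon test (even-odd rule)."""
    px, py = pt
    hit = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # Only edges that straddle the horizontal line through pt can be crossed.
        if (y1 > py) != (y2 > py):
            x_cross = x1 + (py - y1) * (x2 - x1) / (y2 - y1)
            if px < x_cross:
                hit = not hit
    return hit


def label_points(
    points: Sequence[Point3D],
    contours: Dict[str, Polygon],
    focal: float,
    cx: float,
    cy: float,
) -> Dict[int, str]:
    """Label each 3D point with the first contour that encloses its 2D projection."""
    labels: Dict[int, str] = {}
    for idx, p in enumerate(points):
        uv = project(p, focal, cx, cy)
        if uv is None:
            continue  # not visible in this image
        for name, poly in contours.items():
            if inside(uv, poly):
                labels[idx] = name
                break
    return labels


# Hypothetical contours for two objects in a 100x100 image.
contours = {
    "antenna_a": [(10, 10), (50, 10), (50, 90), (10, 90)],
    "antenna_b": [(60, 10), (90, 10), (90, 90), (60, 90)],
}
points = [(-0.2, 0.0, 1.0), (0.25, 0.0, 1.0), (0.05, 0.0, 1.0), (0.0, 0.0, -1.0)]
labels = label_points(points, contours, focal=100.0, cx=50.0, cy=50.0)
```

Points that project outside every contour, or that lie behind the camera, are left unlabeled in this sketch. A real pipeline would more likely reuse the 2D-3D correspondences produced while constructing the point cloud rather than reprojecting from scratch.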

First claim

Opening claim text (preview).

What is claimed is:

1. A method comprising:
    generating, by a point cloud segmentation system, an instance-wise semantic mask for a particular source image of a set of source images that has been used to construct a 3D point cloud representing a scene that includes one or more objects;
    mapping, by the point cloud segmentation system, a set of 3D points from the 3D point cloud to corresponding 2D points of the particular source image;
    labeling, by the point cloud segmentation system based on contours defined by the instance-wise semantic mask to demarcate the one or more objects, the mapped set of 3D points in accordance with where the corresponding 2D point for each mapped 3D point is positioned with respect to the contours;
    identifying, by the point cloud segmentation system, a correlation between:
      a first group of the mapped set of 3D points that are each labeled, based on the contours defined by the instance-wise semantic mask, as being associated with a first object of the one or more objects, and
      a second group of 3D points mapped from the 3D point cloud to corresponding 2D points of an additional source image of the set of source images and that are each labeled, based on contours defined by an additional instance-wise semantic mask for the additional source image, as being associated with a second object of the one more objects;
    determining, by the point cloud segmentation system and based on the correlation, that the first and second objects are a same object; and
    merging, by the point cloud segmentation system and based on the determining that the first and second objects are the same object, the first group and the second group to be labeled as being associated with the same object.

2. The method of claim 1, further comprising constructing, by the point cloud segmentation system and based on the set of source images, the 3D point cloud representing the scene, the constructing including determining a transformation between a 2D image space of the particular source image and a 3D world space associated with the scene; wherein the mapping of the set of 3D points from the 3D point cloud to the corresponding 2D points of the particular source image is performed based on the transformation between the 2D image space and the 3D world space.

3. The method of claim 1, wherein the identifying of the correlation includes determining, during the labeling of the mapped set of 3D points, that a first mapped 3D point of the first group is already labeled as being associated with the second object.

4. The method of claim 1, wherein the identifying of the correlation includes identifying a geometric overlap between the first group and the second group within a 3D world space with which the 3D point cloud is associated.

5. The method of claim 1, wherein the mapping of the set of 3D points from the 3D point cloud to the corresponding 2D points of the particular source image includes accessing 2D-3D mapping data that is generated and stored as part of constructing the 3D point cloud based on the set of source images.

6. The method of claim 1, wherein the 3D point cloud is a sparse point cloud in which the set of 3D points is limited to 3D points corresponding to 2D keypoints identified in the set of source images to be associated with prominent features of the scene.

7. The method of claim 1, wherein the 3D point cloud is a dense point cloud and the set of 3D points includes both:
    3D points corresponding to 2D keypoints identified in the set of source images to be associated with prominent features of the scene; and
    additional 3D points located between the 3D points corresponding to the 2D keypoints.

8. The method of claim 1, further comprising constructing, by the point cloud segmentation system and based on the set of source images, the 3D point cloud based on at least one of:
    a multi-view stereo scene construction technique;
    a structure-from-motion scene construction technique; or
    a time-of-flight scene construction technique.

9. The method of claim 1, wherein the set of source images is captured using a machine configured to gain access to an area where the scene is located.

10. The method of claim 9, wherein:
    the scene is atop a cell tower;
    the one or more objects include a plurality of antennas; and
    the machine is a drone configured with at least one of photography or videography capabilities.

11. The method of claim 1, further comprising producing, by the point cloud segmentation system based on the labeling of the mapped set of 3D points, a segmented 3D point cloud including an instance-wise segmentation of the one or more objects at the scene.

12. The method of claim 11, further comprising using the segmented 3D point cloud to perform at least one of:
    tracking an individual status for each of the one or more objects; or
    determining an individual physical characteristic for a particular object of the one or more objects.

13. A system comprising:
    a memory storing instructions; and
    one or more processors communicatively coupled to the memory and configured to execute the instructions to perform a process comprising:
      generating an instance-wise semantic mask for a particular source image of a set of source images that has been used to construct a 3D point cloud representing a scene that includes one or more objects;
      mapping a set of 3D points from the 3D point cloud to corresponding 2D points of the particular source image;
      labeling, based on contours defined by the instance-wise semantic mask to demarcate the one or more objects, the mapped set of 3D points in accordance with where the corresponding 2D point for each mapped 3D point is positioned with respect to the contours;
      identifying a correlation between:
        a first group of the mapped set of 3D points that are each labeled, based on the contours defined by the instance-wise semantic mask, as being associated with a first object of the one or more objects, and
        a second group of 3D points mapped from the 3D point cloud to corresponding 2D points of an additional source image of the set of source images and that are each labeled, based on contours defined by an additional instance-wise semantic mask for the additional source image, as being associated with a second object of the one more objects;
      determining, based on the correlation, that the first and second objects are a same object; and
      merging, based on the determining that the first and second objects are the same object, the first group and the second group to be labeled as being associated with the same object.

14. The system of claim 13, wherein the process further comprises constructing, based on the set of source images, the 3D point cloud representing the scene, the constructing including determining a transformation between a 2D image space of the particular source image and a 3D world space associated with the scene; wherein the mapping of the set of 3D points from the 3D point cloud to the corresponding 2D points of the particular source image is performed based on the transformation between the 2D image space and the 3D world space.

15. The system of claim 13, wherein the 3D point cloud is a sparse point cloud in which the set of 3D points is limited to 3D points corresponding to 2D keypoints identified in the set of source images to be associated with prominent features of the scene.

16. The system of claim 13, wherein the 3D point cloud is a dense point cloud in which the set of 3D points includes both: 3D points corresponding to 2D keypoints identified in the set of source image
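
The correlate-and-merge steps recited in claims 1 and 13 can be sketched as follows. This is an illustrative assumption, not the claimed implementation: it treats "correlation" as two per-image groups sharing at least `min_shared` 3D point indices (in the spirit of claim 3, where a point is found to be already labeled with another object), and it uses a union-find structure to merge correlated groups. The function name `merge_groups`, the threshold, and the group keys are hypothetical.

```python
from typing import Dict, Hashable, List, Set


def merge_groups(groups: Dict[Hashable, Set[int]], min_shared: int = 1) -> List[Set[int]]:
    """Merge per-image groups of 3D point indices that share enough points.

    `groups` maps a (hypothetical) per-image object key to the indices of the
    3D points labeled with that object in that image.
    """
    keys = list(groups)
    parent = {k: k for k in keys}

    def find(k: Hashable) -> Hashable:
        # Follow parent links to the representative, compressing the path as we go.
        while parent[k] != k:
            parent[k] = parent[parent[k]]
            k = parent[k]
        return k

    # Correlate every pair of groups by counting shared point indices.
    for i, a in enumerate(keys):
        for b in keys[i + 1:]:
            if len(groups[a] & groups[b]) >= min_shared:
                parent[find(a)] = find(b)

    # Collect the points of all groups that ended up with the same representative.
    merged: Dict[Hashable, Set[int]] = {}
    for k in keys:
        merged.setdefault(find(k), set()).update(groups[k])
    return sorted(merged.values(), key=lambda s: min(s, default=-1))


# Hypothetical labels from two source images of the same scene.
groups = {
    ("image_0", "object_1"): {0, 1, 2},
    ("image_1", "object_A"): {2, 3},
    ("image_1", "object_B"): {7, 8},
}
objects = merge_groups(groups)
```

In this example, point 2 appears in both `("image_0", "object_1")` and `("image_1", "object_A")`, so those two groups are treated as the same object, while `("image_1", "object_B")` stays separate. A geometric-overlap test in 3D world space, as in claim 4, could replace the shared-index test without changing the merging logic.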

Assignees

Verizon Patent & Licensing Inc

Inventors

Classifications

  • Descriptors for shape, contour or point-related descriptors, e.g. scale invariant feature transform [SIFT] or bags of words [BoW]; Salient regional features (colour feature extraction G06V10/56) · CPC title

  • for displaying simultaneously · CPC title

  • G06V20/70 · Primary

    Labelling scene content, e.g. deriving syntactic or semantic representations · CPC title

  • Target detection · CPC title

  • Edge-based segmentation · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12518478B2 cover?
An illustrative point cloud segmentation system generates an instance-wise semantic mask for a particular source image of a set of source images that has been used to construct a 3D point cloud representing a scene that includes one or more objects. The point cloud segmentation system maps a set of 3D points from the 3D point cloud to corresponding 2D points of the particular source image, then…
Who is the assignee on this patent?
Verizon Patent & Licensing Inc
What technology area does this patent fall under?
Primary CPC classification G06V20/70. Mapped technology areas include Physics.
When was this patent published?
Publication date: Jan 6, 2026 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).