3D point cloud encoding and decoding method, compression method and device based on graph dictionary learning

US12046009B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12046009-B2
Application numberUS-202418590993-A
CountryUS
Kind codeB2
Filing dateFeb 29, 2024
Priority dateMar 22, 2022
Publication dateJul 23, 2024
Grant dateJul 23, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A graph dictionary learning method for a 3D point cloud comprises: obtaining N point clouds to form training dataset; performing voxelization process on the point cloud data to obtain voxelized point cloud data of the training dataset; performing voxel block division on the point cloud data of the training dataset, selecting a plurality of voxel blocks as the training dataset, and constructing a graph dictionary learning model according to the training dataset; and performing iterative optimization on the graph dictionary learning objective function to obtain a graph dictionary for encoding and decoding a 3D point cloud signal. The present disclosure effectively uses the spatial correlation between point cloud signals to near-optimally remove the redundancy among point cloud signals.

First claim

Opening claim text (preview).

What is claimed is: 1. A graph dictionary learning method for a 3D point cloud, comprising: obtaining a training dataset of N point cloud data; performing voxelization process on the N point cloud data of the training dataset to obtain voxelized point cloud data of the training dataset; performing voxel block division on the voxelized point cloud data of the training dataset, selecting a plurality of voxel blocks as a training dataset for the graph dictionary learning, constructing a graph dictionary learning model according to the training dataset, and establishing a graph dictionary learning objective function; and performing iterative optimization on the graph dictionary learning objective function to obtain a graph dictionary for encoding and decoding a 3D point cloud signal; wherein the performing voxel block division on the voxelized point cloud data of the training dataset and selecting a plurality of voxel blocks as a training dataset for the graph dictionary learning comprises: uniformly dividing a boundary cube where all the voxelized point cloud data of the training dataset is located into m×m×m voxel blocks, m being a pre-set side length; sorting all the voxel blocks of each point cloud data of the training dataset in descending order according to the number of contained voxels; selecting top r voxel blocks containing the most number of voxels from each point cloud data of the training dataset, and calculating an average value of the attribute signals of the voxels contained in each voxel block in the top r voxel blocks as a direct current attribute signal of the voxel block, r being a pre-set positive integer; and subtracting the direct current attribute signal from each voxel to obtain a residual attribute signal as a training signal, forming a training dataset for graph dictionary learning. 2. A graph dictionary learning method for a 3D point cloud, comprising: obtaining a training dataset of N point cloud data; performing voxelization process on the N point cloud data of the training dataset to obtain voxelized point cloud data of the training dataset; performing voxel block division on the voxelized point cloud data of the training dataset, selecting a plurality of voxel blocks as a training dataset for the graph dictionary learning, constructing a graph dictionary learning model according to the training dataset, and establishing a graph dictionary learning objective function; and performing iterative optimization on the graph dictionary learning objective function to obtain a graph dictionary for encoding and decoding a 3D point cloud signal; wherein the constructing a graph dictionary learning model according to the training dataset, and establishing a graph dictionary learning objective function comprises: constructing a topology-connected graph structure =(V, ε, W) by regarding a signal in the training dataset as a graph signal, wherein V represents a set with m 3 nodes; ε represents a set of edges connecting m 3 nodes; W∈ m 3 ×m 3 represents a weight matrix of the edges; constructing a graph dictionary learning model by using the eigen-basis function of p-Laplacian operator of a topology-connected graph structure to establish a graph dictionary learning objective function: min D , A 1 n ⁢ ∑ i = 1 n 1 2 ⁢  M i ( x i - D ⁢ α i )  2 2 + β ⁢ ∑ s = 1 s J ⁡ ( D s ) + γ ⁢  D T ⁢ D - I  F 2 + λ n ⁢ ∑ i = 1 n  α i  1 where x i ∈ m 3 represents an i th training voxel block; M i ∈ m i ×m 3 represents a mask matrix with each element having a value of 0 or 1 used for extracting a corresponding m i voxels from x i ; D∈ m 3 ×m 3 represents an overcomplete graph dictionary composed of S complete sub-dictionaries, namely, D=[D 1 , . . . , D S ]; J ⁡ ( D s ) = ∑ k ⁢ ∑ i ⁢ j

Assignees

Inventors

Classifications

  • Engine management systems · CPC title

  • G06T9/40Primary

    Tree coding, e.g. quadtree, octree · CPC title

  • G06T9/001Primary

    Model-based coding, e.g. wire frame · CPC title

  • Training; Learning · CPC title

  • Range image; Depth image; 3D point clouds · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12046009B2 cover?
A graph dictionary learning method for a 3D point cloud comprises: obtaining N point clouds to form training dataset; performing voxelization process on the point cloud data to obtain voxelized point cloud data of the training dataset; performing voxel block division on the point cloud data of the training dataset, selecting a plurality of voxel blocks as the training dataset, and constructing …
Who is the assignee on this patent?
Univ Shanghai Jiaotong
What technology area does this patent fall under?
Primary CPC classification G06T9/40. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 23 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).