Object-based volumetric video coding

US12230002B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12230002-B2
Application numberUS-202017631301-A
CountryUS
Kind codeB2
Filing dateSep 30, 2020
Priority dateOct 1, 2019
Publication dateFeb 18, 2025
Grant dateFeb 18, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods, apparatus, systems and articles of manufacture for object-based volumetric video coding are disclosed. An example apparatus disclosed herein includes a point annotator to receive point cloud data associated with an object and annotate points of the point cloud data with an object identifier of the object. The disclosed example apparatus also includes a projector to project the point cloud data onto projection planes to produce texture images and geometry images. The disclosed example apparatus further includes a patch generator to generate a patch based on the object identifier, the patch including the texture images and the geometry images of the object, the patch associated with the object identifier of the object. The disclosed example apparatus also includes an atlas generator to generate an atlas to include in encoded video data, the atlas including the patch.

First claim

Opening claim text (preview).

What is claimed is: 1. A video encoder comprising: interface circuitry; instructions; and at least one processor circuit to be programmed by the instructions to: project first point cloud data associated with a first object identifier onto a first pair of projection layers of a projection plane, the first pair of projection layers to be tagged with the first object identifier to associate the first pair of projection layers with a first object; project second point cloud data associated with a second object identifier onto a second pair of projection layers of the projection plane, the second pair of projection layers different from the first pair of projection layers, the second object identifier different from the first object identifier, the second pair of projection layers to be tagged with the second object identifier to associate the second pair of projection layers with a second object different than the first object; generate a patch based on at least one projection layer of the first pair of projection layers, the patch associated with the first object identifier of the first object; and generate an atlas to include in encoded video data, the atlas including the patch. 2. The video encoder of claim 1 , wherein the projection plane is a first projection plane, and one or more of the at least one processor circuit is to: project the first point cloud data associated with the first object identifier onto a third pair of projection layers assigned to the first object, the third pair of projection layers associated with a second projection plane; and project the second point cloud data associated with the second object identifier onto a fourth pair of projection layers assigned to the second object, the fourth pair of projection layers associated with the second projection plane. 3. The video encoder of claim 1 , wherein the first pair of projection layers includes a first near layer and a first far layer assigned to the first object, and the second pair of projection layers includes a second near layer and a second far layer assigned to the second object, the second near layer different from the first near layer, and the second far layer different from the first far layer. 4. The video encoder of claim 1 , wherein one or more of the at least one processor circuit is to order the first pair of projection layers before the second pair of projection layers when the first object is closer to the projection plane than the second object. 5. The video encoder of claim 1 , wherein the patch is a first patch associated with the first object identifier, and one or more of the at least one processor circuit is to generate a second patch based on at least one projection layer of the second pair of projection layers, the second patch associated with the second object identifier. 6. The video encoder of claim 5 , wherein one or more of the at least one processor circuit is to stack the first patch before the second patch when the first object is closer to the projection plane than the second object. 7. The video encoder of claim 1 , wherein one or more of the at least one processor circuit is to annotate points of the first point cloud data with the first object identifier and annotate points of the second point cloud data with the second object identifier. 8. The video encoder of claim 1 , wherein the patch is a first patch, and one or more of the at least one processor circuit is to generate a second patch associated with the second object identifier, the second patch at a lower resolution than the first patch. 9. At least one non-transitory computer readable medium comprising instructions to cause at least one processor circuit to at least: project first point cloud data associated with a first object identifier onto a first pair of projection layers of a projection plane, the first pair of projection layers to be tagged with the first object identifier to associate the first pair of projection layers with a first object; project second point cloud data associated with a second object identifier onto a second pair of projection layers of the projection plane, the second pair of projection layers different from the first pair of projection layers, the second object identifier different from the first object identifier, the second pair of projection layers to be tagged with the second object identifier to associate the second pair of projection layers with a second object different than the first object; generate a patch based on at least one projection layer of the first pair of projection layers, the patch associated with the first object identifier of the first object; and generate an atlas to include in encoded video data, the atlas including the patch. 10. The at least one non-transitory computer readable medium of claim 9 , wherein the projection plane is a first projection plane, and the instructions are to cause one or more of the at least one processor circuit to: project the first point cloud data associated with the first object identifier onto a third pair of projection layers assigned to the first object, the third pair of projection layers associated with a second projection plane; and project the second point cloud data associated with the second object identifier onto a fourth pair of projection layers assigned to the second object, the fourth pair of projection layers associated with the second projection plane. 11. The at least one non-transitory computer readable medium of claim 9 , wherein the first pair of projection layers includes a first near layer and a first far layer assigned to the first object, and the second pair of projection layers includes a second near layer and a second far layer assigned to the second object, the second near layer different from the first near layer, and the second far layer different from the first far layer. 12. The at least one non-transitory computer readable medium of claim 9 , wherein the instructions are to cause one or more of the at least one processor circuit to order the first pair of projection layers before the second pair of projection layers when the first object is closer to the projections plane than the second object. 13. The at least one non-transitory computer readable medium of claim 9 , wherein the patch is a first patch associated with the first object identifier, and the instructions are to cause one or more of the at least one processor circuit to generate a second patch based on at least one projection layer of the second pair of projection layers, the second patch associated with the second object identifier. 14. The at least one non-transitory computer readable medium of claim 13 , wherein the instructions are to cause one or more of the at least one processor circuit to stack the first patch before the second patch when the first object is closer to the projection plane than the second object. 15. A video encoding method comprising: projecting, by at least one processor circuit programmed by at least one instruction, a first portion of point cloud data associated with an object onto a first pair of projection layers assigned to the object, the first pair of projection layers associated with a first one of a plurality of projection planes, the first pair of projection layers tagged with an identifier of the object to associate the first pair of projection layers with a first object; projecting, by one or more of the at least one processor circuit, a second portion of point cloud data associated with a background onto a second pair of projection layers assigned to the background, the second pair of projection layers associated with the firs

Assignees

Inventors

Classifications

  • by performing operations within image blocks; by using histograms, e.g. histogram of oriented gradients [HoG]; by summing image-intensity values; Projection analysis · CPC title

  • using video object coding · CPC title

  • specially adapted for multi-view video sequence encoding · CPC title

  • G06T9/00Primary

    Image coding (bandwidth or redundancy reduction for static pictures H04N1/41; coding or decoding of static colour picture signals H04N1/64; methods or arrangements for coding, decoding, compressing or decompressing digital video signals H04N19/00) · CPC title

  • G06T9/001Primary

    Model-based coding, e.g. wire frame · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12230002B2 cover?
Methods, apparatus, systems and articles of manufacture for object-based volumetric video coding are disclosed. An example apparatus disclosed herein includes a point annotator to receive point cloud data associated with an object and annotate points of the point cloud data with an object identifier of the object. The disclosed example apparatus also includes a projector to project the point cl…
Who is the assignee on this patent?
Intel Corp
What technology area does this patent fall under?
Primary CPC classification G06T9/00. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 18 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).