Multi-atlas encapsulation of immersive media

US12137225B2 · US · B2

Patent metadata
Field               Value
Publication number  US-12137225-B2
Application number  US-202217863049-A
Country             US
Kind code           B2
Filing date         Jul 12, 2022
Priority date       Nov 30, 2020
Publication date    Nov 5, 2024
Grant date          Nov 5, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods, apparatus, and systems that provide flexible encapsulation of volumetric video data in media files are disclosed. In one example aspect, a method for video processing includes receiving three-dimensional (3D) volumetric video data and encoding the three-dimensional volumetric video data into a media file. The 3D volumetric video data corresponds to one or more atlases, each comprising atlas data and one or more two-dimensional (2D) components. The atlas data and the one or more 2D components are stored in one or more media tracks in the media file.
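
The encapsulation described in the abstract can be pictured with a small data model. The sketch below is illustrative only and is not the file format defined by the patent: the class names (Atlas, MediaTrack, MediaFile), the encapsulate function, and the choice of one track per atlas data block and one track per 2D component are all assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Atlas:
    """One atlas: its atlas data plus one or more coded 2D components."""
    atlas_id: int
    atlas_data: bytes               # hypothetical stand-in for atlas metadata
    components: Dict[str, bytes]    # component name -> coded 2D video


@dataclass
class MediaTrack:
    """A single media track inside the (hypothetical) media file."""
    track_id: int
    kind: str                       # "atlas" or a component name
    atlas_id: int
    payload: bytes


@dataclass
class MediaFile:
    tracks: List[MediaTrack] = field(default_factory=list)


def encapsulate(atlases: List[Atlas]) -> MediaFile:
    """Store each atlas's data and 2D components in media tracks.

    Assumption: one track per atlas data block and one per component.
    The abstract only requires "one or more media tracks".
    """
    media_file = MediaFile()
    next_id = 1
    for atlas in atlases:
        media_file.tracks.append(
            MediaTrack(next_id, "atlas", atlas.atlas_id, atlas.atlas_data))
        next_id += 1
        for name, coded in atlas.components.items():
            media_file.tracks.append(
                MediaTrack(next_id, name, atlas.atlas_id, coded))
            next_id += 1
    return media_file
```

Under these assumptions, a file built from two atlases with three and two components would hold seven tracks, each tagged with the atlas it belongs to.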

First claim

Opening claim text (preview).

What is claimed is:

1. A method for video processing, comprising: receiving three-dimensional (3D) volumetric video data; and encoding the 3D volumetric video data into a media file according to a file format, wherein the 3D volumetric video data corresponds to atlases, each atlas comprising atlas data and one or more two-dimensional (2D) components, and wherein the file format specifies that the atlas data and the one or more 2D components of each atlas are stored in one or more media tracks in the media file, wherein the file format further specifies at least one 2D component of the one or more 2D components that corresponds to an atlas is stored in a component track of the one or more media tracks, and wherein a part of the atlas data that the at least one 2D component corresponds to and is not common to all of the atlases is stored in the component track.

2. The method of claim 1, wherein the one or more 2D components for each atlas comprise at least a geometry component, an occupancy component, or an attribute component.

3. The method of claim 1, wherein the atlas data comprises projection relationship of projecting the 3D volumetric video data from a 3D space to a 2D plane.

4. The method of claim 1, wherein the one or more media tracks includes a volumetric visual track with a specific sample entry type, the volumetric visual track comprising parameter information common to all of the atlases.

5. The method of claim 1, wherein the one or more media tracks are organized into multiple groups, each group corresponding to the atlases arranged according to a grouping criterion associated with the 3D volumetric video data.

6. A method for video processing, comprising: decoding a media file that represents three-dimensional (3D) volumetric video data according to a file format, wherein the 3D volumetric video data corresponds to atlases, each atlas comprising atlas data and one or more two-dimensional (2D) components, and wherein the file format specifies that the atlas data and the one or more 2D components of each atlas are stored in one or more media tracks in the media file, wherein the file format further specifies at least one 2D component of the one or more 2D components that corresponds to an atlas is stored in a component track of the one or more media tracks, and wherein a part of the atlas data that the at least one 2D component corresponds to and is not common to all of the atlases is stored in the component track; and reconstructing the 3D volumetric video data based on at least one media track of the one or more media tracks.

7. The method of claim 6, wherein the one or more 2D components for each atlas comprise at least a geometry component, an occupancy component, or an attribute component.

8. The method of claim 6, wherein the atlas data comprises projection relationship of projecting the 3D volumetric video data from a 3D space to a 2D plane.

9. The method of claim 6, wherein the one or more media tracks includes a volumetric visual track with a specific sample entry type, the volumetric visual track comprising parameter information common to all of the atlases.

10. The method of claim 6, wherein the one or more media tracks are organized into multiple groups, each group corresponding to the atlases arranged according to a grouping criterion associated with the 3D volumetric video data.

11. A video processing apparatus comprising a processor and a non-transitory memory with instructions thereon, wherein the instructions upon execution by the processor, cause the processor to: receive three-dimensional (3D) volumetric video data; and encode the 3D volumetric video data into a media file according to a file format, wherein the 3D volumetric video data corresponds to atlases, each atlas comprising atlas data and one or more two-dimensional (2D) components, and wherein the file format specifies that the atlas data and the one or more 2D components of each atlas are stored in one or more media tracks in the media file, wherein the file format further specifies at least one 2D component of the one or more 2D components that corresponds to an atlas is stored in a component track of the one or more media tracks, and wherein a part of the atlas data that the at least one 2D component corresponds to and is not common to all of the atlases is stored in the component track.

12. The apparatus of claim 11, wherein the one or more 2D components for each atlas comprise at least a geometry component, an occupancy component, or an attribute component.

13. The apparatus of claim 11, wherein the atlas data comprises projection relationship of projecting the 3D volumetric video data from a 3D space to a 2D plane.

14. The apparatus of claim 11, wherein the one or more media tracks includes a volumetric visual track with a specific sample entry type, the volumetric visual track comprising parameter information common to all of the atlases.

15. The apparatus of claim 11, wherein the one or more media tracks are organized into multiple groups, each group corresponding to the atlases arranged according to a grouping criterion associated with the 3D volumetric video data.

16. A video processing apparatus comprising a processor and a non-transitory memory with instructions thereon, wherein the instructions upon execution by the processor, cause the processor to: decode a media file that represents three-dimensional (3D) volumetric video data according to a file format, wherein the 3D volumetric video data corresponds to atlases, each atlas comprising atlas data and one or more two-dimensional (2D) components, and wherein the file format specifies that the atlas data and the one or more 2D components of each atlas are stored in one or more media tracks in the media file, wherein the file format further specifies at least one 2D component of the one or more 2D components that corresponds to an atlas is stored in a component track of the one or more media tracks, and wherein a part of the atlas data that the at least one 2D component corresponds to and is not common to all of the atlases is stored in the component track; and reconstruct the 3D volumetric video data based on at least one media track of the one or more media tracks.

17. The apparatus of claim 16, wherein the one or more 2D components for each atlas comprise at least a geometry component, an occupancy component, or an attribute component.

18. The apparatus of claim 16, wherein the atlas data comprises projection relationship of projecting the 3D volumetric video data from a 3D space to a 2D plane.

19. The apparatus of claim 16, wherein the one or more media tracks includes a volumetric visual track with a specific sample entry type, the volumetric visual track comprising parameter information common to all of the atlases.

20. The apparatus of claim 16, wherein the one or more media tracks are organized into multiple groups, each group corresponding to the atlases arranged according to a grouping criterion associated with the 3D volumetric video data.
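
The independent claims and claims 4, 9, 14, and 19 distinguish atlas data that is common to all atlases from atlas data that is not. One way to picture that split is the rough sketch below. It is not the claimed format: the dictionary-based tracks, the function names, and parameter names such as "frame_width" and "view_id" are hypothetical. As a simplification, the atlas-specific part is copied into every component track of its atlas, while the shared part goes into a single volumetric track.

```python
from typing import Any, Dict, List, Tuple

Params = Dict[str, Any]


def split_common(atlas_params: Dict[int, Params]) -> Tuple[Params, Dict[int, Params]]:
    """Split per-atlas parameters into a shared part and per-atlas remainders."""
    all_params = list(atlas_params.values())
    if not all_params:
        return {}, {}
    first = all_params[0]
    # A parameter is "common" only if every atlas carries the same value.
    common = {k: v for k, v in first.items()
              if all(k in p and p[k] == v for p in all_params)}
    specific = {aid: {k: v for k, v in p.items() if k not in common}
                for aid, p in atlas_params.items()}
    return common, specific


def build_tracks(atlas_params: Dict[int, Params],
                 components: Dict[int, Dict[str, bytes]]) -> List[dict]:
    """Place common parameters in one volumetric track and the rest in component tracks."""
    common, specific = split_common(atlas_params)
    tracks: List[dict] = [{"type": "volumetric", "common_params": common}]
    for atlas_id, comps in components.items():
        for name, coded in comps.items():
            tracks.append({
                "type": "component",
                "atlas_id": atlas_id,
                "component": name,
                "atlas_params": specific[atlas_id],  # not common to all atlases
                "data": coded,
            })
    return tracks


def recover_params(tracks: List[dict], atlas_id: int) -> Params:
    """Decoder side: merge shared and atlas-specific parameters for one atlas."""
    common = next(t["common_params"] for t in tracks if t["type"] == "volumetric")
    own = next(t["atlas_params"] for t in tracks
               if t["type"] == "component" and t["atlas_id"] == atlas_id)
    return {**common, **own}
```

In this sketch a reader that only needs one atlas can open the volumetric track plus that atlas's component tracks and still recover the full parameter set, which is in the spirit of the "at least one media track" wording in the decoding claims.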

Assignees

ZTE Corp

Inventors

Classifications

  • Embedding additional information in the video signal during the compression process (H04N19/517, H04N19/68, H04N19/70 take precedence) · CPC title

  • characterised by memory arrangements (H04N19/433 takes precedence) · CPC title

  • Sampling, masking or truncation of coding units, e.g. adaptive resampling, frame skipping, frame interpolation or high-frequency transform coefficient masking · CPC title

  • H04N19/136 · Primary

    Incoming video signal characteristics or properties · CPC title

  • H04N19/597 · Primary

    specially adapted for multi-view video sequence encoding · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12137225B2 cover?
Methods, apparatus, and systems that provide flexible encapsulation of volumetric video data in media files are disclosed. In one example aspect, a method for video processing includes receiving three-dimensional (3D) volumetric video data and encoding the three-dimensional volumetric video data into a media file. The 3D volumetric video data corresponds to one or more atlases, each comprising …
Who is the assignee on this patent?
ZTE Corp
What technology area does this patent fall under?
Primary CPC classification H04N19/136. Mapped technology areas include Electricity.
When was this patent published?
Publication date Nov 5, 2024 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).