Method and apparatus for downscaling depth data for view plus depth data compression

US9736455B2 · US · B2

Patent metadata
Publication number: US-9736455-B2
Application number: US-201414497687-A
Country: US
Kind code: B2
Filing date: Sep 26, 2014
Priority date: Jun 30, 2014
Publication date: Aug 15, 2017
Grant date: Aug 15, 2017

Abstract

Official abstract text for this publication.

A method, apparatus, and computer program product are disclosed for compression of a 2D-plus-depth representation based on spatial downsampling of an initial depth map. By utilizing the color image accompanying the initial depth map, it is possible to infer structural information that refines and reconstructs the initial depth map out of a heavily subsampled version of the depth map. In the process, no indexing of the exact positions of the subsampled depth values is needed, which leads to very efficient compression. An example method causes segmentation of the color image into a set of super-pixel segments, and causes downsampling of the initial depth map based on the set of super-pixel segments. The method subsequently causes generation and storage of a compressed representation based on the segmented color image and the downsampled depth map. A corresponding apparatus and computer program product are also provided.
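The central idea of the abstract, that no pixel positions need to be stored because the decoder can repeat the segmentation on the color image, can be illustrated with a small sketch. This is not the patented method: a crude gray-level quantization stands in for super-pixel segmentation (and, unlike real super-pixels, it does not enforce spatial connectivity), the per-segment mean stands in for the depth intensity value, and the color image is assumed to be a single-channel 8-bit grid. The function names `segment_by_color`, `downsample_depth`, and `reconstruct_depth` are hypothetical.

```python
from collections import defaultdict


def segment_by_color(color, levels=4):
    """Label each pixel by its quantized gray level (stand-in for super-pixels).

    Labels are numbered in raster-scan order of first appearance, so an
    encoder and a decoder that see the same color image derive the same labels.
    """
    step = 256 // levels
    label_of = {}
    labels = []
    for row in color:
        out = []
        for value in row:
            key = value // step
            if key not in label_of:
                label_of[key] = len(label_of)
            out.append(label_of[key])
        labels.append(out)
    return labels


def downsample_depth(depth, labels):
    """Keep one depth value per segment (here: the mean), ordered by label."""
    sums = defaultdict(float)
    counts = defaultdict(int)
    for depth_row, label_row in zip(depth, labels):
        for d, label in zip(depth_row, label_row):
            sums[label] += d
            counts[label] += 1
    return [sums[label] / counts[label] for label in range(len(sums))]


def reconstruct_depth(values, labels):
    """Rebuild a full-resolution depth map by painting each segment's value."""
    return [[values[label] for label in row] for row in labels]


# Toy input: a dark region on the left and a bright region on the right.
color = [[10, 10, 200, 200],
         [10, 10, 200, 200]]
depth = [[10, 12, 50, 52],
         [8, 10, 48, 50]]

labels = segment_by_color(color)           # encoder side
values = downsample_depth(depth, labels)   # the only depth data transmitted

decoder_labels = segment_by_color(color)   # decoder repeats the segmentation
restored = reconstruct_depth(values, decoder_labels)
```

In this toy case eight depth samples shrink to two values, and the decoder places them correctly without any stored coordinates because it derives the same label map from the color image.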

First claim

Opening claim text (preview).

What is claimed is:

1. A method comprising: receiving a two-dimensional (2D)-plus-depth representation, the 2D-plus-depth representation comprising a color image and an original depth map; causing segmentation of the color image into a set of super-pixel segments that group data of similar color, pattern, or texture; causing, by a processor, downsampling of the original depth map based on the set of super-pixel segments; causing generation of a compressed representation based on the segmented color image and the downsampled depth map, wherein the compressed representation includes one or more segmentation parameters associated with the segmentation of the color image and one or more downsampling parameters associated with the downsampling of the original depth map, wherein the compressed representation includes a binary tree that comprises depth intensity values and compressed representations for different layers of the binary tree; and causing storage of the compressed representation.

2. The method of claim 1, wherein the downsampling of the original depth map comprises: identifying portions of the original depth map that correspond to segments of the set of super-pixel segments; for each identified portion of the original depth map, identifying a depth intensity value, and modifying the portion of the original depth map based on the identified depth intensity value, wherein the one or more downsampling parameters include the identified depth intensity values.

3. The method of claim 2, wherein, in an instance in which the original depth map comprises floating point data, identifying the depth intensity value comprises: applying a thin-plate spline fitting function to the portion of the original depth map.

4. The method of claim 2, further comprising: causing reconstruction of depth data from the downsampled depth map using an enhancement filter; and causing determination of filter parameters of the enhancement filter and depth intensity values that maximize a quality of the reconstruction, wherein the identified depth intensity values comprise the determined depth intensity values, and wherein the one or more downsampling parameters further include the determined filter parameters.

5. The method of claim 1, further comprising: calculating a size of the compressed representation and a quality of the compressed representation; in an instance in which the calculated size fails to satisfy a size threshold or the calculated quality fails to satisfy a quality threshold, causing segmentation of a segment from the set of segments into sub-segments, causing further downsampling of the downsampled depth map based on the set of sub-segments, causing updating of the compressed representation based on the further downsampling, and re-calculating the size of the compressed representation and the quality of the compressed representation.

6. The method of claim 1, further comprising: generating the binary tree to capture index information associated with the segmentation of the color image and the downsampled depth map.

7. The method of claim 6, further comprising: causing application of predictive and arithmetic coding to the downsampled depth map prior to generation of the compressed representation.

8. The method of claim 1, further comprising: causing embedding of the downsampled depth map in the compressed representation.

9. The method of 8, further comprising: causing quantization of the color image by removing a set of least significant bits associated with the color image, wherein the downsampled depth map is embedded in the quantized color image of the compressed representation.

10. The method of claim 1, wherein the segmentation parameters and the downsampling parameters included in the compressed representation are located in a header of the compressed representation.

11. The method of claim 1, wherein the color image is segmented into the set of super-pixel segments via simple linear iterative clustering or super-pixel extraction via energy-driven sampling.

12. An apparatus comprising at least one processor and at least one memory including computer program code, the at least one memory and the computer program code configured to, with the at least one processor, cause the apparatus to: receive a two-dimensional (2D)-plus-depth representation, the 2D-plus-depth representation comprising a color image and an original depth map; segment the color image into a set of super-pixel segments that group data of similar color, pattern, or texture; downsample the original depth map based on the set of super-pixel segments; generate a compressed representation based on the segmented color image and the downsampled depth map, wherein the compressed representation includes one or more segmentation parameters associated with the segmentation of the color image and one or more downsampling parameters associated with the downsampling of the original depth map, wherein the compressed representation includes a binary tree that comprises depth intensity values and compressed representations for different layers of the binary tree; and cause storage of the compressed representation.

13. The apparatus of claim 12, wherein the downsampling the original depth map comprises: identifying portions of the original depth map that correspond to segments of the set of super-pixel segments; for each identified portion of the original depth map, identifying a depth intensity value, and modifying the portion of the original depth map based on the identified depth intensity value, wherein the one or more downsampling parameters include the identified depth intensity values.

14. The apparatus of claim 13, wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus to: reconstruct depth data from the downsampled depth map using an enhancement filter; and determine filter parameters of the enhancement filter and depth intensity values that maximize a quality of the reconstruction, wherein the identified depth intensity values comprise the determined depth intensity values, and wherein the one or more downsampling parameters further include the determined filter parameters.

15. The apparatus of claim 12, wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus to: calculate a size of the compressed representation and a quality of the compressed representation; in an instance in which the calculated size fails to satisfy a size threshold or the calculated quality fails to satisfy a quality threshold, segment a segment from the set of segments into sub-segments, further downsample the downsampled depth map based on the sub-segments, update the compressed representation based on the further downsampling, and re-calculate the size of the compressed representation and the quality of the compressed representation.

16. The apparatus of claim 12, wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus to: generate the binary tree to capture index information associated with the segmentation of the color image and the downsampled depth map; and causing application of predictive and arithmetic coding to the downsampled depth map prior to generation of the compressed representation.

17. The apparatus of claim 12, wherein the at least one memory and the computer program code are configured to, with the at least
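
Claim 9 describes quantizing the color image by removing least significant bits and embedding the downsampled depth map in the quantized image. A minimal sketch of that general technique follows, assuming 8-bit color samples in a flat list and a depth payload already serialized to bytes. The helper names `embed_bits` and `extract_bits` are hypothetical, and the sketch assumes the decoder learns the payload length by some other means (for example from a header); the patent's actual bit layout is not shown on this page.

```python
def embed_bits(samples, payload, n_bits=2):
    """Clear the n_bits least significant bits of each sample, then store
    the payload bits (most significant bit first) in the freed positions."""
    bits = [(byte >> shift) & 1 for byte in payload for shift in range(7, -1, -1)]
    capacity = len(samples) * n_bits
    if len(bits) > capacity:
        raise ValueError("payload does not fit in the freed bits")
    bits += [0] * (capacity - len(bits))      # zero-pad unused positions
    keep_mask = 0xFF & ~((1 << n_bits) - 1)   # e.g. 0b11111100 for n_bits=2
    out = []
    for i, sample in enumerate(samples):
        chunk = 0
        for bit in bits[i * n_bits:(i + 1) * n_bits]:
            chunk = (chunk << 1) | bit
        out.append((sample & keep_mask) | chunk)
    return out


def extract_bits(samples, n_bytes, n_bits=2):
    """Read n_bytes of payload back out of the low n_bits of each sample."""
    bits = []
    for sample in samples:
        for shift in range(n_bits - 1, -1, -1):
            bits.append((sample >> shift) & 1)
    payload = bytearray()
    for i in range(n_bytes):
        byte = 0
        for bit in bits[i * 8:(i + 1) * 8]:
            byte = (byte << 1) | bit
        payload.append(byte)
    return bytes(payload)


# Eight color samples carry two bytes of depth data in their two lowest bits.
color_samples = [255, 128, 64, 33, 7, 200, 90, 17]
depth_payload = bytes([10, 50])

embedded = embed_bits(color_samples, depth_payload)
recovered = extract_bits(embedded, len(depth_payload))
```

With `n_bits=2` each color sample changes by at most 3 levels, which is the quality cost paid for carrying the depth data inside the image itself.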

Assignees

Nokia Technologies Oy

Inventors

Classifications

  • Methods or arrangements for coding, decoding, compressing or decompressing digital video signals

  • according to rate distortion criteria (rate-distortion as a criterion for motion estimation H04N19/567)

  • H04N13/161 (primary): Encoding, multiplexing or demultiplexing different image signal components (for multi-view video sequence encoding H04N19/597)

  • Embedding additional information in the video signal during the compression process (H04N19/517, H04N19/68, H04N19/70 take precedence)

  • specially adapted for multi-view video sequence encoding

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9736455B2 cover?
A method, apparatus, and computer program product are disclosed for compression of a 2D-plus-depth representation based on spatial downsampling of an initial depth map. By utilizing the color image accompanying the initial depth map, it is possible to infer structural information that refines and reconstructs the initial depth map out of a heavily subsampled version of the depth map. In the pro…
Who is the assignee on this patent?
Nokia Technologies Oy
What technology area does this patent fall under?
Primary CPC classification H04N13/161. Mapped technology areas include Electricity.
When was this patent published?
Publication date: Aug 15, 2017 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).