Systems and methods for temporally consistent depth map generation

US2021358155A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2021358155-A1
Application numberUS-202015931541-A
CountryUS
Kind codeA1
Filing dateMay 13, 2020
Priority dateMay 13, 2020
Publication dateNov 18, 2021
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods are provided for performing temporally consistent depth map generation by implementing acts of obtaining a first stereo pair of images of a scene associated with a first timepoint and a first pose, generating a first depth map of the scene based on the first stereo pair of images, obtaining a second stereo pair of images of the scene associated with at a second timepoint and a second pose, generating a reprojected first depth map by reprojecting the first depth map to align the first depth map with the second stereo pair of images, and generating a second depth map that corresponds to the second stereo pair of images using the reprojected first depth map.

First claim

Opening claim text (preview).

What is claimed is: 1 . A system for generating temporally consistent depth maps, comprising: one or more processors; and one or more hardware storage devices having stored computer-executable instructions that are operable, when executed by the one or more processors, to cause the system to: obtain a first stereo pair of images of a scene captured at a first timepoint and with a first pose associated with the system; generate a first depth map of the scene based on the first stereo pair of images; obtain a second stereo pair of images of the scene, the second stereo pair of images being captured at a second timepoint and with a second pose associated with the system; generate a reprojected first depth map by reprojecting the first depth map to align the first depth map with the second stereo pair of images; and generate a second depth map that corresponds to the second stereo pair of images by performing stereo matching on the second stereo pair of images using the reprojected first depth map, thereby improving temporal consistency of the first depth map with the second depth map, and the overall depth map quality. 2 . The system of claim 1 , further comprising: a stereo pair of cameras, wherein the stereo pair of cameras captures the first stereo pair of images and the second stereo pair of images. 3 . The system of claim 1 , further comprising: a head tracking system, comprising: at least one head tracking camera; an accelerometer; a gyroscope; and a magnetometer, wherein the first pose associated with the system and the second pose associated with the system are based on measurements obtained by the head tracking system. 4 . The system of claim 1 , wherein the first pose associated with the system is different than the second pose associated with the system. 5 . The system of claim 1 , wherein generating the second depth map using the reprojected first depth map includes implementing a temporal consistency term into a cost function for performing stereo matching on the second stereo pair of images. 6 . The system of claim 5 , wherein the temporal consistency term applies a cost bonus for pixels of the second depth map that share a same or similar disparity value with corresponding pixels of the reprojected first depth map. 7 . The system of claim 1 , wherein the first stereo pair of images is a downsampled first stereo pair of images and the first depth map of the scene is based on the downsampled first stereo pair of images, and wherein the second stereo pair of images is a downsampled second stereo pair of images and the second depth map is generated by performing stereo matching on the downsampled second stereo pair of images. 8 . The system of claim 1 , wherein the first depth map of the scene is one of a plurality of first depth maps of the scene, each of the plurality of first depth maps having a different image size. 9 . The system of claim 8 , wherein the second stereo pair of images is one of a plurality of second stereo pairs of images, the second stereo pair of images having a lowest image size of the plurality of second stereo pairs of images. 10 . The system of claim 9 , wherein the computer-executable instructions are further operable to cause the system to: generate an upsampled second depth map by applying an edge-preserving filter to the second depth map, wherein the edge-preserving filter utilizes the second depth map, at least one of the plurality of first depth maps, and at least one of the plurality of second stereo pairs of images to generate the upsampled second depth map. 11 . The system of claim 10 , wherein the edge-preserving filter is a joint bilateral filter. 12 . The system of claim 1 , wherein the computer-executable instructions are further operable, when executed by the one or more processors, to cause the system to: reproject depth points based on the second depth map to correspond to a user perspective. 13 . A method for generating temporally consistent depth maps, comprising: obtaining a first stereo pair of images of a scene captured at a first timepoint and with a first pose associated with a computer system; generating a first depth map of the scene based on the first stereo pair of images; obtaining a second stereo pair of images of the scene, the second stereo pair of images being captured at a second timepoint and with a second pose associated with the computer system; generating a reprojected first depth map by reprojecting the first depth map to align the first depth map with the second stereo pair of images; and generating a second depth map that corresponds to the second stereo pair of images by performing stereo matching on the second stereo pair of images using the reprojected first depth map, thereby improving temporal consistency of the first depth map with the second depth map, and the overall depth map quality. 14 . The method of claim 13 , wherein the first pose associated with the computer system is different than the second pose associated with the computer system. 15 . The method of claim 13 , wherein generating the second depth map using the reprojected first depth map includes implementing a temporal consistency term into a cost function for performing stereo matching on the second stereo pair of images. 16 . The method of claim 15 , wherein the temporal consistency term applies a cost bonus for pixels of the second depth map that share a same or similar disparity value with corresponding pixels of the reprojected first depth map. 17 . The method of claim 13 , wherein the first depth map of the scene is one of a plurality of first depth maps of the scene, each of the plurality of first depth maps having a different image size. 18 . The method of claim 17 , wherein the second stereo pair of images is one of a plurality of second stereo pairs of images, the second stereo pair of images having a lowest image size of the plurality of second stereo pairs of images. 19 . The method of claim 18 , further comprising: generating an upsampled second depth map by applying an edge-preserving filter to the second depth map, wherein the edge-preserving filter utilizes the second depth map, at least one of the plurality of first depth maps, and at least one of the plurality of second stereo pairs of images to generate the upsampled second depth map. 20 . One or more hardware storage devices having stored thereon computer-executable instructions, the computer-executable instructions being executable by one or more processors of a computer system to cause the computer system to: obtain a first stereo pair of images of a scene captured at a first timepoint and with a first pose associated with the computer system; generate a first depth map of the scene based on the first stereo pair of images; obtain a second stereo pair of images of the scene, the second stereo pair of images being captured at a second timepoint and with a second pose associated with the computer system; generate a reprojected first depth map by reprojecting the first depth map to align the first depth map with the second stereo pair of images; and generate a second depth map that corresponds to the second stereo pair of images by performing stereo matching on the second stereo pair of images using the reprojected first depth map, thereby improving temporal consistency of the first depth map with the second depth map, and the overall depth map quality.

Assignees

Inventors

Classifications

  • Edge-driven scaling; Edge-based scaling · CPC title

  • G06T7/593Primary

    from stereo images · CPC title

  • Stereoscopic video; Stereoscopic image sequence · CPC title

  • Determining position or orientation of objects or cameras (camera calibration G06T7/80) · CPC title

  • Adjusting depth or disparity · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2021358155A1 cover?
Systems and methods are provided for performing temporally consistent depth map generation by implementing acts of obtaining a first stereo pair of images of a scene associated with a first timepoint and a first pose, generating a first depth map of the scene based on the first stereo pair of images, obtaining a second stereo pair of images of the scene associated with at a second timepoint and…
Who is the assignee on this patent?
Microsoft Technology Licensing Llc
What technology area does this patent fall under?
Primary CPC classification G06T7/593. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Nov 18 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).