Depth from time-of-flight using machine learning

US10311378B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10311378-B2
Application numberUS-201715672261-A
CountryUS
Kind codeB2
Filing dateAug 8, 2017
Priority dateMar 13, 2016
Publication dateJun 4, 2019
Grant dateJun 4, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A depth detection apparatus is described which has a memory storing raw time-of-flight sensor data received from a time-of-flight sensor. The depth detection apparatus also has a trained machine learning component having been trained using training data pairs. A training data pair comprises at least one simulated raw time-of-flight sensor data value and a corresponding simulated ground truth depth value. The trained machine learning component is configured to compute in a single stage, for an item of the stored raw time-of-flight sensor data, a depth value of a surface depicted by the item, by pushing the item through the trained machine learning component.

First claim

Opening claim text (preview).

The invention claimed is: 1. A depth detection apparatus comprising: a memory storing raw time-of-flight sensor data received from a time-of-flight sensor; and a processor comprising a trained machine learning component having been trained using training data pairs, a training data pair comprising at least one simulated raw time-of-flight sensor data value and a corresponding simulated ground truth depth value; the trained machine learning component configured to compute, in a single stage, for an item of the stored raw time-of-flight sensor data, a depth value of a surface depicted by the item by pushing the item through the trained machine learning component. 2. The apparatus of claim 1 , the trained machine learning component having been trained using simulated raw time-of-flight sensor data values which incorporate simulated multi-path interference. 3. The apparatus of claim 1 the trained machine learning component having been trained using simulated raw time-of-flight sensor data values computed using a computer graphics renderer which simulates multi-path interference. 4. The apparatus of claim 1 the trained machine learning component having been trained using simulated raw time-of-flight sensor data values comprising, for an individual pixel, weighted intensity values at different depths potentially depicted by the pixel. 5. The apparatus of claim 1 the trained machine learning component having been trained using simulated raw time-of-flight sensor data values where information about an exposure profile of the time-of-flight sensor is combined with the simulated raw time-of-flight sensor data values. 6. The apparatus of claim 1 the trained machine learning component having been trained using simulated raw time-of-flight sensor data values where information about sensor noise of the time-of-flight sensor is combined with the simulated raw time-of-flight sensor data values. 7. The apparatus of claim 1 the trained machine learning component having been trained using simulated raw time-of-flight sensor data values computed using a computer graphics renderer from a plurality of instances of a parametric 3D environment model, where the instances of the parametric 3D environment model are computer generated automatically at random. 8. The apparatus of claim 7 where parameters of the parametric 3D environment model comprise one or more of: geometry of an object in the 3D environment model, position of an object in the 3D environment model, presence of an object in the 3D environment model, orientation of an object in the 3D environment model, surface materials and reflectivity, ambient illumination. 9. The apparatus of claim 1 wherein a training data pair comprises a frame of simulated raw time-of-flight sensor data values and a corresponding simulated ground truth depth map. 10. The apparatus of claim 1 , the trained machine learning component having been trained using simulated raw time-of-flight sensor data values computed using a computer graphics renderer for a plurality of randomly selected viewpoints of the time-of-flight sensor, and where any of the viewpoints which are within a threshold distance of a surface in a 3D environment model used by the computer graphics renderer are omitted. 11. The apparatus of claim 1 the trained machine learning component having been trained using simulated raw time-of-flight sensor data values aggregated over a neighborhood of a pixel, where the neighborhood is a spatial neighborhood, or a temporal neighborhood, or a spatial and temporal neighborhood. 12. The apparatus of claim 1 where the trained machine learning component is a pixel independent regressor. 13. The apparatus of claim 1 where the trained machine learning component is regressor which takes into account relationships between pixels of the stored time-of-flight sensor data. 14. The apparatus of claim 1 where the trained machine learning component is a convolutional neural network and where each training data pair comprises a frame of simulated raw time-of-flight sensor data and a ground truth depth map. 15. The apparatus of claim 1 where the trained machine learning component is at least partially implemented using hardware logic selected from any one or more of: a field-programmable gate array, an application-specific integrated circuit, an application-specific standard product, a system-on-a-chip, a complex programmable logic device, a graphics processing unit. 16. One or more computer-readable storage devices comprising computer-executable instructions that, when executed by one or more processors, cause the one or more processors to perform the following operations storing frames of raw time-of-flight sensor data received from a time-of-flight sensor; and operating a trained machine learning component having been trained using training data pairs, a training data pair comprising at least one simulated raw time-of-flight sensor data value and a corresponding simulated ground truth depth value; wherein operating the trained machine learning component comprises computing, in a single stage, for an item of the stored raw time-of-flight sensor data, a depth value of a surface depicted by the item, by pushing the item through the trained machine learning component. 17. The one or more computer-readable storage device of claim 16 where operating the trained machine learning component further comprises operating the trained machine learning component in real time by computing the depth value at a rate which is equivalent to or faster than a frame rate of the time-of-flight sensor. 18. The one or more computer-readable storage device of claim 16 where the trained machine learning component comprises a convolutional neural network. 19. The one or more computer-readable storage device of claim 16 where the trained machine learning component comprises a pixel independent regressor which is a regressor that does not take into account relationships between pixels of a time-of-flight sensor frame. 20. A computer-implemented method comprising: storing, at a memory, raw time-of-flight sensor data received from a time-of-flight sensor; and operating, by a processor, a trained machine learning component having been trained using training data pairs, a training data pair comprising at least one simulated raw time-of-flight sensor data value and a corresponding simulated ground truth depth value; wherein operating the trained machine learning component comprises computing, in a single stage, for an item of the stored raw time-of-flight sensor data, a depth value of a surface depicted by the item, by pushing the item through the trained machine learning component.

Assignees

Inventors

Classifications

  • Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting · CPC title

  • Generating training patterns; Bootstrap methods, e.g. bagging or boosting · CPC title

  • from laser ranging, e.g. using interferometry; from the projection of structured light · CPC title

  • Depth or shape recovery · CPC title

  • Evaluating distance, position or velocity data · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10311378B2 cover?
A depth detection apparatus is described which has a memory storing raw time-of-flight sensor data received from a time-of-flight sensor. The depth detection apparatus also has a trained machine learning component having been trained using training data pairs. A training data pair comprises at least one simulated raw time-of-flight sensor data value and a corresponding simulated ground truth de…
Who is the assignee on this patent?
Microsoft Technology Licensing Llc
What technology area does this patent fall under?
Primary CPC classification G06N20/00. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jun 04 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 11 related publications on this page (citations in our corpus or others sharing the same primary CPC).