Apparatuses and methods for training a machine learning network for use with a time-of-flight camera

US2021166124A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2021166124-A1
Application numberUS-202017110330-A
CountryUS
Kind codeA1
Filing dateDec 3, 2020
Priority dateDec 3, 2019
Publication dateJun 3, 2021
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present disclosure proposes a concept of training a machine learning network for use with a ToF camera. Based on a predefined synthetic scene, it is simulated a ground truth time resolved illumination return signal for a light pulse emitted from the ToF camera to the synthetic scene and scattered back from the synthetic scene to the ToF camera. The synthetic scene comprises a plurality of scene points with known distances between each of the scene points and the ToF camera. Based on the ground truth time resolved illumination return signal and a simulation model of the ToF camera, it is then simulated an output signal of at least one ToF pixel capturing the synthetic scene. Based on the simulated output signal of the ToF pixel and the ground truth time resolved illumination return signal, weights of the machine learning network are adjusted to cause the machine learning network to map the simulated output signal of the ToF pixel to an output time resolved illumination return signal approximating the ground truth time resolved illumination return signal.

First claim

Opening claim text (preview).

1 . Method of training a machine learning network for use with a ToF camera, the method comprising: simulating, based on at least one predefined synthetic scene, a ground truth time resolved illumination return signal for a light pulse emitted from the ToF camera to the synthetic scene and scattered back from the synthetic scene to the ToF camera, wherein the synthetic scene comprises a plurality of scene points with known distances between each of the scene points and the ToF camera; simulating, based on the ground truth time resolved illumination return signal and a simulation model of the ToF camera, an output signal of at least one ToF pixel capturing light from the synthetic scene; and adjusting, based on the simulated output signal of the ToF pixel and the ground truth time resolved illumination return signal, weights of the machine learning network to cause the machine learning network to map the simulated output signal of the ToF pixel to an output time resolved illumination return signal approximating the ground truth time resolved illumination return signal. 2 . The method of claim 1 , wherein simulating the output signal comprises simulating the output signal of the ToF pixel for a plurality of different light modulation frequencies to obtain a respective output signal for each of the different light modulation frequencies, and wherein the weights of the machine learning network are adjusted based on the simulated output signals for the different light modulation frequencies as input to the machine learning network and based on the ground truth multipath propagation delay profile. 3 . The method of claim 1 , wherein simulating the ground truth time resolved illumination return signal comprises, based on transient rendering, simulating non-line-of-sight light components scattered back from the synthetic scene to the ToF camera. 4 . The method of claim 1 , wherein adjusting the weights of the machine learning network comprises minimizing a difference between the output time resolved illumination return signal and the ground truth time resolved illumination return signal. 5 . The method of claim 4 , wherein the difference is measured according to the earth mover's distance, EMD. 6 . The method of claim 1 , wherein the method of training is repeated for each of the plurality of scene points and/or a plurality of different predefined synthetic scenes. 7 . The method of claim 1 , wherein the simulation model of the ToF camera models specific camera properties including at least one of exact illumination shape, sensor sensitivity shape, and noise levels. 8 . The method of claim 1 , wherein the machine learning network comprises a convolutional network architecture. 9 . The method of claim 1 , wherein the ToF camera is an indirect ToF camera. 10 . Method for operating a ToF camera, the method comprising: capturing a scene with a ToF pixel matrix of the ToF camera; feeding output signals of the ToF pixel matrix into a machine learning network trained according to the method of claim 1 ; and providing, at an output of the machine learning network, time resolved illumination return signals associated with pixels of the ToF pixel matrix. 11 . A computer program having a program code for performing a method of claim 1 , when the computer program is executed on a programmable hardware device. 12 . A ToF camera comprising a machine learning network trained according to the method of claim 1 . 13 . The ToF camera according to claim 12 , wherein an input layer of the machine learning network is coupled to an output of a ToF pixel matrix of the ToF camera. 14 . The ToF camera according to claim 12 , wherein the machine learning network is configured to output a time resolved illumination return signal associated with each pixel of the ToF pixel matrix. 15 . The ToF camera according to claim 12 , wherein the ToF camera is an indirect ToF camera.

Assignees

Inventors

Classifications

  • Generative networks · CPC title

  • Convolutional networks [CNN, ConvNet] · CPC title

  • Supervised learning · CPC title

  • Adversarial learning · CPC title

  • Means for monitoring or calibrating · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2021166124A1 cover?
The present disclosure proposes a concept of training a machine learning network for use with a ToF camera. Based on a predefined synthetic scene, it is simulated a ground truth time resolved illumination return signal for a light pulse emitted from the ToF camera to the synthetic scene and scattered back from the synthetic scene to the ToF camera. The synthetic scene comprises a plurality of s…
Who is the assignee on this patent?
Sony Semiconductor Solutions Corp
What technology area does this patent fall under?
Primary CPC classification G06N3/08. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Jun 03 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).