Learning to simulate

US11518382B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11518382-B2
Application numberUS-201916696087-A
CountryUS
Kind codeB2
Filing dateNov 26, 2019
Priority dateSep 26, 2018
Publication dateDec 6, 2022
Grant dateDec 6, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method is provided for danger prediction. The method includes generating fully-annotated simulated training data for a machine learning model responsive to receiving a set of computer-selected simulator-adjusting parameters. The method further includes training the machine learning model using reinforcement learning on the fully-annotated simulated training data. The method also includes measuring an accuracy of the trained machine learning model relative to learning a discriminative function for a given task. The discriminative function predicts a given label for a given image from the fully-annotated simulated training data. The method additionally includes adjusting the computer-selected simulator-adjusting parameters and repeating said training and measuring steps responsive to the accuracy being below a threshold accuracy. The method further includes predicting a dangerous condition relative to a motor vehicle and providing a warning to an entity regarding the dangerous condition by applying the trained machine learning model to actual unlabeled data for the vehicle.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method for danger prediction, comprising: generating, by a hardware processor, fully-annotated simulated training data for a machine learning model responsive to receiving a set of computer-selected simulator-adjusting parameters, the fully-annotated simulated training data including a category for each of pixels included in training images included in the fully-annotated simulated training data; training, by the hardware processor, the machine learning model using reinforcement learning on the fully-annotated simulated training data; measuring, by the hardware processor, an accuracy of the trained machine learning model relative to learning a discriminative function for a given task, the discriminative function predicting a given label for a given image from the fully-annotated simulated training data; adjusting, by the hardware processor, the computer-selected simulator-adjusting parameters and repeating said training and measuring steps responsive to the accuracy being below a threshold accuracy; and predicting, by the hardware processor, a dangerous condition relative to a motor vehicle and providing a warning to an entity regarding the dangerous condition by applying the trained machine learning model to actual unlabeled data for the motor vehicle. 2. The computer-implemented method of claim 1 , further comprising capturing the actual data using one or more vehicle mounted cameras. 3. The computer-implemented method of claim 1 , wherein said adjusting step is skipped responsive to the accuracy being equal to or greater than a threshold accuracy. 4. The computer-implemented method of claim 3 , wherein a reward is provided responsive to the accuracy being equal to or greater than the threshold accuracy. 5. The computer-implemented method of claim 4 , wherein the reward quantifies an error value, wherein the computer-selected simulator-adjusting parameters are adjusted responsive to a magnitude of the error signal. 6. The computer-implemented method of claim 1 , wherein the parameters are scene parameters that define a probability distribution of a set of scenes. 7. The computer-implemented method of claim 1 , wherein said adjusting step comprising updating a probability distribution of the computer-selected simulator-adjusting parameters. 8. The computer-implemented method of claim 1 , wherein the simulator comprises a scene model implemented as a composition of various scene probability distributions in a graphical model. 9. The computer-implemented method of claim 8 , wherein the graphical model comprises a vehicle path topology comprising a number of lanes, a number of objects in the scene, sidewalks, and landmarks. 10. The computer-implemented method of claim 8 , wherein the graphical model indicates types of the various scene probability distributions. 11. The computer-implemented method of claim 1 , wherein the fully- annotated simulated training data comprises a category for each of pixels comprised in a training image that is comprised in the fully-annotated simulated training data. 12. The computer-implemented method of claim 1 , wherein the fully- annotated simulated training data comprises traffic accident images. 13. The computer-implemented method of claim 1 , wherein the entity is the vehicle and the method further comprises controlling a feature of the vehicle selected from a group consisting of a vehicle speed, a vehicle braking, and vehicle steering, responsive to the dangerous condition being predicted. 14. The computer-implemented method of claim 1 , wherein a number of training epochs in each of a plurality of policy iterations corresponding to said generating, training, and measuring steps is defined as a respective one of the computer-selected simulator- adjusting parameters. 15. The computer-implemented method of claim 1 , wherein a dataset size in each of a plurality of policy iterations corresponding to said generating, training, and measuring steps is defined as a respective one of the computer-selected simulator-adjusting parameters. 16. The computer-implemented method of claim 1 , further comprising selectively choosing between fine-tuning the computer-selected simulator-adjusting parameters and estimating the updated computer-selected simulator-adjusting parameters from the scratch using a random initialization. 17. A computer program product for danger prediction, the computer program product comprising a non-transitory computer readable storage medium having program instructions embodied therewith, the program instructions executable by a computer to cause the computer to perform a method comprising: generating, by a hardware processor, fully-annotated simulated training data for a machine learning model responsive to receiving a set of computer-selected simulator-adjusting parameters, the fully-annotated simulated training data including a category for each of pixels included in training images included in the fully-annotated simulated training data; training, by the hardware processor, the machine learning model using reinforcement learning on the fully-annotated simulated training data; measuring, by the hardware processor, an accuracy of the trained machine learning model relative to learning a discriminative function for a given task, the discriminative function predicting a given label for a given image from the fully-annotated simulated training data; adjusting, by the hardware processor, the computer-selected simulator-adjusting parameters and repeating said training and measuring steps responsive to the accuracy being below a threshold accuracy; and predicting, by the hardware processor, a dangerous condition relative to a motor vehicle and providing a warning to an entity regarding the dangerous condition by applying the trained machine learning model to actual unlabeled data for the motor vehicle. 18. The computer program product of claim 17 , wherein said adjusting step is skipped responsive to the accuracy being equal to or greater than a threshold accuracy. 19. The computer program product of claim 18 , wherein a reward is provided responsive to the accuracy being equal to or greater than the threshold accuracy. 20. A computer processing system for determining command-to- process correspondence, comprising: a memory device including program code stored thereon; a hardware processor, operatively coupled to the memory device, and configured to run the program code stored on the memory device to generate fully-annotated simulated training data for a machine learning model responsive to receiving a set of computer-selected simulator-adjusting parameters, the fully-annotated simulated training data including a category for each of pixels included in training images included in the fully-annotated simulated training data; train the machine learning model using reinforcement learning on the fully- annotated simulated training data; measure an accuracy of the trained machine learning model relative to learning a discriminative function for a given task, the discriminative function predicting a given label for a given image from the fully-annotated simulated training data; adjust the computer-selected simulator-adjusting parameters and repeating said training and measuring steps responsive to the accuracy being below a threshold accuracy; and predict a dangerous condition relative to a motor vehicle and providing a warning to an entity regarding the dangerous condition by applying the trained machine learning model to actual unlab

Assignees

Inventors

Classifications

  • specially adapted for safety · CPC title

  • Recognition of moving objects or obstacles, e.g. vehicles or pedestrians; Recognition of traffic objects, e.g. traffic signs, traffic lights or roads · CPC title

  • Validation; Performance evaluation · CPC title

  • the prediction being responsive to traffic or environmental parameters · CPC title

  • Probabilistic graphical models, e.g. probabilistic networks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11518382B2 cover?
A method is provided for danger prediction. The method includes generating fully-annotated simulated training data for a machine learning model responsive to receiving a set of computer-selected simulator-adjusting parameters. The method further includes training the machine learning model using reinforcement learning on the fully-annotated simulated training data. The method also includes meas…
Who is the assignee on this patent?
Nec Lab America Inc, Nec Corp
What technology area does this patent fall under?
Primary CPC classification B60W60/0015. Mapped technology areas include Operations & Transport.
When was this patent published?
Publication date Tue Dec 06 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).