System and method for training a machine learning model deployed on a simulation platform

US11328219B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11328219-B2
Application numberUS-201815952089-A
CountryUS
Kind codeB2
Filing dateApr 12, 2018
Priority dateApr 12, 2018
Publication dateMay 10, 2022
Grant dateMay 10, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

System and method for training a machine learning model are disclosed. In one embodiment, for each of the driving scenarios, responsive to sensor data from one or more sensors of a vehicle and the driving scenario, driving statistics and environment data of the vehicle are collected while the vehicle is driven by a human driver in accordance with the driving scenario. Upon completion of the driving scenario, the driver is requested to select a label for the completed driving scenario and the selected label is stored responsive to the driver selection. Features are extracted from the driving statistics and the environment data based on predetermined criteria. The extracted features include some of the driving statistics and some of the environment data collected at the different points in time during the driving scenario.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method of training a machine learning model for evaluating autonomous driving of an autonomous driving vehicle (ADV), the method comprising: for each of a plurality of driving scenarios, collecting driving statistics and environment data of the ADV from one or more sensors of the ADV, while the ADV is driven by a human driver in manual driving mode in accordance with the driving scenario, upon completion of the driving scenario, displaying a user interface requesting the human driver to select a label for the completed driving scenario, the label indicating unsafe, safe, uncomfortable or comfortable, wherein the human driver selects the label using the user interface, storing the selected label responsive to the selection of the human driver, and extracting features from the driving statistics and the environment data based on predetermined criteria deciding to extract information indicating responses of the ADV and obstacle information, the extracted features including some of the driving statistics and some of the environment data collected at different points in time during the driving scenario; and training a machine learning model to continuously learn driving behaviors and preferences of the human driver under the plurality of driving scenarios using the extracted features and a plurality of labels selected by the human driver for the plurality of driving scenarios; wherein the trained machine learning model is deployed for performance of autonomous driving simulation that tests a perception and control system of the ADV by using the trained machine learning model to virtually operate a virtual ADV to drive through the driving scenarios under a simulation environment to test the perception and control system of the ADV; wherein simulation results from the autonomous driving simulation are used to determine whether the perception and control system of the ADV passes or fails. 2. The method of claim 1 , wherein some of the driving statistics includes information indicating responses of the ADV, and some of the environment data includes obstacle information. 3. The method of claim 1 , wherein the extracted features include, for each driving scenario, identifiers (IDs) that include obstacle IDs representing obstacles in the driving scenario and a vehicle ID representing the ADV, a timestamp associated with each of the IDs, wherein the timestamp represents a time T i , where i≥0, recorded at the different points in time while the ADV is driven through the driving scenario, information indicating responses of the ADV captured at the recorded time T i , and obstacle information captured at the recorded time T i . 4. The method of claim 2 , wherein the information indicating responses of the ADV includes speed, acceleration, deceleration, and rate of change of acceleration of the ADV, and the obstacle information includes positions of obstacles. 5. The method of claim 1 , further comprising: subsequent to collecting driving statistics and environment data of the ADV, storing the collected driving statistics and the environment data remotely onto a server; and prior to extracting features from the driving statistics and the environment data, retrieving the collected driving statistics and the environment data from the server. 6. A non-transitory machine-readable medium having instructions stored therein, which when executed by a processor, cause the processor to perform operations, the operations comprising: for each of a plurality of driving scenarios, collecting driving statistics and environment data of an autonomous driving vehicle (ADV) from one or more sensors of the ADV, while the ADV is driven by a human driver in manual driving mode in accordance with the driving scenario, upon completion of the driving scenario, displaying a user interface requesting the driver to select a label for the completed driving scenario, the label indicating unsafe, safe, uncomfortable or comfortable, wherein the human driver selects the label using the user interface, storing the selected label responsive to the driver selection, and extracting features from the driving statistics and the environment data based on predetermined criteria deciding to extract information indicating responses of the ADV and obstacle information, the extracted features including some of the driving statistics and some of the environment data collected at different points in time during the driving scenario; and training a machine learning model to continuously learn driving behaviors and preferences of the human driver under the plurality of driving scenarios using the extracted features and a plurality of labels selected by the human driver for the plurality of driving scenarios; wherein the trained machine learning model is deployed for performance of autonomous driving simulation that tests a perception and control system of the ADV by using the trained machine learning model to virtually operate a virtual ADV to drive through the driving scenarios under a simulation environment to test the perception and control system of the ADV; wherein simulation results from the autonomous driving simulation are used to determine whether the perception and control system of the ADV passes or fails. 7. The non-transitory machine-readable medium of claim 6 , wherein some of the driving statistics includes information indicating responses of the ADV, and some of the environment data includes obstacle information. 8. The non-transitory machine-readable medium of claim 6 , wherein the extracted features include, for each driving scenario, identifiers (IDs) that include obstacle IDs representing obstacles in the driving scenario and a vehicle ID representing the ADV, a timestamp associated with each of the IDs, wherein the timestamp represents a time T i , where i≥0, recorded at the different points in time while the ADV is driven through the driving scenario, information indicating responses of the ADV captured at the recorded time T i , and obstacle information captured at the recorded time T i . 9. The non-transitory machine-readable medium of claim 7 , wherein the information indicating responses of the ADV includes speed, acceleration, deceleration, and rate of change of acceleration of the ADV, and the obstacle information includes positions of obstacles. 10. The non-transitory machine-readable medium of claim 6 , wherein the operations further comprise: subsequent to collecting driving statistics and environment data of the ADV, storing the collected driving statistics and the environment data remotely onto a server; and prior to extracting features from the driving statistics and the environment data, retrieving the collected driving statistics and the environment data from the server. 11. A data processing system, comprising: a processor; and a memory coupled to the processor to store instructions, which when executed by the processor, cause the processor to perform operations, the operations including for each of a plurality of driving scenarios, collecting driving statistics and environment data of an autonomous driving vehicle (ADV) from one or more sensors of the ADV, while the ADV is driven by a human driver in manual driving mode in accordance with the driving scenario, upon completion of the driving scenario, displaying a user interface requesting the driver to select a label for the completed driving scenario, the label indicating unsafe, safe, uncomfortable or comfortable, wherein the human driver selects the label using the user interface, storing the selected label responsive to the driver selection, and extracting features from the d

Assignees

Inventors

Classifications

  • Design optimisation, verification or simulation (optimisation, verification or simulation of circuit designs G06F30/30) · CPC title

  • G06F30/15Primary

    Vehicle, aircraft or watercraft design · CPC title

  • using electronic data carriers · CPC title

  • G06N20/00Primary

    Machine learning · CPC title

  • communicating information to a remotely located station (transmission systems for measured values G08C) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11328219B2 cover?
System and method for training a machine learning model are disclosed. In one embodiment, for each of the driving scenarios, responsive to sensor data from one or more sensors of a vehicle and the driving scenario, driving statistics and environment data of the vehicle are collected while the vehicle is driven by a human driver in accordance with the driving scenario. Upon completion of the dri…
Who is the assignee on this patent?
Baidu Usa Llc
What technology area does this patent fall under?
Primary CPC classification G06F30/15. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 10 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).