Learning safety and human-centered constraints in autonomous vehicles

US11613269B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11613269-B2
Application numberUS-201916724635-A
CountryUS
Kind codeB2
Filing dateDec 23, 2019
Priority dateDec 23, 2019
Publication dateMar 28, 2023
Grant dateMar 28, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Traversing a vehicle transportation network includes operating a scenario-specific operational control evaluation module instance. The scenario-specific operational control evaluation module instance includes an instance of a scenario-specific operational control evaluation model of a distinct vehicle operational scenario. Operating the scenario-specific operational control evaluation module instance includes identifying a multi-objective policy for the scenario-specific operational control evaluation model. The multi-objective policy may include a relationship between at least two objectives. Traversing the vehicle transportation network includes receiving a candidate vehicle control action associated with each of the at least two objectives. Traversing the vehicle transportation network includes selecting a vehicle control action based on a buffer value. Traversing the vehicle transportation network includes performing the selected vehicle control action, determining a preference indicator for each objective, and updating the multi-objective policy.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for use in a vehicle transportation network, the method comprising: identifying, by an autonomous vehicle, a multi-objective policy for an intersection scenario-specific operational control evaluation model, wherein the multi-objective policy includes at least a first objective associated with a passenger comfort preference that is learned based on user override data, a second objective associated with a passenger risk preference that is learned based on the user override data, and a topological graph that shows a relationship between the first objective and the second objective, wherein an edge of the topological graph defines a constraint that restricts policies of the first objective and the second objective; receiving, from the multi-objective policy, a candidate vehicle control action associated with the first objective and a candidate vehicle control action associated with the second objective; determining a buffer value based on the relationship between the first objective and the second objective, wherein the buffer value includes an amount of variation of risk between the first objective and the second objective; selecting a vehicle control action based on the buffer value, wherein the selected vehicle control action is the candidate vehicle control action associated with the first objective or the candidate vehicle control action associated with the second objective; performing the selected vehicle control action to traverse a portion of the vehicle transportation network; determining a preference indicator for each objective based on the performed vehicle control action; determining a competence measure for each of the first objective and the second objective based on the performed vehicle control action to restrict a number of available vehicle control actions, wherein each of the competence measures of the first objective and the second objective is a learned probability of success; and update the multi-objective policy based on the preference indicator and each of the competence measures of the first objective and the second objective. 2. The method of claim 1 , wherein the preference indicator is a reward value. 3. The method of claim 2 , wherein the reward value is a positive integer on a condition that the performed vehicle control action is successful. 4. The method of claim 2 , wherein the reward value is a negative integer on a condition that the performed vehicle control action is overridden. 5. The method of claim 1 , wherein the preference indicator is based on a user input. 6. The method of claim 1 , wherein the preference indicator is a change in the buffer value. 7. The method of claim 1 , wherein the preference indicator is an ordering of a plurality of objectives. 8. The method of claim 1 , wherein at least the first candidate vehicle control action or the second candidate vehicle control action is based on aggregated data. 9. The method of claim 8 , wherein the aggregated data is based on a geographic region, a user, a vehicle, a company, or a fleet of vehicles. 10. An autonomous vehicle comprising: a processor configured to execute instructions stored on a non-transitory computer readable medium to: identify a multi-objective policy for an intersection scenario-specific operational control evaluation model, wherein the multi-objective policy includes at least a first objective that prioritizes vehicle control actions based on a passenger comfort preference that is learned based on user override data, a second objective that prioritizes vehicle control actions based on an expected time of completion, and a topological graph that shows a relationship between the first objective and the second objective, wherein an edge of the topological graph defines a constraint that restricts policies of the first objective and the second objective; receive, from the multi-objective policy, a candidate vehicle control action associated with the first objective and a candidate vehicle control action associated with the second objective; determine a buffer value based on the relationship between the first objective and the second objective, wherein the buffer value includes an amount of variation of risk between the first objective and the second objective; select a vehicle control action based on the buffer value, wherein the selected vehicle control action is the candidate vehicle control action associated with the first objective or the candidate vehicle control action associated with the second objective; perform the selected vehicle control action to traverse a portion of the vehicle transportation network; determine a preference indicator for each objective based on the performed vehicle control action; determine a competence measure for each of the first objective and the second objective based on the performed vehicle control action to restrict a number of available vehicle control actions, wherein each of the competence measures of the first objective and the second objective is a learned probability of success; and update the multi-objective policy based on the preference indicator and each of the competence measures of the first objective and the second objective. 11. The autonomous vehicle of claim 10 , wherein the preference indicator is a reward value. 12. The autonomous vehicle of claim 11 , wherein the reward value is a positive integer on a condition that the performed vehicle control action is successful. 13. The autonomous vehicle of claim 11 , wherein the reward value is a negative integer on a condition that the performed vehicle control action is overridden. 14. The autonomous vehicle of claim 10 , wherein the preference indicator is based on a user input. 15. The autonomous vehicle of claim 10 , wherein the preference indicator is a change in the buffer value. 16. The autonomous vehicle of claim 10 , wherein the preference indicator is an ordering of a plurality of objectives. 17. The autonomous vehicle of claim 10 , wherein at least the first candidate vehicle control action or the second candidate vehicle control action is based on aggregated data. 18. The autonomous vehicle of claim 10 , wherein the aggregated data is based on a geographic region, a user, a vehicle, a company, or a fleet of vehicles. 19. A method for use in a vehicle transportation network, the method comprising: identifying, by an autonomous vehicle, a multi-objective policy for an intersection scenario-specific operational control evaluation model, wherein the multi-objective policy includes a topological graph that shows a relationship between at least two objectives, wherein an edge of the topological graph defines a constraint that restricts policies of the at least two objectives, and wherein one of the at least two objectives is associated with a passenger comfort preference that is learned based on user override data; receiving, from the multi-objective policy for the scenario-specific operational control evaluation module instance, candidate vehicle control actions associated with each of the at least two objectives; selecting a vehicle control action based on a buffer value, wherein the selected vehicle control action is one of the candidate vehicle control actions; performing the selected vehicle control action to traverse a portion of the vehicle transportation network; determining a preference indicator for each objective based on the performed vehicle control action; determining a competence measure each of the first objective and the second objective based on the perfo

Assignees

Inventors

Classifications

  • Adaptive recalibration · CPC title

  • exterior to a vehicle by using sensors mounted on the vehicle · CPC title

  • of positioning data, e.g. GPS [Global Positioning System] data · CPC title

  • B60W50/085Primary

    Changing the parameters of the control units, e.g. changing limit values, working points by control input · CPC title

  • Details of control systems ensuring comfort, safety or stability not otherwise provided for · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11613269B2 cover?
Traversing a vehicle transportation network includes operating a scenario-specific operational control evaluation module instance. The scenario-specific operational control evaluation module instance includes an instance of a scenario-specific operational control evaluation model of a distinct vehicle operational scenario. Operating the scenario-specific operational control evaluation module in…
Who is the assignee on this patent?
Nissan North America Inc, Univ Massachusetts, Renault Sas
What technology area does this patent fall under?
Primary CPC classification B60W50/085. Mapped technology areas include Operations & Transport.
When was this patent published?
Publication date Tue Mar 28 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).