Generating environmental input encoding for training neural networks

US11537134B1 · US · B1

Patent metadata
FieldValue
Publication numberUS-11537134-B1
Application numberUS-201815988929-A
CountryUS
Kind codeB1
Filing dateMay 24, 2018
Priority dateMay 25, 2017
Publication dateDec 27, 2022
Grant dateDec 27, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An encoding of an environment for operating vehicles is obtained, comprising a combination of at least a representation of moving entities with a graph representation of static infrastructure elements. Using the encoding and a set of one or more observations of the environment state, a machine learning model is trained to produce a probabilistic representation of a set of predicted states of the environment. A trained version of the machine learning model is stored and deployed at one or more vehicles to help plan and control vehicle movements.

First claim

Opening claim text (preview).

What is claimed is: 1. A method, comprising: performing, at one or more computing devices: obtaining a first encoding of a vehicle operation environment, wherein the first encoding comprises a combination of: (a) a representation of a plurality of moving entities, (b) an infrastructure graph, distinct from the representation of the plurality of moving entities, wherein the infrastructure graph comprises a plurality of nodes representing respective static infrastructure components and one or more edges linking the nodes, wherein a particular edge of the one or more edges indicates a relationship between a pair of nodes of the plurality of nodes, and (c) an occupancy grid map; training, using at least the first encoding, a first machine learning model to produce a first probabilistic representation of a set of predicted states of the environment, wherein training the first machine learning model comprises: (a) a first phase of training using the infrastructure graph, wherein at least a portion of the first machine learning model is trained during the first phase to learn representations of one or more infrastructure components; and (b) a second phase of training, using at least the portion of the first machine learning model resulting from the first phase, to train the first machine learning model to predict actions of the plurality of moving entities using the representation of the plurality of moving entities; and storing a trained version of the first machine learning model, wherein output produced by the trained version is usable by one or more motion decision making components of a vehicle to generate motion control directives of the vehicle. 2. The method as recited in claim 1 , wherein the first machine learning model comprises a first neural network model that comprises at least a policy model predicting the respective set of actions taken by a plurality of moving entities, and the method further comprising generating a motion-control directive for the vehicle based at least in part on an output of a state transition model predicting at least one or more physical state changes of one or more entities. 3. The method as recited in claim 2 , wherein the state transition model includes a transition model of one or more of: the vehicle, one or more traffic rules, one or more traffic signals, a pedestrian head orientation, or pedestrian pose, and wherein a particular physical state change of the one or more physical state changes includes one or more of: a position, an orientation, a velocity, an acceleration, a current lane, or a traffic light state. 4. The method as recited in claim 1 , wherein the first encoding comprises the occupancy grid map overlaid with the representation of the plurality of moving entities and the infrastructure graph. 5. The method as recited in claim 4 , wherein the occupancy grid map comprises respective representations of one or more of: (a) an obstacle, (b) a member of a set of pre-defined objects, (c) a structured element of the environment, or (d) an unstructured element of the environment. 6. The method as recited in claim 1 , wherein the plurality of nodes includes a first node representing a first static infrastructure component comprising one or more of: (a) a lane segment of a road, (b) an intersection, (c) a traffic sign, (d) a traffic signal, or (e) a pedestrian walkway. 7. The method as recited in claim 6 , wherein the particular edge of the one or more edges indicates one or more of: (a) a geometric constraint associated with a pair of static infrastructure components including the first static infrastructure component, (b) a topological constraint associated with a pair of static infrastructure components, or (c) one or more attributes associated with a pair of static infrastructure components. 8. The method as recited in claim 1 , wherein the representation of the plurality of moving entities comprises one or more of: (a) a padded vector with an upper-bounded capacity, (b) a mapping of respective representations of the moving entities from a source space to a target space with a different dimensionality than the source space, (c) an output of an aggregation layer of a neural network model, or (d) an output obtained from a neural network model comprising one or more recurrent connections. 9. The method as recited in claim 1 , wherein the first probabilistic representation comprises a joint density function. 10. The method as recited in claim 1 , wherein the first probabilistic representation comprises one or more of: (a) a parametric probability density function or (b) a non-parametric probability density function. 11. A system, comprising: a machine learning model trainer comprising one or more computing devices; wherein the machine learning model trainer is configured to: obtain a first encoding of a vehicle operation environment, wherein the first encoding comprises a combination of: (a) a representation of a plurality of moving entities, (b) an infrastructure graph, distinct from the representation of the plurality of moving entities, wherein the infrastructure graph comprises a plurality of nodes representing respective infrastructure components and one or more edges linking the nodes, wherein a particular edge of the one or more edges indicates a relationship between a pair of nodes of the plurality of nodes, and (c) an occupancy grid map; train, using at least the first encoding, a first machine learning model to produce a first probabilistic representation of a set of predicted states of the environment, wherein training the first machine learning model comprises: (a) a first phase of training using the infrastructure graph, wherein at least a portion of the first machine learning model is trained during the first phase to learn representations of one or more infrastructure components; and (b) a second phase of training, using at least the portion of the first machine learning model resulting from the first phase, to train the first machine learning model to predict actions of the plurality of moving entities using the representation of the plurality of moving entities; and store a trained version of the first machine learning model, wherein output produced by the trained version is usable by one or more motion decision making components of a vehicle to generate motion control directives of the vehicle. 12. The system as recited in claim 11 , wherein the model trainer is configured to train the first machine learning model using one or more observations of the state of the vehicle operation environment. 13. The system as recited in claim 12 , wherein the one or more observations comprise a temporal sequence of observations. 14. The system as recited in claim 11 , wherein the model trainer is configured to train the first machine learning model using one or more latent variables obtained from an estimator, wherein a particular latent variable of the one or more latent variables represents a goal of a moving entity of the plurality of moving entities. 15. The system as recited in claim 11 , wherein at least one computing device of the one or more computing devices is located at a data center, wherein the model trainer is configured to: cause the trained version of the first machine learning model to be transmitted to the vehicle. 16. A non-transitory computer-accessible storage medium storing program instructions that when executed on one or more processors cause the one or more processors to: obtain a first encoding of a vehicle operation environment, wherein the first encoding comprises a combination of: a representation of a plur

Assignees

Inventors

Classifications

  • characterized by the autonomous decision making process, e.g. artificial intelligence, predefined behaviours (using knowledge based models G06N5/00) · CPC title

  • G05D1/0221Primary

    involving a learning process · CPC title

  • Non-supervised learning, e.g. competitive learning · CPC title

  • G06N3/0455Primary

    Auto-encoder networks; Encoder-decoder networks · CPC title

  • Probabilistic graphical models, e.g. probabilistic networks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11537134B1 cover?
An encoding of an environment for operating vehicles is obtained, comprising a combination of at least a representation of moving entities with a graph representation of static infrastructure elements. Using the encoding and a set of one or more observations of the environment state, a machine learning model is trained to produce a probabilistic representation of a set of predicted states of th…
Who is the assignee on this patent?
Apple Inc
What technology area does this patent fall under?
Primary CPC classification G05D1/0221. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 27 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).