Vehicle trajectory prediction model with semantic map and LSTM

US11127142B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11127142-B2
Application numberUS-201916732125-A
CountryUS
Kind codeB2
Filing dateDec 31, 2019
Priority dateDec 31, 2019
Publication dateSep 21, 2021
Grant dateSep 21, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system and method for predicting the near-term trajectory of a moving obstacle sensed by an autonomous driving vehicle (ADV) is disclosed. The method applies neural networks such as a LSTM model to learn dynamic features of the moving obstacle's motion based on its past trajectory up to its current position and a CNN model to learn the semantic map features of the driving environment in a portion of an image map. From the learned dynamic features of the moving obstacle and the learned semantic map features of the environment, the method applies a neural network to iteratively predict the moving obstacle's positions for successive time points of a prediction interval. To predict the moving obstacle's position at the next time point from the currently predicted position, the methods may update the learned dynamic features of the moving obstacle based on its past trajectory up to the currently predicted position.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method to predict a trajectory of a target obstacle detected by an autonomous driving vehicles (ADV), the method comprising: in response to an image of a driving environment around the target obstacle, processing the image by a first neural network (NN) to learn semantic map features of the image; embedding features of the target obstacle into a low-dimensional vector; processing the low-dimensional vector and most recent dynamic features of the target obstacle by a second NN to learn updated dynamic features of the target obstacle; processing the updated dynamic features of the target obstacle and the semantic map features of the image by a third NN to generate a next predicted position of the target obstacle, the next predicted position including predicted features; and repeating embedding the predicted features of the next predicted position, processing by the second NN and processing by the third NN until all predicted positions of a prediction interval are generated. 2. The method of claim 1 , wherein the features of the target obstacle in the image comprise one or more of a position, a heading, a speed, or a turning angle of the target obstacle. 3. The method of claim 1 , wherein the semantic map features of the image comprise one or more of vehicles, traffic elements, or road features in the image. 4. The method of claim 1 , wherein the most recent dynamic features of the target obstacle processed by the second NN to learn updated dynamic features of the target obstacle used to generate a first predicted position of the prediction interval comprises dynamic features of the target obstacle learned from a previous planning cycle of a plurality of planning cycles. 5. The method of claim 4 , wherein all predicted positions of the prediction interval are generated to correspond to a plurality of periodic time points during the prediction interval for each of the plurality of planning cycles. 6. The method of claim 1 , wherein the most recent dynamic features of the target obstacle processed by the second NN to learn updated dynamic features of the target obstacle used to generate a second or subsequent predicted position of the prediction interval comprises dynamic features of the target obstacle learned from a previous predicted position of the prediction interval. 7. The method of claim 1 , wherein the first NN comprises a convolutional NN (CNN) model, the second NN comprises a long short-term memory (LSTM) model, and the third NN comprises a multi-layer perceptron (MLP). 8. The method of claim 1 , wherein receiving an image containing the target obstacle comprises: receiving an image map of one or more sensed obstacles detected by the ADV for a planning cycle; selecting the target obstacle from the one or more sensed obstacles; rotating the image map to position the target obstacle at a reference point in a reference heading of the rotated image map; and cropping the rotated image map with respect to the target obstacle at the reference point in the reference heading to generate the image. 9. The method of claim 8 , further comprising: selecting each of the one or more sensed obstacles as the target obstacle during the planning cycle. 10. A non-transitory machine-readable medium having instructions stored therein, which when executed by a processor, cause the processor to perform operations to predict a trajectory of a target obstacle detected by an autonomous driving vehicles (ADV), the operations comprising: in response to an image of a driving environment around the target obstacle, processing the image by a first neural network (NN) to learn semantic map features of the image; embedding features of the target obstacle into a low-dimensional vector; processing the low-dimensional vector and most recent dynamic features of the target obstacle by a second NN to learn updated dynamic features of the target obstacle; processing the updated dynamic features of the target obstacle and the semantic map features of the image by a third NN to generate a next predicted position of the target obstacle, the next predicted position including predicted features; and repeating embedding the predicted features of the next predicted position, processing by the second NN and processing by the third NN until all predicted positions of a prediction interval are generated. 11. The non-transitory machine-readable medium of claim 10 , wherein the features of the target obstacle in the image comprise one or more of a position, a heading, a speed, or a turning angle of the target obstacle. 12. The non-transitory machine-readable medium of claim 10 , wherein the semantic map features of the image comprise one or more of vehicles, traffic elements, or road features in the image. 13. The non-transitory machine-readable medium of claim 10 , wherein the most recent dynamic features of the target obstacle processed by the second NN to learn updated dynamic features of the target obstacle used to generate a first predicted position of the prediction interval comprises dynamic features of the target obstacle learned from a previous planning cycle of a plurality of planning cycles. 14. The non-transitory machine-readable medium of claim 10 , wherein the most recent dynamic features of the target obstacle processed by the second NN to learn updated dynamic features of the target obstacle used to generate a second or subsequent predicted position of the prediction interval comprises dynamic features of the target obstacle learned from a previous predicted position of the prediction interval. 15. The non-transitory machine-readable medium of claim 10 , wherein the first NN comprises a convolutional NN (CNN) model, the second NN comprises a long short-term memory (LSTM) model, and the third NN comprises a multi-layer perceptron (MLP). 16. The non-transitory machine-readable medium of claim 10 , wherein the operations further comprise: receiving an image map of one or more sensed obstacles detected by the ADV for a planning cycle; selecting the target obstacle from the one or more sensed obstacles; rotating the image map to position the target obstacle at a reference point in a reference heading of the rotated image map; and cropping the rotated image map with respect to the target obstacle at the reference point in the reference heading to generate the image. 17. A data processing system, comprising: a processor; and a memory coupled to the processor to store instructions, which when executed by the processor, cause the processor to perform operations of predicting a trajectory of a target obstacle detected by an autonomous driving vehicles (ADV), the operations comprising: in response to an image of a driving environment around the target obstacle, processing the image by a first neural network (NN) to learn semantic map features of the image, embedding features of the target obstacle into a low-dimensional vector; processing the low-dimensional vector and most recent dynamic features of the target obstacle by a second NN to learn updated dynamic features of the target obstacle, processing the updated dynamic features of the target obstacle and the semantic map features of the image by a third NN to generate a next predicted position of the target obstacle, the next predicted position including predicted features, and repeating embedding the predicted features of the next predicted position, processing by the second NN and processing by the third NN until all predicted positions of a prediction interval are generated.

Assignees

Inventors

Classifications

  • relating to a temporal dimension, e.g. time-based feature extraction; Pattern tracking · CPC title

  • using neural networks · CPC title

  • using classification, e.g. of video objects · CPC title

  • Validation; Performance evaluation; Active pattern learning techniques · CPC title

  • Combinations of networks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11127142B2 cover?
A system and method for predicting the near-term trajectory of a moving obstacle sensed by an autonomous driving vehicle (ADV) is disclosed. The method applies neural networks such as a LSTM model to learn dynamic features of the moving obstacle's motion based on its past trajectory up to its current position and a CNN model to learn the semantic map features of the driving environment in a por…
Who is the assignee on this patent?
Baidu Usa Llc
What technology area does this patent fall under?
Primary CPC classification G06V20/58. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Sep 21 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).