Systems and methods for sensor data processing and object detection and motion prediction for robotic platforms

US12259694B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12259694-B2
Application numberUS-202418656210-A
CountryUS
Kind codeB2
Filing dateMay 6, 2024
Priority dateOct 14, 2020
Publication dateMar 25, 2025
Grant dateMar 25, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods are disclosed for detecting and predicting the motion of objects within the surrounding environment of a system such as an autonomous vehicle. For example, an autonomous vehicle can obtain sensor data from a plurality of sensors comprising at least two different sensor modalities (e.g., RADAR, LIDAR, camera) and fused together to create a fused sensor sample. The fused sensor sample can then be provided as input to a machine learning model (e.g., a machine learning model for object detection and/or motion prediction). The machine learning model can have been trained by independently applying sensor dropout to the at least two different sensor modalities. Outputs received from the machine learning model in response to receipt of the fused sensor samples are characterized by improved generalization performance over multiple sensor modalities, thus yielding improved performance in detecting objects and predicting their future locations, as well as improved navigation performance.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method comprising: (a) obtaining sensor data from a plurality of sensors comprising at least two different sensor modalities; (b) applying independent sensor dropout to the at least two different sensor modalities; (c) fusing the sensor data from the at least two different sensor modalities with sensor dropout independently applied thereto to generate a fused sensor sample; (d) generating a training data set comprising the fused sensor sample; and (e) training a machine-learned model for object detection using the training data set, wherein the trained machine-learned model is employed by a robotic platform operating within an environment. 2. The computer-implemented method of claim 1 , wherein the robotic platform comprises an autonomous vehicle. 3. The computer-implemented method of claim 1 , wherein the environment comprises a real-world environment or a simulated environment. 4. The computer-implemented method of claim 1 , wherein the trained machine-learned model comprises an end-to-end model that is configured to jointly perform object detection and motion prediction. 5. The computer-implemented method of claim 1 , wherein (b) comprises independently applying sensor dropout to each of the at least two different sensor modalities at a fixed probability associated with the sensor modality. 6. The computer-implemented method of claim 1 , wherein the plurality of sensors comprise a RADAR system, a LIDAR system, and a camera. 7. The computer-implemented method of claim 6 , wherein: the at least two different sensor modalities comprise at least one of the RADAR system or the camera; and (b) comprises zeroing out a final feature vector for a portion of the sensor data obtained from the at least one of the RADAR system or the camera. 8. The computer-implemented method of claim 6 , wherein: the at least two different sensor modalities comprise the LIDAR system; and (b) comprises replacing a LIDAR intensity value with a sentinel value for a portion of the sensor data obtained from the LIDAR system. 9. The computer-implemented method of claim 1 , wherein (e) comprises: inputting the fused sensor sample into the machine-learned model; generating a loss metric for the machine-learned model based on output of at least a portion of the machine-learned model in response to the fused sensor sample as input; and modifying at least a portion of the machine-learned model based on the loss metric. 10. The computer-implemented method of claim 9 , wherein the loss metric comprises at least one of a regression loss, a classification loss, an adversarial loss, a multi-task loss, or a perceptual loss. 11. The computer-implemented method of claim 9 , wherein the loss metric is associated with a plurality of loss terms, the plurality of loss terms comprising at least a first loss term associated with a determination or generation of bounding shapes. 12. The computer-implemented method of claim 11 , the plurality of loss terms further comprising at least a second loss term associated with a classification of features. 13. A training computing system, comprising: one or more processors; and one or more non-transitory computer-readable medium storing instructions that when executed by the one or more processors cause the training computing system to perform operations, the operations comprising: (a) obtaining sensor data from a plurality of sensors comprising at least two different sensor modalities; (b) applying independent sensor dropout to the at least two sensor modalities; (c) fusing the sensor data from the at least two different sensor modalities with sensor dropout independently applied thereto to generate a fused sensor sample; (d) generating a training data set comprising the fused sensor sample; and (e) training a machine-learned model for object detection using the training data set, wherein the trained machine-learned model is employed by a robotic platform operating within an environment. 14. The training computing system of claim 13 , wherein the robotic platform comprises an autonomous vehicle. 15. The training computing system of claim 13 , wherein the environment comprises a real-world environment or a simulated environment. 16. The training computing system of claim 13 , wherein the trained machine-learned model comprises an end-to-end model that is configured to jointly perform object detection and motion prediction. 17. The training computing system of claim 13 , wherein (b) comprises independently applying sensor dropout to each of the at least two different sensor modalities at a fixed probability associated with the sensor modality. 18. The training computing system of claim 13 , wherein (e) comprises: inputting the fused sensor sample into the machine-learned model; generating a loss metric for the machine-learned model based on output of at least a portion of the machine-learned model in response to the fused sensor sample as input; and modifying at least a portion of the machine-learned model based on the loss metric. 19. The training computing system of claim 18 , wherein the loss metric comprises one or more of a regression loss and a classification loss. 20. One or more non-transitory computer-readable medium storing instructions that when executed by one or more processors cause the one or more processors to perform operations, the operations comprising: (a) obtaining sensor data from a plurality of sensors comprising at least two different sensor modalities; (b) applying independent sensor dropout to the at least two sensor modalities; (c) fusing the sensor data from the at least two different sensor modalities with sensor dropout independently applied thereto to generate a fused sensor sample; (d) generating a training data set comprising the fused sensor sample; and (e) training a machine-learned model for object detection using the training data set, wherein the trained machine-learned model is employed by a robotic platform operating within an environment.

Assignees

Inventors

Classifications

  • Supervised learning · CPC title

  • Backpropagation, e.g. using gradient descent · CPC title

  • Convolutional networks [CNN, ConvNet] · CPC title

  • Radar; Laser, e.g. lidar · CPC title

  • Image sensing, e.g. optical camera · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12259694B2 cover?
Systems and methods are disclosed for detecting and predicting the motion of objects within the surrounding environment of a system such as an autonomous vehicle. For example, an autonomous vehicle can obtain sensor data from a plurality of sensors comprising at least two different sensor modalities (e.g., RADAR, LIDAR, camera) and fused together to create a fused sensor sample. The fused senso…
Who is the assignee on this patent?
Aurora Operations Inc
What technology area does this patent fall under?
Primary CPC classification G05B13/0265. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Mar 25 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).