Automatically generating machine-learning training data

US12288402B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12288402-B2
Application numberUS-202217850241-A
CountryUS
Kind codeB2
Filing dateJun 27, 2022
Priority dateJun 27, 2022
Publication dateApr 29, 2025
Grant dateApr 29, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A computer includes a processor and a memory, and the memory stores instructions executable by the processor to receive first environmental data recorded by an environmental sensor on board a vehicle, receive nonenvironmental data recorded on board the vehicle independently of the first environmental data, add a plurality of annotations derived from the nonenvironmental data to the environmental data, and train a machine-learning program to process second environmental data by using the first environmental data as training data and the annotations as ground truth for the first environmental data.

First claim

Opening claim text (preview).

The invention claimed is: 1. A computer comprising a processor and a memory, the memory storing instructions executable by the processor to: receive second environmental data of an area outside a second vehicle; execute a machine-learning program with the second environmental data as input; and actuate a component of the second vehicle based on the execution of the machine-learning program; wherein the machine-learning program is trained by: receiving first environmental data of an area outside a first vehicle recorded by an environmental sensor on board the first vehicle; receiving nonenvironmental data recorded on board the first vehicle about an inside of the first vehicle, the nonenvironmental data including at least one of audio data or motion data, the motion data indicating motion of the first vehicle, the nonenvironmental data being time-synchronized with the first environmental data; receiving speed data that is time-synchronized with the first environmental data; determining a plurality of annotation times for a plurality of respective annotations based on the speed data, the annotations being derived from the nonenvironmental data at a plurality of respective record times, wherein determining the annotation times includes applying offsets to the record times, resulting in offsetted times, and applying the annotations at the offsetted times instead of the record times, the offsets dependent on the speed data; adding the annotations derived from the at least one of the audio data or the motion data to the first environmental data, adding the annotations being based on the speed data, the speed data indicating a speed at which the first vehicle is traveling, wherein adding the annotations includes adding the annotations at the respective annotation times; and training the machine-learning program to process the second environmental data by using the first environmental data as training data and the annotations as ground truth for the first environmental data. 2. The computer of claim 1 , wherein the component includes at least one of a propulsion system, a brake system, or a steering system. 3. The computer of claim 1 , wherein the environmental sensor generates the first environmental data at a constant data rate, and determining the annotation times is based on the data rate. 4. The computer of claim 1 , wherein the nonenvironmental data includes the audio data, and the annotations are derived from the audio data. 5. The computer of claim 4 , wherein the machine-learning program is further trained by converting the audio data to text, and the annotations are derived from the text. 6. The computer of claim 4 , wherein the audio data is recorded from a passenger cabin of the first vehicle. 7. The computer of claim 1 , wherein the nonenvironmental data includes data transmitted over a vehicle network of the first vehicle. 8. The computer of claim 7 , wherein the nonenvironmental data is generated by components of the first vehicle. 9. The computer of claim 7 , wherein the nonenvironmental data is automatically generated on board the first vehicle. 10. The computer of claim 7 , wherein the vehicle network includes a controller area network (CAN) bus. 11. The computer of claim 1 , wherein the first environmental data is based on electromagnetic radiation. 12. The computer of claim 1 , wherein the first environmental data is first video data, the environmental sensor is a camera, and the second environmental data is second video data. 13. The computer of claim 12 , wherein the machine-learning program is trained to perform image recognition. 14. The computer of claim 1 , wherein, when used to train the machine-learning program, the first environmental data lacks manually added annotations. 15. The computer of claim 1 , wherein the nonenvironmental data includes the motion data indicating the motion of the first vehicle, and the annotations are derived from the motion data. 16. A method comprising: receiving first environmental data of an area outside a first vehicle recorded by an environmental sensor on board the first vehicle; receiving nonenvironmental data recorded on board the first vehicle about an inside of the first vehicle, the nonenvironmental data including at least one of audio data or motion data, the motion data indicating motion of the first vehicle, the nonenvironmental data being time-synchronized with the first environmental data; receiving speed data that is time-synchronized with the first environmental data; determining a plurality of annotation times for a plurality of respective annotations based on the speed data, the annotations being derived from the nonenvironmental data at a plurality of respective record times, wherein determining the annotation times includes applying offsets to the record times, resulting in offsetted times, and applying the annotations at the offsetted times instead of the record times, the offsets dependent on the speed data; adding the annotations derived from the at least one of the audio data or the motion data to the first environmental data, adding the annotations being based on the speed data, the speed data indicating a speed at which the first vehicle is traveling, wherein adding the annotations includes adding the annotations at the respective annotation times; training a machine-learning program to process second environmental data by using the first environmental data as training data and the annotations as ground truth for the first environmental data; installing the machine-learning program on a vehicle computer of a second vehicle; and actuating a component of the second vehicle based on the machine-learning program.

Assignees

Inventors

Classifications

  • Radar; Laser, e.g. lidar · CPC title

  • Image sensing, e.g. optical camera · CPC title

  • related to drivers or passengers · CPC title

  • using neural networks · CPC title

  • inside of a vehicle, e.g. relating to seat occupancy, driver state or inner lighting conditions · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12288402B2 cover?
A computer includes a processor and a memory, and the memory stores instructions executable by the processor to receive first environmental data recorded by an environmental sensor on board a vehicle, receive nonenvironmental data recorded on board the vehicle independently of the first environmental data, add a plurality of annotations derived from the nonenvironmental data to the environmenta…
Who is the assignee on this patent?
Ford Global Tech Llc
What technology area does this patent fall under?
Primary CPC classification G06V20/58. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 29 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).