Audio logging for model training and onboard validation utilizing autonomous driving vehicle

US11735205B2 · US · B2

Patent metadata
Field | Value
Publication number | US-11735205-B2
Application number | US-202117248172-A
Country | US
Kind code | B2
Filing date | Jan 12, 2021
Priority date | Jan 12, 2021
Publication date | Aug 22, 2023
Grant date | Aug 22, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods for generating labelled audio data and onboard validation of the labelled audio data utilizing an autonomous driving vehicle (ADV) while the ADV is operating within a driving environment are disclosed. The method includes recording a sound emitted by an object within the driving environment of the ADV, and converting the recorded sound into audio samples. The method further includes labelling the audio samples, and refining the labelled audio samples to produce refined labelled audio data. The refined labelled audio data is utilized to subsequently train a machine learning algorithm to recognize a sound source during autonomous driving of the ADV. The method further includes generating a performance profile of the refined labelled audio data based at least on the audio samples, a position of the object, and a relative direction of the object. The position of the object and the relative direction of the object are determined by a perception system of the ADV.
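The labelling and refinement steps in the abstract can be illustrated with a minimal Python sketch. Everything here is an assumption for illustration: the abstract does not say how samples are stored or what "refining" involves, so the `LabelledAudioSample` record, the `label` and `refine` helpers, and the RMS-energy threshold are hypothetical. The label fields follow the ones the claims list (sample ID, positions, direction).

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple


@dataclass
class LabelledAudioSample:
    """Hypothetical record for one labelled audio chunk."""
    sample_id: str                        # audio sample identifier (ID)
    samples: List[float]                  # amplitudes, assumed normalised to [-1.0, 1.0]
    positions: List[Tuple[float, float]]  # object positions (x, y), assumed metres
    direction_deg: float                  # direction associated with the chunk, degrees


def rms(samples: Sequence[float]) -> float:
    """Root-mean-square energy of a chunk; 0.0 for an empty chunk."""
    if not samples:
        return 0.0
    return (sum(s * s for s in samples) / len(samples)) ** 0.5


def label(chunks, positions, direction_deg, prefix="siren"):
    """Tag each recorded chunk with an ID, the object positions, and a direction."""
    return [
        LabelledAudioSample(f"{prefix}-{i:04d}", list(chunk), list(positions), direction_deg)
        for i, chunk in enumerate(chunks)
    ]


def refine(labelled, min_rms=0.05):
    """Assumed refinement rule: drop chunks that are too quiet to be useful."""
    return [s for s in labelled if rms(s.samples) >= min_rms]


chunks = [[0.0, 0.0], [0.5, -0.5]]  # one silent chunk, one audible chunk
labelled = label(chunks, positions=[(10.0, 0.0)], direction_deg=90.0)
refined = refine(labelled)
```

In this sketch the silent chunk is discarded and only the audible one survives as refined labelled data; a real system might refine by other criteria, such as manual review.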

First claim

Opening claim text (preview).

What is claimed is:

1. A method of generating labelled audio data and onboard validation of the labelled audio data utilizing an autonomous driving vehicle (ADV) while the ADV is operating within a driving environment, the method comprising: recording a sound emitted by an object within the driving environment of the ADV, and converting the recorded sound into audio samples; labelling the audio samples, and refining the labelled audio samples to produce refined labelled audio data, wherein the refined labelled audio data is utilized to subsequently train a machine learning algorithm to recognize a sound source during autonomous driving of the ADV; and generating a performance profile of the refined labelled audio data based at least on the audio samples, a position of the object, and a relative direction of the object, wherein the position of the object and the relative direction of the object are determined by a perception system of the ADV, wherein using the audio samples, the position of the object, and the relative direction of the object to generate the performance profile comprises profiling the refined labelled audio data against the audio samples, the position of the object, and the relative direction of the object.

2. The method of claim 1, wherein generating the performance profile of the refined labelled audio data further comprises determining the position of the object and the relative direction of the object based on sensors data provided by visual sensors of the ADV, wherein the visual sensors are coupled to the perception system.

3. The method of claim 1, wherein labelling the audio samples comprises tagging the audio samples with an audio sample identifier (ID), one or more positions associated with the audio samples, and a direction associated with the audio samples.

4. The method of claim 1, wherein the object is an emergency vehicle and the emitted sound is a siren sound.

5. The method of claim 1, wherein the audio samples are manually labelled by a user of the ADV.

6. The method claim 1, wherein the performance profile is stored locally in a persistent storage device in the ADV.

7. A computer-implemented method for onboard validation of labelled audio data utilizing an autonomous driving vehicle (ADV) while the ADV is operating within a driving environment, the method comprising: recording a sound emitted by an obstacle within the driving environment of the ADV to create audio samples; determining a position of the obstacle and a relative direction of the obstacle based on sensors data provided by visual sensors of the ADV; and using the audio samples, the position of the obstacle, and the relative direction of the obstacle to generate a performance profile of refined labelled audio data, wherein the refined labelled audio data is generated by labelling the audio samples and refining the labelled audio samples, and is utilized to subsequently train a machine learning algorithm to recognize a sound source during autonomous driving of the ADV, wherein using the audio samples, the position of the obstacle, and the relative direction of the obstacle to generate the performance profile comprises profiling the refined labelled audio data against the audio samples, the position of the obstacle, and the relative direction of the obstacle.

8. The method of claim 7, wherein the labelled audio samples comprise the audio samples, an audio sample identifier (ID), one or more positions associated with the audio samples, and a direction associated with the audio samples.

9. The method of claim 7, wherein the obstacle is an emergency vehicle and the emitted sound is a siren sound.

10. The method of claim 7, wherein the audio samples are manually labelled by a user of the ADV.

11. The method of claim 7, wherein the performance profile is stored locally in a persistent storage device in the ADV.

12. A system for onboard validation of labelled audio data, comprising: a processor; and a memory coupled to the processor to store instructions, which when executed by the processor, cause the processor to perform operations, the operations including recording a sound emitted by an obstacle within a driving environment of an autonomous driving vehicle (ADV) to create audio samples; determining a position of the obstacle and a relative direction of the obstacle based on sensors data provided by visual sensors of the ADV; and using the audio samples, the position of the obstacle, and the relative direction of the obstacle to generate a performance profile of refined labelled audio data, wherein the refined labelled audio data is generated by labelling the audio samples and refining the labelled audio samples, and is utilized to subsequently train a machine learning algorithm to recognize a sound source during autonomous driving of the ADV, wherein using the audio samples, the position of the obstacle, and the relative direction of the obstacle to generate the performance profile comprises profiling the refined labelled audio data against the audio samples, the position of the obstacle, and the relative direction of the obstacle.

13. The system of claim 12, wherein the labelled audio samples comprise the audio samples, an audio sample identifier (ID), one or more positions associated with the audio samples, and a direction associated with the audio samples.

14. The system of claim 12, wherein the obstacle is an emergency vehicle and the emitted sound is a siren sound.

15. The system of claim 12, wherein the audio samples are manually labelled by a user of the ADV.

16. The system of claim 12, wherein the performance profile is stored locally in a persistent storage device in the ADV.

17. A non-transitory machine-readable medium having instructions stored therein, which when executed by a processor of an autonomous driving vehicle (ADV), cause the ADV to perform operations, the operations comprising: recording a sound emitted by an obstacle within the ADV to create audio samples; determining a position of the obstacle and a relative direction of the obstacle based on sensors data provided by visual sensors of the ADV; and using the audio samples, the position of the obstacle, and the relative direction of the obstacle to generate a performance profile of refined labelled audio data, wherein the refined labelled audio data is generated by labelling the audio samples and refining the labelled audio samples, and is utilized to subsequently train a machine learning algorithm to recognize a sound source during autonomous driving of the ADV, wherein using the audio samples, the position of the obstacle, and the relative direction of the obstacle to generate the performance profile comprises profiling the refined labelled audio data against the audio samples, the position of the obstacle, and the relative direction of the obstacle.

18. The non-transitory machine-readable medium of claim 17, wherein the labelled audio samples comprise the audio samples, an audio sample identifier (ID), one or more positions associated with the audio samples, and a direction associated with the audio samples.

19. The non-transitory machine-readable medium of claim 17, wherein the obstacle is an emergency vehicle and the emitted sound is a siren sound.

20. The non-transitory machine-readable medium of claim 17, wherein the audio samples are manually labelled by a user of the ADV.

21. The non-transitory machine-readable medium of claim 17, wherein the performance profile is stored locally in a persistent storage de
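One way to picture "profiling the refined labelled audio data against ... the position of the object, and the relative direction of the object" is to compare each labelled direction with a direction derived from the perception system's position estimate. The sketch below is an assumption, not the claimed method: the claims do not define the contents of a performance profile, and the coordinate convention (x forward, y left, in the ADV frame), the `build_profile` helper, the `ProfileEntry` record, and the 15-degree tolerance are all hypothetical.

```python
import math
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class ProfileEntry:
    """Hypothetical per-sample row of a performance profile."""
    sample_id: str
    labelled_direction_deg: float
    perceived_direction_deg: float
    error_deg: float


def relative_direction_deg(obj_xy: Tuple[float, float]) -> float:
    """Bearing of the object in the assumed ADV frame (x forward, y left)."""
    x, y = obj_xy
    return math.degrees(math.atan2(y, x))


def angular_error_deg(a: float, b: float) -> float:
    """Smallest absolute difference between two angles, in [0, 180]."""
    return abs((a - b + 180.0) % 360.0 - 180.0)


def build_profile(labels: List[Tuple[str, float]],
                  perceived_positions: List[Tuple[float, float]],
                  tolerance_deg: float = 15.0) -> Dict[str, object]:
    """Compare labelled directions with perception-derived directions."""
    entries = []
    for (sample_id, labelled_dir), pos in zip(labels, perceived_positions):
        perceived = relative_direction_deg(pos)
        err = angular_error_deg(labelled_dir, perceived)
        entries.append(ProfileEntry(sample_id, labelled_dir, perceived, err))
    n = len(entries)
    return {
        "entries": entries,
        "mean_error_deg": sum(e.error_deg for e in entries) / n if n else 0.0,
        "within_tolerance": sum(e.error_deg <= tolerance_deg for e in entries) / n if n else 0.0,
    }


# Two labelled samples: (sample ID, labelled direction in degrees).
labels = [("siren-0000", 90.0), ("siren-0001", -170.0)]
# Object positions reported by perception for the same samples.
positions = [(0.0, 5.0), (-5.0, 0.0)]
profile = build_profile(labels, positions)
```

The wrap-around in `angular_error_deg` matters for the second sample: a label of -170 degrees and a perceived bearing of 180 degrees differ by 10 degrees, not 350. Such a profile could then be written to local persistent storage, as claims 6, 11, and 16 describe.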

Assignees

Baidu USA LLC

Inventors

Classifications

  • involving a learning process · CPC title

  • using a video camera in combination with image processing means · CPC title

  • using acoustic signals, e.g. ultra-sonic signals (sonar systems designed for anti-collision purposes G01S15/93) · CPC title

  • G10L25/51 · Primary

    for comparison or discrimination · CPC title

  • Machine learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11735205B2 cover?
Systems and methods for generating labelled audio data and onboard validation of the labelled audio data utilizing an autonomous driving vehicle (ADV) while the ADV is operating within a driving environment are disclosed. The method includes recording a sound emitted by an object within the driving environment of the ADV, and converting the recorded sound into audio samples. The method further …
Who is the assignee on this patent?
Baidu USA LLC
What technology area does this patent fall under?
Primary CPC classification G10L25/51. Mapped technology areas include Physics.
When was this patent published?
Publication date: Aug 22, 2023 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).