Identifying The Position Of A Horn Honk Or Other Acoustical Information Using Multiple Autonomous Vehicles
US-2022024484-A1 · Jan 27, 2022 · US
US11430466B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11430466-B2 |
| Application number | US-202117248196-A |
| Country | US |
| Kind code | B2 |
| Filing date | Jan 13, 2021 |
| Priority date | Jan 13, 2021 |
| Publication date | Aug 30, 2022 |
| Grant date | Aug 30, 2022 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Systems and methods for sound source detection and localization utilizing an autonomous driving vehicle (ADV) are disclosed. The method includes receiving audio data from a number of audio sensors mounted on the ADV. The audio data comprises sounds captured by the audio sensors and emitted by one or more sound sources. Based on the received audio data, the method further includes determining a number of sound source information. Each sound source information comprises a confidence score associated with an existence of a specific sound. The method further includes generating a data representation to report whether there exists the specific sound within the driving environment of the ADV. The data representation comprises the determined sound source information. The received audio data and the generated data representation are utilized to subsequently train a machine learning algorithm to recognize the specific sound source during autonomous driving of the ADV in real-time.
Opening claim text (preview).
What is claimed is: 1. A computer-implemented method for sound source detection and localization utilizing an autonomous driving vehicle (ADV) while the ADV is operating within a driving environment, the method comprising: receiving audio data from a plurality of audio sensors mounted on the ADV, the audio data comprising sounds captured by the plurality of audio sensors and emitted by one or more sound sources; based on the received audio data, determining a plurality of sound source information, each sound source information comprising a confidence score associated with an existence of a specific sound; and generating a data representation to report whether there exists the specific sound within the driving environment of the ADV, the data representation comprising the determined plurality of sound source information; wherein the received audio data and the generated data representation are utilized to subsequently train a machine learning algorithm to recognize a specific sound source during autonomous driving of the ADV in real-time. 2. The method of claim 1 , wherein determining the plurality of sound source information comprises performing sound source localization with the plurality of audio sensors to determine at least one of: directions of the sound sources relative to their corresponding audio sensors, distances between the sound sources and their corresponding audio sensors, relative positions of the captured sounds, absolute positions of the captured sounds, approaching/departing statuses of the captured sounds, or intensities of the captured sounds associated with current timestamps. 3. The method of claim 2 , wherein each sound source information further comprises at least one of: a direction of a sound source relative to a corresponding audio sensor, a distance between the sound source and the corresponding audio sensor, a relative position of a captured sound, an absolute position of a captured sound, an approaching/departing status of a captured sound, or an intensity of a captured sound associated with a current timestamp. 4. The method of claim 3 , wherein the data representation is a grid including a plurality of regions that collectively cover the driving environment of the ADV, each region corresponding to an audio sensor from the plurality of audio sensors and reporting a vector of results indicating whether the specific sound exists in the region, the vector of results including a region identifier (ID) and one sound source information. 5. The method of claim 4 , wherein each region is configured to partially cover a particular size within the driving environment. 6. The method of claim 1 , wherein the sound sources are emergency vehicles, and the specific sound is a siren sound. 7. The method of claim 1 , wherein the confidence score is within a range of 0-1 value. 8. The method of claim 4 , wherein a center of the grid represents a position of the ADV. 9. A non-transitory machine-readable medium having instructions stored therein, which when executed by a processor, cause the processor to perform operations, the operations comprising: receiving audio data from a plurality of audio sensors mounted on an autonomous driving vehicle (ADV), the audio data comprising sounds captured by the plurality of audio sensors and emitted by one or more sound sources; based on the received audio data, determining a plurality of sound source information, each sound source information comprising a confidence score associated with an existence of a specific sound; and generating a data representation to report whether there exists the specific sound within a driving environment of the ADV, the data representation comprising the determined plurality of sound source information; wherein the received audio data and the generated data representation are utilized to subsequently train a machine learning algorithm to recognize a specific sound source during autonomous driving of the ADV in real-time. 10. The non-transitory machine-readable medium of claim 9 , wherein determining the plurality of sound source information comprises performing sound source localization with the plurality of audio sensors to determine at least one of: directions of the sound sources relative to their corresponding audio sensors, distances between the sound sources and their corresponding audio sensors, relative positions of the captured sounds, absolute positions of the captured sounds, approaching/departing statuses of the captured sounds, or intensities of the captured sounds associated with current timestamps. 11. The non-transitory machine-readable medium of claim 10 , wherein each sound source information further comprises at least one of: a direction of a sound source relative to a corresponding audio sensor, a distance between the sound source and the corresponding audio sensor, a relative position of a captured sound, an absolute position of a captured sound, an approaching/departing status of a captured sound, or an intensity of a captured sound associated with a current timestamp. 12. The non-transitory machine-readable medium of claim 11 , wherein the data representation is a grid including a plurality of regions that collectively cover the driving environment of the ADV, each region corresponding to an audio sensor from the plurality of audio sensors and reporting a vector of results indicating whether the specific sound exists in the region, the vector of results including a region identifier (ID) and one sound source information. 13. The non-transitory machine-readable medium of claim 12 , wherein each region is configured to partially cover a particular size within the driving environment. 14. The non-transitory machine-readable medium of claim 9 , wherein the sound sources are emergency vehicles, and the specific sound is a siren sound. 15. The non-transitory machine-readable medium of claim 9 , wherein the confidence score is within a range of 0-1 value. 16. The non-transitory machine-readable medium of claim 12 , wherein a center of the grid represents a position of the ADV. 17. A system for sound source detection and localization, comprising: a processor; and a memory coupled to the processor to store instructions, which when executed by the processor, cause the processor to perform operations, the operations including receiving audio data from a plurality of audio sensors mounted on an autonomous driving vehicle (ADV), the audio data comprising sounds captured by the plurality of audio sensors and emitted by one or more sound sources; based on the received audio data, determining a plurality of sound source information, each sound source information comprising a confidence score associated with an existence of a specific sound; and generating a data representation to report whether there exists the specific sound within a driving environment of the ADV, the data representation comprising the determined plurality of sound source information; wherein the received audio data and the generated data representation are utilized to subsequently train a machine learning algorithm to recognize a specific sound source during autonomous driving of the ADV in real-time. 18. The system of claim 17 , wherein determining the plurality of sound source information comprises performing sound source localization with the plurality of audio sensors to determine at least one of: directions of the sound sources relative to their corresponding audio sensors, distances between the sound sources and their corresponding audio sensors, relative positions of the captured sound
Position of source determined by co-ordinating a plurality of position lines defined by path-difference measurements (G01S5/28 takes precedence) · CPC title
Position of source determined by a plurality of spaced direction-finders · CPC title
by electric means · CPC title
Machine learning · CPC title
using ultrasonic, sonic or infrasonic waves · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.