Emergency sound localization

US11516613B1 · US · B1

Patent metadata
FieldValue
Publication numberUS-11516613-B1
Application numberUS-202017077746-A
CountryUS
Kind codeB1
Filing dateOct 22, 2020
Priority dateOct 22, 2020
Publication dateNov 29, 2022
Grant dateNov 29, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Techniques for determining information associated with sounds detected in an environment based on audio data are discussed herein. Audio sensors of a vehicle may determine audio data associated with sounds from the environment. Sounds may be caused by objects in the environment such as emergency vehicles, construction zones, non-emergency vehicles, humans, audio speakers, nature, etc. A model may determine a classification of the audio data and/or a probability value representing a likelihood that sound in the audio data is associated with the classification. A direction of arrival may be determined based on receiving classification values from multiple audio sensors of the vehicle, and other actions can be performed or the vehicle can be controlled based on the direction of arrival.

First claim

Opening claim text (preview).

What is claimed is: 1. A system comprising: one or more processors; and one or more non-transitory computer-readable media storing instructions executable by the one or more processors, wherein the instructions, when executed, cause the system to perform operations comprising: receiving first audio data from a first audio sensor associated with a vehicle; receiving second audio data from a second audio sensor associated with the vehicle; inputting the first audio data and the second audio data into a model; receiving, from the model, a first classification of the first audio data, the first classification including a first probability that the first audio data is associated with a sound class; receiving, from the model, a second classification of the second audio data, the second classification including a second probability that the second audio data is associated with the sound class; and based at least in part on the first probability that the first audio data is associated with the sound class and the second probability that the second audio data is associated with the sound class, at least one of: determining a direction of arrival associated with a source of the first audio data or the second audio data; or determining a level of computational processing associated with another model for determining data for the source of the first audio data or the second audio data, wherein the first classification and second classification comprises an emergency event or a non-emergency event. 2. The system of claim 1 , wherein the first classification comprises a first classification value representing a likelihood that the first audio data is associated with the first classification and the second classification comprises a second classification value representing a likelihood that the second audio data is associated with the second classification, and wherein the operations further comprise determining, based at least in part on a difference between the first classification and the second classification, the direction of arrival associated with the first audio data or the second audio data. 3. The system of claim 2 , the operations further comprising: receiving, as pose data, position data and orientation data associated the first audio sensor; and determining, based at least in part on the first classification value, the second classification value, and the pose data, the direction of arrival associated with the source of the first audio data or the second audio data. 4. The system of claim 3 , the operations further comprising: determining, based at least in part on the first classification of the second classification, that the first audio data or the second audio data is associated with the emergency event; and determining an action to control the vehicle in an environment based at least in part on the direction of arrival associated with the emergency event. 5. The system of claim 1 , wherein the sound class comprises one or more of: an emergency sound class, a non-emergency sound class, an engine sound class, a music sound class, a thundering sound class, an animal sound class, an ambulance siren sound class, a police siren sound class, a fire truck siren sound class, a power tool sound class, a reverse direction sound class, a honking sound class or a speech sound class. 6. A method comprising: receiving first audio data; receiving second audio data; inputting the first audio data and the second audio data into a model comprising one or more classifiers; receiving, from the model, a first classification value indicative of a likelihood that the first audio data is associated with a classifier of the one or more classifiers; receiving, from the model, a second classification value indicative of a likelihood that the second audio data is associated with the classifier of the one or more classifiers; and based at least in part on the first classification value and the second classification value, at least one of: determining a direction of arrival value associated with a source of the first audio data or the second audio data; or determining a level of computational processing associated with another model for determining data for the source of the first audio data or the second audio data, wherein the first classification and second classification comprises an emergency event or a non-emergency event. 7. The method of claim 6 , wherein the classifier comprises one or more of: an emergency sound class, a non-emergency sound class, an engine sound class, a music sound class, a thundering sound class, an animal sound class, an ambulance siren sound class, a police siren sound class, a fire truck siren sound class, a power tool sound class, a reverse direction sound class, a honking sound class or a speech sound class. 8. The method of claim 6 , further comprising: based at least in part on determining the level of computational processing, sending the first classification value and the second classification value to the other model; and causing the other model to determine a threshold associated with a sensor based at least in part on the first classification value and the second classification value. 9. The method of claim 6 , wherein determining the level of computational processing comprises determining an amount of computational processing or memory capacity to allocate during subsequent processing of an object associated with the first audio data or the second audio data. 10. The method of claim 6 , further comprising: determining an action to control a vehicle in an environment based at least in part on the direction of arrival value associated with the source of the first audio data or the second audio data. 11. The method of claim 9 , wherein: the first audio data is associated with a first audio sensor, the second audio data is associated with a second audio sensor, and determining the direction of arrival value associated with the first audio data or the second audio data is further based at least in part on one or more of position data or orientation data associated with at least one of the first audio sensor or the second audio sensor. 12. The method of claim 9 , wherein the direction of arrival value is a first direction of arrival value, the method further comprising: determining, based at least in part on at least one of: a magnitude difference between the first audio data and the second audio data or a time difference between the first audio data and the second audio data, a second direction of arrival value. 13. The method of claim 6 , wherein: the first audio data is associated with a first signal captured by a first audio sensor and a second signal captured by a second audio sensor, and the second audio data is associated with a third signal captured by a third audio sensor and a fourth signal captured by a fourth audio sensor. 14. The method of claim 10 , further comprising: determining that the first audio data or the second audio data are associated with the emergency event; and wherein controlling the vehicle comprises at least one of stopping the vehicle or determining a trajectory for the vehicle to follow in the environment. 15. The method of claim 6 , further comprising: identifying an intensity or a frequency of sound associated with the first audio data and the second audio data; determining the first classification value based at least in part on the intensity or the frequency of sound associated with the first audio data; and determining the second classification value based at least in part on the intensity or the frequency of so

Assignees

Inventors

Classifications

  • Acoustic transducers and sound field adaptation in vehicles · CPC title

  • Processing of the output signals of the acoustic transducers of an array for obtaining a desired directivity characteristic (H04R2203/12 takes precedence) · CPC title

  • Non-uniform arrays of transducers or a plurality of uniform arrays with different transducer spacing · CPC title

  • H04R1/406Primary

    microphones · CPC title

  • Classification techniques · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11516613B1 cover?
Techniques for determining information associated with sounds detected in an environment based on audio data are discussed herein. Audio sensors of a vehicle may determine audio data associated with sounds from the environment. Sounds may be caused by objects in the environment such as emergency vehicles, construction zones, non-emergency vehicles, humans, audio speakers, nature, etc. A model m…
Who is the assignee on this patent?
Zoox Inc
What technology area does this patent fall under?
Primary CPC classification H04R1/406. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Nov 29 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 9 related publications on this page (citations in our corpus or others sharing the same primary CPC).