Acoustic event detection

US12308045B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12308045-B2
Application numberUS-202318243804-A
CountryUS
Kind codeB2
Filing dateSep 8, 2023
Priority dateDec 10, 2021
Publication dateMay 20, 2025
Grant dateMay 20, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system may include a first acoustic event detection (AED) component configured to detect a predetermined set of acoustic events, and include a second AED component configured to detect custom acoustic events that a user configures a device to detect. The first and second AED components are configured to perform task-specific processing, and may receive as input the same acoustic feature data corresponding to audio data that potentially represents occurrence of one or more events. Based on processing by the first and second AED components, a device may output data indicating that one or more acoustic events occurred, where the acoustic events may be a predetermined acoustic event and/or a custom acoustic event.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method, comprising: receiving first audio data representing occurrence of a first acoustic event; determining first encoded data representing the first acoustic event; receiving second audio data representing audio detected by a device; determining, by processing the second audio data and the first encoded data using a first acoustic event detection (AED) component configured to detect occurrence of one or more acoustic events from a first set of acoustic events, first event detection data representing a first likelihood that at least one acoustic event from the first set of acoustic events is represented in the second audio data; determining, by processing the first audio data using a second AED component configured to detect occurrence of one or more acoustic events from a second set of acoustic events, second event detection data representing a second likelihood that at least one acoustic event from the second set of acoustic events is represented in the second audio data; determining, based at least in part on the first event detection data and the second event detection data, that at least one of the first acoustic event from the first set of acoustic events or a second acoustic event from the second set of acoustic events is represented in the second audio data; and determining output data indicating that at least one of the first acoustic event or the second acoustic event occurred. 2. The computer-implemented method of claim 1 , further comprising: determining, using the second audio data, acoustic feature data, wherein processing the second audio data using the first AED component comprises processing the first acoustic feature data using the first AED component, and wherein processing the second audio data using the second AED component comprises processing the acoustic feature data using the second AED component. 3. The computer-implemented method of claim 1 , wherein the first AED component comprises a machine learning component. 4. The computer-implemented method of claim 1 , wherein the second AED component comprises a machine learning component. 5. The computer-implemented method of claim 1 , wherein the device is associated with a profile and wherein the method further comprises: storing the first encoded data in a manner associated with the profile. 6. The computer-implemented method of claim 1 , wherein processing by the second AED component comprises processing the first encoded data with respect to second encoded data corresponding to the second audio data. 7. A system comprising: at least one processor; and at least one memory comprising instructions that, when executed by the at least one processor, cause the system to: receive first audio data representing occurrence of a first acoustic event; determine first encoded data representing the first acoustic event; receive second audio data representing audio detected by a device; determine, by processing the second audio data and the first encoded data using a first acoustic event detection (AED) component configured to detect occurrence of one or more acoustic events from a first set of acoustic events, first event detection data representing a first likelihood that at least one acoustic event from the first set of acoustic events is represented in the second audio data; determine, by processing the second audio data using a second AED component configured to detect occurrence of one or more acoustic events from a second set of acoustic events, second event detection data representing a second likelihood that at least one acoustic event from the second set of acoustic events is represented in the second audio data; determine, based at least in part on the first event detection data and the second event detection data, that at least one of the first acoustic event from the first set of acoustic events or a second acoustic event from the second set of acoustic events is represented in the second audio data; and determine output data indicating that at least one of the first acoustic event or the second acoustic event occurred. 8. The system of claim 7 , wherein the at least one memory further comprises instructions that, when executed by the at least one processor, further cause the system to: determine, using the second audio data, acoustic feature data, wherein the instructions that cause the system to process the second audio data using the first AED component comprise instructions that, when executed by the at least one processor, cause the system to process the acoustic feature data using the first AED component, and wherein the instructions that cause the system to process the second audio data using the second AED component comprise instructions that, when executed by the at least one processor, cause the system to process the first acoustic feature data using the second AED component. 9. The system of claim 7 , wherein the first AED component comprises a machine learning component. 10. The system of claim 7 , wherein the second AED component comprises a machine learning component. 11. The system of claim 7 , wherein the device is associated with a profile and wherein the at least one memory further comprises instructions that, when executed by the at least one processor, further cause the system to: store the first encoded data in a manner associated with the profile. 12. The system of claim 7 , wherein the instructions that cause the system to process by the second AED component comprise instructions that, when executed by the at least one processor, cause the system to process, by the second AED component, the first encoded data with respect to second encoded data corresponding to the second audio data. 13. The system of claim 7 , wherein: processing by the first AED component is performed by the device; and processing by the second AED component is performed by the device. 14. The computer-implemented method of claim 1 , wherein the first set of acoustic events include a custom set of events associated with the device. 15. The computer-implemented method of claim 1 , wherein the second set of acoustic events include a predetermined set of events. 16. The computer-implemented method of claim 1 , wherein the first AED component is a comparison-based AED component. 17. The computer-implemented method of claim 1 , wherein the second AED component is a classifier-based AED component. 18. The computer-implemented method of claim 1 , wherein the first AED component is configured by a user. 19. The system of claim 7 , wherein the first set of acoustic events include a custom set of events associated with the device. 20. The system of claim 7 , wherein the second set of acoustic events include a predetermined set of events.

Assignees

Inventors

Classifications

  • Execution procedure of a spoken command · CPC title

  • Word spotting · CPC title

  • using neural networks · CPC title

  • the extracted parameters being power information · CPC title

  • Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12308045B2 cover?
A system may include a first acoustic event detection (AED) component configured to detect a predetermined set of acoustic events, and include a second AED component configured to detect custom acoustic events that a user configures a device to detect. The first and second AED components are configured to perform task-specific processing, and may receive as input the same acoustic feature data …
Who is the assignee on this patent?
Amazon Tech Inc
What technology area does this patent fall under?
Primary CPC classification G10L25/51. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 20 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).