Sound event detection

US2016335488A1 · US · A1

Patent metadata
Field | Value
Publication number | US-2016335488-A1
Application number | US-201514713619-A
Country | US
Kind code | A1
Filing date | May 15, 2015
Priority date | May 15, 2015
Publication date | Nov 17, 2016
Grant date | (none listed)

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system and method for the use of sensors and processors of existing, distributed systems, operating individually or in cooperation with other systems, networks or cloud-based services to enhance the detection and classification of sound events in an environment (e.g., a home), while having low computational complexity. The system and method provides functions where the most relevant features that help in discriminating sounds are extracted from an audio signal and then classified depending on whether the extracted features correspond to a sound event that should result in a communication to a user. Threshold values and other variables can be determined by training on audio signals of known sounds in defined environments, and implemented to distinguish human and pet sounds from other sounds, and compensate for variations in the magnitude of the audio signal, different sizes and reverberation characteristics of the environment, and variations in microphone responses.
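The abstract says thresholds can be learned from recordings of known sounds and that the system compensates for variations in signal magnitude. The following is a minimal sketch of one way that could look, not the method disclosed in the patent: it assumes a single scalar feature per example, a midpoint search for the cut with the fewest labeling errors, and a sum-to-one normalization as the loudness compensation. The names `normalize` and `train_threshold` are hypothetical.

```python
def normalize(band_values):
    """Scale a feature vector so its entries sum to 1.

    Assumption: dividing by the total makes the vector independent of
    overall loudness, one simple form of magnitude compensation.
    """
    total = sum(band_values)
    if total == 0:
        return [0.0 for _ in band_values]
    return [v / total for v in band_values]


def train_threshold(occupancy_values, other_values):
    """Pick the cut between adjacent observed values with the fewest errors.

    Values above the threshold are treated as occupancy sounds.
    Requires at least two distinct training values overall.
    """
    values = sorted(set(occupancy_values) | set(other_values))
    candidates = [(a + b) / 2 for a, b in zip(values, values[1:])]

    def errors(threshold):
        missed = sum(v <= threshold for v in occupancy_values)
        false_alarms = sum(v > threshold for v in other_values)
        return missed + false_alarms

    # min() returns the first candidate with the lowest error count.
    return min(candidates, key=errors)


if __name__ == "__main__":
    # Hypothetical training data: feature values for labeled recordings.
    occupancy = [5.0, 6.5, 8.0]
    other = [0.5, 1.0, 2.0, 5.5]
    print(train_threshold(occupancy, other))
    print(normalize([2.0, 6.0]), normalize([20.0, 60.0]))
```

With the made-up data above, the two normalized vectors are identical even though one input is ten times louder, which is the property the abstract attributes to magnitude compensation. A real system would train per environment and per microphone, as the abstract notes.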

First claim

Opening claim text (preview).

1. An environmental data monitoring and reporting system, comprising: a device sensor that detects sound in an area and generates an audio signal based on the detected sound; a device processor communicatively coupled to the device sensor, wherein the processor is configured to convert the audio signal received from the device sensor into low-resolution audio signal data and analyze the audio signal data, at the device processor level, to identify the detected sound as an area human or pet occupancy-related sound and provide a communication regarding the detected occupancy-related sound; and a device communication interface communicatively coupled to the device processor, wherein the communication interface is configured to send the communication regarding the detected occupancy-related sound, wherein the device sensor, device processor and device communication interface are integrated into a single premises management device.

2. The system of claim 1, wherein the processor is configured to: perform a frequency domain conversion of the audio signal data and extract low-resolution feature vectors that distinguish detected sounds; determine state transition conditions by comparing the low-resolution feature vectors to threshold values that distinguish sound categories and generate outputs indicating occurrences of distinguished sound categories; and detect the occurrence of a sound category indicating an area human or pet occupancy and generating a user message in response.

3. The system of claim 2, further comprising a Fast Fourier Transform element, controlled by the processor, to perform the frequency domain conversion of the audio signal data, on a frame-by-frame basis.

4. The system of claim 2, further comprising: a plurality of bandwidth filters, controlled by the processor, to divide the bands of the frequency domain conversion; a plurality of median filters, controlled by the processor, to filter a sample length of the divided bands; a plurality of range filters, controlled by the processor, to filter a range of the sample lengths; and a plurality of summers, controlled by the processor, to subtract a minimum sample range value from a maximum sample range value to calculate the plurality of low-resolution feature vectors that distinguish detected sounds, on a frame-by-frame basis.

5. The system of claim 2, further comprising: a state classifier element, controlled by the processor, to determine the transition conditions by comparing the plurality of low-resolution feature vectors to threshold values and generate the outputs indicating the occurrences of distinguished sound categories, on a frame-by-frame basis.

6. The system of claim 5, wherein the processor is configured to train on audio signal data of known sound categories in defined areas to determine threshold values that distinguish sound categories and that compensate for audio signal data, area and sensor variations.

7. The system of claim 2, further comprising: a detector element, controlled by the processor, to detect the occurrence of the sound category indicating an area human or pet occupancy; and the communication interface, controlled by the processor, to communicate a user message in response to the detected occurrence of the sound category indicating an area human or pet occupancy.

8. The system of claim 7, wherein the detector element is configured to analyze each output indicating an occurrence of a sound category as received to detect an output denoting an occurrence of the sound category indicating an area human or pet occupancy.

9. The system of claim 7, wherein the detector element is configured to analyze a set of outputs indicating occurrences of sound categories to detect the first output of the set denoting an occurrence of the sound category indicating an area human or pet occupancy.

10. The system of claim 7, wherein the detector element is configured to statistically analyze a set of outputs indicating occurrences of sound categories to detect a likelihood of an occurrence of the sound category indicating an area human or pet occupancy.

11. An environmental data monitoring and reporting system, comprising: a device sensor that detects a condition in an area and generates a signal based on the detected condition; a device processor communicatively coupled to the device sensor, wherein the processor is configured to convert the signal received from the sensor into low-resolution signal data and analyze the signal data, at the processor level, by: performing a frequency domain conversion of the signal data and extracting low-resolution feature vectors that distinguish detected conditions, comparing the low-resolution feature vectors to threshold values that distinguish condition categories, generating outputs indicating occurrences of distinguished condition categories, and detecting the occurrence of a condition category indicating an area human or pet occupancy and generating a user message in response; and a device communication interface communicatively coupled to the device processor, wherein the communication interface is configured to send the user message regarding the detected occupancy-related condition, wherein the device sensor, device processor and device communication interface are integrated into a single premises management device.

12. The system of claim 11, further comprising: a Fast Fourier Transform element, controlled by the processor, to perform the frequency domain conversion of the signal data; a plurality of bandwidth filters, controlled by the processor, to divide the bands of the frequency domain conversion; a plurality of median filters, controlled by the processor, to filter a sample length of the divided bands; a plurality of range filters, controlled by the processor, to filter a range of the sample lengths; and a plurality of summers, controlled by the processor, to subtract a minimum sample range value from a maximum sample range value to calculate the plurality of low-resolution feature vectors that distinguish detected conditions.

13. The system of claim 11, further comprising: a state classifier element, controlled by the processor, to compare the plurality of low-resolution feature vectors to threshold values and generate the outputs indicating the occurrences of distinguished condition categories.

14. The system of claim 13, wherein the processor is configured to train on audio signal data of known condition categories in defined areas to determine threshold values that distinguish condition categories and that compensate for signal data, area and sensor variations.

15. The system of claim 11, further comprising: a detector element, controlled by the processor, to detect the occurrence of the condition category indicating an area human or pet occupancy; and the communication interface, controlled by the processor, to communicate a user message in response to the detected occurrence of the condition category indicating an area human or pet occupancy.

16. A method for controlling an environmental data monitoring and reporting system, comprising: detecting sound in an area and generating an audio signal based on the detected sound; converting the audio signal into low-resolution audio signal data and analyzing the audio signal data, at a device processor level, to identify the detected sound as an area human or pet occupancy-related sound and provide a communication regarding the detected occupancy-related sound; and sending the communication regarding the detected occupancy-related sound,
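Claims 3 to 5 and 8 to 10 describe a frame-by-frame chain: frequency conversion, band splitting, median filtering, a range (maximum minus minimum) feature, threshold comparison, and three detector variants. The sketch below is one hedged reading of that chain in plain Python, not the patented implementation. The frame length, number of bands, window lengths, threshold values, the "any band exceeds its threshold" rule, and every function name are assumptions made for illustration. A naive DFT stands in for the Fast Fourier Transform element to keep the example dependency-free.

```python
import cmath
import math
from statistics import median

OCCUPANCY, OTHER = "occupancy", "other"


def dft_magnitudes(frame):
    """Magnitude spectrum of one frame (naive DFT, lower half of the bins)."""
    n = len(frame)
    return [abs(sum(x * cmath.exp(-2j * math.pi * k * t / n)
                    for t, x in enumerate(frame)))
            for k in range(n // 2)]


def band_energies(mags, num_bands):
    """Sum the spectrum over equal-width bands (the 'bandwidth filters')."""
    width = len(mags) // num_bands
    return [sum(mags[b * width:(b + 1) * width]) for b in range(num_bands)]


def trailing(values, length, fn):
    """Apply fn to each trailing window of up to `length` samples."""
    return [fn(values[max(0, i - length + 1):i + 1])
            for i in range(len(values))]


def range_features(frames, num_bands=4, median_len=3, range_len=5):
    """Per frame, one max-minus-min value per band after median smoothing."""
    per_frame = [band_energies(dft_magnitudes(f), num_bands) for f in frames]
    per_band = []
    for b in range(num_bands):
        track = [energies[b] for energies in per_frame]
        smoothed = trailing(track, median_len, median)
        per_band.append(trailing(smoothed, range_len,
                                 lambda w: max(w) - min(w)))
    return [list(column) for column in zip(*per_band)]


def classify(feature_vector, thresholds):
    """Assumed rule: occupancy if any band's range exceeds its threshold."""
    hit = any(f > t for f, t in zip(feature_vector, thresholds))
    return OCCUPANCY if hit else OTHER


def is_occupancy(label):
    """Claim 8 style: judge each output as it is received."""
    return label == OCCUPANCY


def first_occupancy(labels):
    """Claim 9 style: index of the first occupancy output in a set, or None."""
    return next((i for i, lab in enumerate(labels) if lab == OCCUPANCY), None)


def occupancy_likelihood(labels):
    """Claim 10 style: fraction of outputs in a set that indicate occupancy."""
    return sum(lab == OCCUPANCY for lab in labels) / len(labels)


if __name__ == "__main__":
    silent = [0.0] * 16
    tone = [math.sin(2 * math.pi * 2 * t / 16) for t in range(16)]
    frames = [silent] * 4 + [tone] * 2 + [silent] * 4
    labels = [classify(f, [4.0] * 4) for f in range_features(frames)]
    print(labels, first_occupancy(labels), occupancy_likelihood(labels))
```

In this toy run a two-frame tone burst survives the length-3 median filter and trips the threshold from the sixth frame onward, while a single-frame click would be suppressed by the same median filter. That is a plausible reason for placing a median stage ahead of the range stage, though the claims themselves do not state the rationale.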

Assignees

Google Inc

Inventors

Classifications

  • using sonic detecting means, e.g. a microphone operating in the audio frequency range · CPC title

  • G10L25/51 · Primary

    for comparison or discrimination · CPC title

  • the extracted parameters being spectral information of each sub-band · CPC title

  • Circuit arrangements · CPC title

  • Physics · mapped topic

Patent family

Related publications grouped by family.

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2016335488A1 cover?
A system and method for the use of sensors and processors of existing, distributed systems, operating individually or in cooperation with other systems, networks or cloud-based services to enhance the detection and classification of sound events in an environment (e.g., a home), while having low computational complexity. The system and method provides functions where the most relevant features …
Who is the assignee on this patent?
Google Inc
What technology area does this patent fall under?
Primary CPC classification G10L25/51. Mapped technology areas include Physics.
When was this patent published?
Publication date: Nov 17, 2016 (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).