Method and system for acoustic based industrial machine inspection using das-beamforming and dictionary learning
US-2024288340-A1 · Aug 29, 2024 · US
US12504402B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12504402-B2 |
| Application number | US-202318372300-A |
| Country | US |
| Kind code | B2 |
| Filing date | Sep 25, 2023 |
| Priority date | Nov 3, 2022 |
| Publication date | Dec 23, 2025 |
| Grant date | Dec 23, 2025 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
In industrial inspection scenarios, early detection of machine malfunction is extremely essential as it helps in preventing any significant damage and the associated economic losses. Embodiments herein provide a method and system for an acoustic based anomaly detection in industrial machines using a beamforming and a sequential transform learning. Herein, the system employs two-stage multi-channel source separation technique that uses the well-known delay and sum beamforming followed by a recent data-driven sequential transform learning (STL) approach to obtain clean sources. The STL is a solution to linear state-space model where operators/matrices are learnt from data and is used here to model the dynamics of time-varying source signals for source separation. Subsequently, a reference template matching is employed on each separated source to detect an anomaly. The numerical results obtained with the Malfunctioning Industrial Machine Investigation and Inspection (MIMII) dataset demonstrate superior performance for source separation and anomaly detection.
Opening claim text (preview).
What is claimed is: 1 . A processor-implemented method for an acoustic based anomaly detection in an industrial machine comprising: receiving, via a microphone array, a mixed audio signal of one or more spatially distributed audio sources, wherein the mixed audio signal includes noises; beamforming, via one or more hardware processors, the received mixed audio signal using a delay-and-sum beamforming technique to obtain a constructive superposition of the audio signal along a predefined direction; estimating, via the one or more hardware processors, at least one clean audio source of the one or more spatially distributed audio sources using a pre-trained sequential transform learning (STL), wherein the STL is used to model dynamics of the time-varying source signals for source separation; determining, via the one or more hardware processors, a change in the estimated at least one clean audio source of the one or more spatially distributed audio sources obtained using STL and beamforming by comparing with a template of a normal audio source of the same industrial machine; and detecting, via the one or more hardware processors, one or more abnormalities in the industrial machine based on the change determined in the associated clean audio source that is below a predefined threshold in terms of signal to noise ratio (SNR). 2 . The processor-implemented method of claim 1 , wherein the sequential transform learning (STL) model is trained on a mixture of normal audio of each of the one or more spatially distributed audio sources. 3 . The processor-implemented method of claim 1 , wherein the mixed audio signal received by each microphone of the microphone array is associated with different delays. 4 . The processor-implemented method of claim 1 , wherein the delay depends upon the spatial location of the audio source. 5 . The processor-implemented method of claim 1 , wherein the predefined threshold in terms of signal to noise ratio (SNR) is empirically calculated for each machine. 6 . A system for an acoustic based anomaly detection in an industrial machine comprising: an input/output interface; one or more hardware processors; a memory in communication with the one or more hardware processors, wherein the one or more hardware processors are configured to execute programmed instructions stored in the memory, to: receive a mixed audio signal of one or more spatially distributed audio sources via a microphone array, wherein the mixed audio signal includes noises; beamform the received mixed audio signal using a delay-and-sum beamforming technique to obtain a constructive superposition of the audio signal along a predefined direction; estimate at least one clean audio source of the one or more spatially distributed audio sources using a pretrained sequential transform learning (STL), wherein the STL is used to model dynamics of the time-varying source signals for source separation; determine a change in the estimated at least one clean audio source of the one or more spatially distributed audio sources obtained using STL and beamforming by comparing with a template of a normal audio source of the same industrial machine; and detect one or more abnormalities in the industrial machines based on the change determined in the associated clean audio source that is below a predefined threshold in terms of signal to noise ratio (SNR). 7 . The system of claim 6 , wherein the sequential transform learning (STL) model is trained on a mixture of normal audio of each of the one or more spatially distributed audio sources. 8 . The system of claim 6 , wherein the mixed audio signal received by each microphone of the microphone array is associated with different delays. 9 . The system of claim 6 , wherein the delay depends upon the spatial location of the audio source. 10 . The system of claim 6 , wherein the predefined threshold in terms of signal to noise ratio (SNR) is empirically calculated for each machine. 11 . One or more non-transitory machine-readable information storage mediums comprising one or more instructions which when executed by one or more hardware processors cause an acoustic based anomaly detection in an industrial machine comprising: receiving, via a microphone array, a mixed audio signal of one or more spatially distributed audio sources, wherein the mixed audio signal includes noises; beamforming the received mixed audio signal using a delay-and-sum beamforming technique to obtain a constructive superposition of the audio signal along a predefined direction; estimating at least one clean audio source of the one or more spatially distributed audio sources using a pre-trained sequential transform learning (STL), wherein the STL is used to model dynamics of the time-varying source signals for source separation; determining a change in the estimated at least one clean audio source of the one or more spatially distributed audio sources obtained using STL and beamforming by comparing with a template of a normal audio source of the same industrial machine; and detecting one or more abnormalities in the industrial machine based on the change determined in the associated clean audio source that is below a predefined threshold in terms of signal to noise ratio (SNR). 12 . The one or more non-transitory machine-readable information storage mediums of claim 11 , wherein the sequential transform learning (STL) model is trained on a mixture of normal audio of each of the one or more spatially distributed audio sources. 13 . The one or more non-transitory machine-readable information storage mediums of claim 11 , wherein the mixed audio signal received by each microphone of the microphone array is associated with different delays. 14 . The one or more non-transitory machine-readable information storage mediums of claim 11 , wherein the delay depends upon the spatial location of the audio source. 15 . The one or more non-transitory machine-readable information storage mediums of claim 11 , wherein the predefined threshold in terms of signal to noise ratio (SNR) is empirically calculated for each machine.
Structural degradation, e.g. fatigue of composites, ageing of oils · CPC title
with a model, e.g. best-fit, regression analysis · CPC title
using acoustic emission techniques {(echo of particles G01N29/046; measuring mechanical vibrations or acoustic waves in solids in general G01H1/00)} · CPC title
Defect imaging, localisation and sizing using, e.g. time of flight diffraction [TOFD], synthetic aperture focusing technique [SAFT], Amplituden-Laufzeit-Ortskurven [ALOK] technique · CPC title
by electronic orientation or focusing, e.g. with phased arrays (phased arrays per se G10K11/34) · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.