Method and system for an acoustic based anomaly detection in industrial machines

US12504402B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12504402-B2
Application numberUS-202318372300-A
CountryUS
Kind codeB2
Filing dateSep 25, 2023
Priority dateNov 3, 2022
Publication dateDec 23, 2025
Grant dateDec 23, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

In industrial inspection scenarios, early detection of machine malfunction is extremely essential as it helps in preventing any significant damage and the associated economic losses. Embodiments herein provide a method and system for an acoustic based anomaly detection in industrial machines using a beamforming and a sequential transform learning. Herein, the system employs two-stage multi-channel source separation technique that uses the well-known delay and sum beamforming followed by a recent data-driven sequential transform learning (STL) approach to obtain clean sources. The STL is a solution to linear state-space model where operators/matrices are learnt from data and is used here to model the dynamics of time-varying source signals for source separation. Subsequently, a reference template matching is employed on each separated source to detect an anomaly. The numerical results obtained with the Malfunctioning Industrial Machine Investigation and Inspection (MIMII) dataset demonstrate superior performance for source separation and anomaly detection.

First claim

Opening claim text (preview).

What is claimed is: 1 . A processor-implemented method for an acoustic based anomaly detection in an industrial machine comprising: receiving, via a microphone array, a mixed audio signal of one or more spatially distributed audio sources, wherein the mixed audio signal includes noises; beamforming, via one or more hardware processors, the received mixed audio signal using a delay-and-sum beamforming technique to obtain a constructive superposition of the audio signal along a predefined direction; estimating, via the one or more hardware processors, at least one clean audio source of the one or more spatially distributed audio sources using a pre-trained sequential transform learning (STL), wherein the STL is used to model dynamics of the time-varying source signals for source separation; determining, via the one or more hardware processors, a change in the estimated at least one clean audio source of the one or more spatially distributed audio sources obtained using STL and beamforming by comparing with a template of a normal audio source of the same industrial machine; and detecting, via the one or more hardware processors, one or more abnormalities in the industrial machine based on the change determined in the associated clean audio source that is below a predefined threshold in terms of signal to noise ratio (SNR). 2 . The processor-implemented method of claim 1 , wherein the sequential transform learning (STL) model is trained on a mixture of normal audio of each of the one or more spatially distributed audio sources. 3 . The processor-implemented method of claim 1 , wherein the mixed audio signal received by each microphone of the microphone array is associated with different delays. 4 . The processor-implemented method of claim 1 , wherein the delay depends upon the spatial location of the audio source. 5 . The processor-implemented method of claim 1 , wherein the predefined threshold in terms of signal to noise ratio (SNR) is empirically calculated for each machine. 6 . A system for an acoustic based anomaly detection in an industrial machine comprising: an input/output interface; one or more hardware processors; a memory in communication with the one or more hardware processors, wherein the one or more hardware processors are configured to execute programmed instructions stored in the memory, to: receive a mixed audio signal of one or more spatially distributed audio sources via a microphone array, wherein the mixed audio signal includes noises; beamform the received mixed audio signal using a delay-and-sum beamforming technique to obtain a constructive superposition of the audio signal along a predefined direction; estimate at least one clean audio source of the one or more spatially distributed audio sources using a pretrained sequential transform learning (STL), wherein the STL is used to model dynamics of the time-varying source signals for source separation; determine a change in the estimated at least one clean audio source of the one or more spatially distributed audio sources obtained using STL and beamforming by comparing with a template of a normal audio source of the same industrial machine; and detect one or more abnormalities in the industrial machines based on the change determined in the associated clean audio source that is below a predefined threshold in terms of signal to noise ratio (SNR). 7 . The system of claim 6 , wherein the sequential transform learning (STL) model is trained on a mixture of normal audio of each of the one or more spatially distributed audio sources. 8 . The system of claim 6 , wherein the mixed audio signal received by each microphone of the microphone array is associated with different delays. 9 . The system of claim 6 , wherein the delay depends upon the spatial location of the audio source. 10 . The system of claim 6 , wherein the predefined threshold in terms of signal to noise ratio (SNR) is empirically calculated for each machine. 11 . One or more non-transitory machine-readable information storage mediums comprising one or more instructions which when executed by one or more hardware processors cause an acoustic based anomaly detection in an industrial machine comprising: receiving, via a microphone array, a mixed audio signal of one or more spatially distributed audio sources, wherein the mixed audio signal includes noises; beamforming the received mixed audio signal using a delay-and-sum beamforming technique to obtain a constructive superposition of the audio signal along a predefined direction; estimating at least one clean audio source of the one or more spatially distributed audio sources using a pre-trained sequential transform learning (STL), wherein the STL is used to model dynamics of the time-varying source signals for source separation; determining a change in the estimated at least one clean audio source of the one or more spatially distributed audio sources obtained using STL and beamforming by comparing with a template of a normal audio source of the same industrial machine; and detecting one or more abnormalities in the industrial machine based on the change determined in the associated clean audio source that is below a predefined threshold in terms of signal to noise ratio (SNR). 12 . The one or more non-transitory machine-readable information storage mediums of claim 11 , wherein the sequential transform learning (STL) model is trained on a mixture of normal audio of each of the one or more spatially distributed audio sources. 13 . The one or more non-transitory machine-readable information storage mediums of claim 11 , wherein the mixed audio signal received by each microphone of the microphone array is associated with different delays. 14 . The one or more non-transitory machine-readable information storage mediums of claim 11 , wherein the delay depends upon the spatial location of the audio source. 15 . The one or more non-transitory machine-readable information storage mediums of claim 11 , wherein the predefined threshold in terms of signal to noise ratio (SNR) is empirically calculated for each machine.

Assignees

Inventors

Classifications

  • Structural degradation, e.g. fatigue of composites, ageing of oils · CPC title

  • with a model, e.g. best-fit, regression analysis · CPC title

  • using acoustic emission techniques {(echo of particles G01N29/046; measuring mechanical vibrations or acoustic waves in solids in general G01H1/00)} · CPC title

  • Defect imaging, localisation and sizing using, e.g. time of flight diffraction [TOFD], synthetic aperture focusing technique [SAFT], Amplituden-Laufzeit-Ortskurven [ALOK] technique · CPC title

  • by electronic orientation or focusing, e.g. with phased arrays (phased arrays per se G10K11/34) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12504402B2 cover?
In industrial inspection scenarios, early detection of machine malfunction is extremely essential as it helps in preventing any significant damage and the associated economic losses. Embodiments herein provide a method and system for an acoustic based anomaly detection in industrial machines using a beamforming and a sequential transform learning. Herein, the system employs two-stage multi-chan…
Who is the assignee on this patent?
Tata Consultancy Services Ltd
What technology area does this patent fall under?
Primary CPC classification G01N29/043. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 23 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).