Contextual anomaly detection across assets

US11271957B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11271957-B2
Application numberUS-201916526359-A
CountryUS
Kind codeB2
Filing dateJul 30, 2019
Priority dateJul 30, 2019
Publication dateMar 8, 2022
Grant dateMar 8, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods, systems, and computer program products for contextual anomaly detection across assets are provided herein. A method includes obtaining time-series data frames corresponding to assets; clustering the assets into one or more cohorts based on the time-series data frames, each cohort comprising assets having statistically similar time-series data frames; for each given asset within each cohort: applying a time-context window to the portion of the time-series data frames corresponding to the given asset to generate at least one transformed data frame, and determining an asset distribution for the given asset based on the at least one transformed data frame; determining one or more of that at least one of the assets within at least one of the cohorts is anomalous and that at least one of the cohorts is anomalous; and causing at least one remediation action to be performed based on the determining.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method comprising: obtaining time-series data frames corresponding to a plurality of assets; clustering the assets into one or more cohorts based on the time-series data frames, wherein each one of the cohorts comprises assets having statistically similar time-series data frames; for each given one of the assets within each one of the cohorts: (i) applying a time-context window to a portion of the time-series data frames corresponding to the given asset to generate at least one transformed data frame, and (ii) determining an asset distribution for the given asset based at least in part on said at least one transformed data frame; computing an isolation metric for each given one of the assets within at least one of the cohorts based on a corresponding asset distributions; determining that the at least one of the assets is anomalous in response to the computed isolation metric satisfying a predetermined threshold; and causing at least one remediation action to be performed based at least in part on said determining; wherein the method is carried out by at least one computing device. 2. The computer-implemented method of claim 1 , wherein said clustering comprises applying an unsupervised clustering algorithm to the time-series data frames. 3. The computer-implemented method of claim 1 , wherein said determining the asset distribution for the given asset comprises applying a sparse mixture of sparse Gaussians (SMSG) to the at least one transformed data frame for the given asset. 4. The computer-implemented method of claim 1 , wherein said determining the asset distribution for the given asset comprises applying a Gaussian distribution to the transformed data frame for the given asset. 5. The computer-implemented method of claim 1 , comprising: computing an isolation metric for each one of the cohorts based at least in part on the corresponding asset distributions; and determining that at least one of the cohorts is anomalous in response to the computed isolation metric satisfying a predetermined threshold. 6. The computer-implemented method of claim 5 , wherein said determining that at least one of the cohorts is anomalous comprises: computing a Gaussian dictionary for each one of the cohorts, wherein each of the Gaussian dictionaries comprises the asset distributions of the assets corresponding to the cohort; and averaging all pairs of asset distributions between the assets distributions in the Gaussian dictionary corresponding to a first one of the cohorts and the asset distributions in the Gaussian dictionary corresponding to a second one of the cohorts. 7. The computer-implemented method of claim 1 , wherein the time-series data frames comprise (i) timestamps and (ii) sensor values for the plurality of assets. 8. The computer-implemented method of claim 1 , wherein the at least one remediation action comprises one or more of: outputting a list of the anomalous assets to a user; outputting a list of the anomalous cohorts to a user; disabling the at least on asset; resetting the at least one asset; and adjusting future data frames obtained from one or more of the anomalous assets and the anomalous cohorts. 9. The computer-implemented method of claim 1 , wherein the plurality of assets comprise at least a part of one or more of: (i) a heating system, (ii) a ventilation system, (iii) a cooling system and (iv) a turbine system. 10. A computer program product comprising a computer readable storage medium having program instructions embodied therewith, the program instructions executable by a computing device to cause the computing device to: obtain time-series data frames corresponding to a plurality of assets; cluster the assets into one or more cohorts based on the time-series data frames, wherein each one of the cohorts comprises assets having statistically similar time-series data frames; for each given one of the assets within each one of the cohorts: (i) apply a time-context window to a portion of the time-series data frames corresponding to the given asset to generate at least one transformed data frame, and (ii) determine an asset distribution for the given asset based at least in part on said at least one transformed data frame; compute an isolation metric for each given one of the assets within at least one of the cohorts based on a corresponding asset distributions; determine that the at least one of the assets is anomalous in response to the computed isolation metric satisfying a predetermined threshold; and cause at least one remediation action to be performed based at least in part on said determining. 11. The computer program product of claim 10 , wherein said clustering comprises applying an unsupervised clustering algorithm to the time-series data frames. 12. The computer program product of claim 10 , wherein said determining the asset distribution for the given asset comprises applying a sparse mixture of sparse Gaussians (SMSG) to the at least one transformed data frame for the given asset. 13. The computer program product of claim 10 , wherein said determining the asset distribution for the given asset comprises applying a Gaussian distribution to the transformed data frame for the given asset. 14. The computer program product of claim 10 , wherein the computing device is caused to: compute an isolation metric for each one of the cohorts based at least in part on the corresponding asset distributions; and determine that at least one of the cohorts is anomalous in response to the computed isolation metric satisfying a predetermined threshold. 15. The computer program product of claim 14 , wherein said determining that at least one of the cohorts is anomalous comprises: computing a Gaussian dictionary for each one of the cohorts, wherein each of the Gaussian dictionaries comprises the asset distributions of the assets corresponding to the cohort; and averaging all pairs of asset distributions between the assets distributions in the Gaussian dictionary corresponding to a first one of the cohorts and the asset distributions in the Gaussian dictionary corresponding to a second one of the cohorts. 16. The computer program product of claim 10 , wherein the time-series data frames comprise (i) timestamps and (ii) sensor values for the plurality of assets. 17. A system comprising: a memory; and at least one processor operably coupled to the memory and configured for: obtaining time-series data frames corresponding to a plurality of assets; clustering the assets into one or more cohorts based on the time-series data frames, wherein each one of the cohorts comprises assets having statistically similar time-series data frames; for each given one of the assets within each one of the cohorts: (i) applying a time-context window to a portion of the time-series data frames corresponding to the given asset to generate at least one transformed data frame, and (ii) determining an asset distribution for the given asset based at least in part on said at least one transformed data frame; computing an isolation metric for each given one of the assets within at least one of the cohorts based on a corresponding asset distributions; determining that the at least one of the assets is anomalous in response to the computed isolation metric satisfying a predetermined threshold; and causing at least one remediation action to be performed based at least in part on said determining. 18. A computer-implemented method comprising: obtaining time-series data frames corresponding to a p

Assignees

Inventors

Classifications

  • Traffic logging, e.g. anomaly detection · CPC title

  • Machine learning · CPC title

  • Countermeasures against malicious traffic (countermeasures against attacks on cryptographic mechanisms H04L9/002) · CPC title

  • for evaluating statistical data {, e.g. average values, frequency distributions, probability functions, regression analysis (forecasting specially adapted for a specific administrative, business or logistic context G06Q10/04)} · CPC title

  • Services for machine-to-machine communication [M2M] or machine type communication [MTC] · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11271957B2 cover?
Methods, systems, and computer program products for contextual anomaly detection across assets are provided herein. A method includes obtaining time-series data frames corresponding to assets; clustering the assets into one or more cohorts based on the time-series data frames, each cohort comprising assets having statistically similar time-series data frames; for each given asset within each co…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification H04L63/1425. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Mar 08 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).