Sequential anomaly detection

US9727821B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9727821-B2
Application numberUS-201313969151-A
CountryUS
Kind codeB2
Filing dateAug 16, 2013
Priority dateAug 16, 2013
Publication dateAug 8, 2017
Grant dateAug 8, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A dataset including at least one temporal event sequence is collected. A one-class sequence classifier f(x) that obtains a decision boundary is statistically learned. At least one new temporal event sequence is evaluated, wherein the at least one new temporal event sequence is outside of the dataset. It is determined whether the at least one new temporal event sequence is one of a normal sequence or an abnormal sequence based on the evaluation. Numerous additional aspects are disclosed.

First claim

Opening claim text (preview).

What is claimed is: 1. A method comprising: collecting a first computer process related dataset comprising a plurality of normal temporal event sequences; learning, using the first computer process related dataset, a one-class sequence classifier f(x) that obtains a decision boundary configured to label temporal event sequences; collecting a second computer process related dataset comprising at least one new temporal event sequence; and evaluating the at least one new temporal event sequence using the decision boundary to label the at least one new temporal event sequence as an abnormal sequence, causing a computer system to issue an alert, wherein learning the classifier learning comprises: randomly initializing a solution space (Ω); constructing an undirected graph for the normal temporal event sequences in the first computer process related dataset; capturing at least one temporal dynamic of the normal temporal event sequences of the first computer process related dataset; assigning labels to each of the normal temporal event sequences of the first computer process related dataset by computing, for each of the normal temporal event sequences a probability that the at least one normal temporal event sequence of the first computer process related dataset is a normal sequence or an abnormal sequence, wherein at least one of the at least one normal temporal event sequences is labeled as an abnormal sequence; and refining the classifier using the plurality of normal temporal event sequences of the first computer process related dataset with respective labels. 2. The method of claim 1 , wherein the normal temporal event sequences of the first computer process related dataset are unlabeled. 3. The method of claim 1 , wherein evaluating the new temporal event sequence uses the learned classifier to compute a probabilistic distance of the new temporal event sequence to the decision boundary. 4. The method of claim 1 , wherein the first computer process related dataset further comprises latent random variables, and wherein each latent random variable is associated with each event in the sequence. 5. The method of claim 1 , further comprising: repeating the steps for learning the classifier until a termination criterion is satisfied. 6. The method of claim 5 , wherein repeating the steps for learning the classifier until the termination criterion is satisfied further comprises: enforcing, using a user defined parameter, a plurality of the datasets to have a higher probability of having a normal sequence, while keeping a solution vector of the solution space (Ω) small as compared to a number of normal sequences. 7. The method of claim 1 , further comprising: reviewing an automatic classification of at least one of the normal temporal event sequences in the first computer process related dataset as abnormal; receiving a user inputted label of “true anomaly” for at least one of the reviewed automatically classified sequences; adding the “true anomaly” labeled sequences to the first computer process related dataset; and re-training the classifier using the first computer process related dataset including the “true anomaly” labeled sequences. 8. The method of claim 7 , wherein during the re-training, a relative weight of the “true anomaly” labeled sequences is higher than a weight given to other sequences in the first computer process related dataset by distribution-sensitive learning. 9. The method of claim 8 , wherein the relative weight of the “true anomaly” labeled sequences is inversely proportional to a distribution of the sequences in the first computer process related dataset. 10. The method of claim 1 , wherein learning the one-class classifier further comprises: defining at least one view configuration organizing at least one feature of the normal temporal event sequences of the first computer process related dataset into multiple views; refining the classifier with at least one of the defined configuration of views using multi-view learning; determining an automatic classification of the at least one new temporal event sequence with the refined classifier, wherein the automatic classification is a negative label; receiving user input regarding the automatic classification of the at least one new temporal event sequence, the user input updating a negative label of the at least new temporal event sequence to a positive label; and generating at least one suggestion to adjust the at least one view configuration based on classification results of the refined classifier. 11. The method of claim 1 , further comprising providing a system, wherein the system comprises distinct software modules, each of the distinct software modules being embodied on a computer-readable storage medium, and wherein the distinct software modules comprise a data collection module, an optimization engine module, an evaluation engine module, and an analysis module; wherein: said dataset collection is carried out by said data collection module executing on at least one hardware processor; said classifier learning is carried out by said optimization engine module executing on said at least one hardware processor; said evaluating is carried out by said evaluation engine module executing on said at least one hardware processor; and said determining is carried out by said analysis module executing on said at least one hardware processor. 12. A computer program product comprising a non-transitory computer readable storage medium having computer readable program code embodied therewith, said computer readable program code comprising: computer readable program code configured to: collect a dataset comprising a plurality of normal temporal event sequences; learn, using the dataset, a one-class sequence classifier f(x) that obtains a decision boundary labeling each temporal event sequence; evaluate, using the decision boundary, at least one new temporal event sequence, wherein the at least one new temporal event sequence is outside of the dataset; and determine that the at least one new temporal event sequence is an abnormal sequence based on the evaluating step, causing an alert to be issued, wherein the computer readable program code configured to learn the classifier learning comprises computer readable program code configured to: randomly initialize a solution space (Ω); construct an undirected graph for the normal temporal event sequences in the dataset; capture at least one temporal dynamic of the normal temporal event sequences of the dataset; assign labels to each of the normal temporal event sequences of the dataset by computing, for each of the normal temporal event sequences a probability that the at least one normal temporal event sequence of the dataset is a normal sequence or an abnormal sequence, wherein at least one of the at least one normal temporal event sequences is labeled as an abnormal sequence; and refine the classifier using the plurality of normal temporal event sequences of the dataset with respective labels. 13. An apparatus comprising: a memory; and at least one processor, coupled to said memory, and operative to: collect a dataset comprising a plurality of normal temporal event sequences; learn, using the plurality of normal temporal event sequences, a one-class sequence classifier f(x) that obtains a decision boundary labeling each temporal event sequence; evaluate, using the decision boundary, at least one new temporal event sequence, wherein the at least one new temporal event sequence is outside of the dataset; and determine that the at least one new temporal event sequence is an abnormal sequence based on

Assignees

Inventors

Classifications

  • G06N7/01Primary

    Probabilistic graphical models, e.g. probabilistic networks · CPC title

  • G06N20/00Primary

    Machine learning · CPC title

  • G06N7/005Primary

    Physics · mapped topic

  • Physics · mapped topic

  • Knowledge engineering; Knowledge acquisition · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9727821B2 cover?
A dataset including at least one temporal event sequence is collected. A one-class sequence classifier f(x) that obtains a decision boundary is statistically learned. At least one new temporal event sequence is evaluated, wherein the at least one new temporal event sequence is outside of the dataset. It is determined whether the at least one new temporal event sequence is one of a normal sequen…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06N7/01. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 08 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).