Detecting an abnormal subsequence in a data sequence

US9547543B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9547543-B2
Application numberUS-201514741819-A
CountryUS
Kind codeB2
Filing dateJun 17, 2015
Priority dateJan 27, 2014
Publication dateJan 17, 2017
Grant dateJan 17, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for detecting abnormal subsequences in data sequence includes constructing a hierarchical data structure of a target subsequence, each node in a bottommost layer of the data structure storing corresponding data of the target subsequence, and each node in a layer above the bottommost layer storing values based on data stored in corresponding nodes in a lower layer next to the layer above the bottommost layer; determining a second number of neighbors of the target subsequence based on the data structure of the target subsequence and of the first number of reference subsequences constructed in advance, the second number of neighbors having minimum Euclidean distances from the target subsequence; determining a third number of neighbors of each reference subsequence in the second number of reference subsequences, which have minimum Euclidean distances from each reference subsequence and determining whether the target subsequence is an abnormal subsequence.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for detecting an abnormal subsequence in a data sequence, the data sequence including a target subsequence to be detected and a first number of reference subsequences, the method comprising: constructing, with a processing device, a hierarchical data structure of the target subsequence, each node in a bottommost layer of the hierarchical data structure storing corresponding data of the target subsequence, and each node in a layer above the bottommost layer storing values derived based on data stored in corresponding nodes in a lower layer next to the layer above the bottommost layer; determining a second number of neighbors of the target subsequence based on the hierarchical data structure of the target subsequence and hierarchical data structures of the first number of reference subsequences constructed in advance, the second number of neighbors of the target subsequence being a second number of reference subsequences, which have minimum Euclidean distances from the target subsequence, in the first number of reference subsequences; determining a third number of neighbors of each reference subsequence in the second number of reference subsequences, the third number of neighbors being a third number of subsequences, which have minimum Euclidean distances from the each reference subsequence in the second number of reference subsequences, in the data sequence; and determining whether the target subsequence is an abnormal subsequence, according to the second number of neighbors of the target subsequence, and the third number of neighbors of a reference subsequence, which has the target subsequence as a neighbor thereof, in the second number of reference subsequences. 2. The method of claim 1 , wherein the hierarchical data structure is a binary tree. 3. The method of claim 1 , wherein the values stored by the each node in the layer above the bottommost layer and derived based on the data stored in the corresponding nodes in the lower layer next to the layer above the bottommost layer comprise an average value, a maximum value, and a minimum value of the data stored in the corresponding nodes, and wherein the determining a second number of neighbors of the target subsequence based on the hierarchical data structure of the target subsequence and hierarchical data structures of the first number of reference subsequences constructed in advance includes: determining upper limit values and lower limit values of Euclidean distances between the target subsequence and respective reference subsequences, corresponding to one or more layers of the hierarchical data structure of the target subsequence and the first number of reference subsequences, based on the data or values stored in nodes in the one or more layers of the hierarchical data structure, respectively, starting from an uppermost layer of the hierarchical data structure, and determining the second number of the neighbors of the target subsequence on the basis of the upper limit values and the lower limit values, until the second number of neighbors of the target subsequence are determined. 4. The method of claim 3 , wherein the determining the neighbors of the target subsequence on the basis of the upper limit values and the lower limit values includes: determining one reference subsequence in the first number of reference subsequences as the neighbor of the target subsequence, in response to determining that an upper limit value of the Euclidean distance between the target subsequence and the one reference subsequence is smaller than lower limit values of the Euclidean distances between the target subsequence and the other reference subsequences in the first number of reference subsequences. 5. The method of claim 1 , wherein the third number is equal to the second number, and wherein the determining whether the target subsequence is an abnormal subsequence, according to the second number of neighbors of the target subsequence, and the third number of neighbors of a reference subsequence, which has the target subsequence as a neighbor thereof, in the second number of reference subsequences includes: determining whether the target subsequence is the abnormal subsequence, according to an approximation degree between the target subsequence and the second number of neighbors, and an approximation degree between the reference subsequence, which has the target subsequence as the neighbor thereof, in the second number of reference subsequences and the third number of neighbors thereof. 6. The method of claim 5 , wherein, the determining whether the target subsequence is the abnormal subsequence, according to an approximation degree between the target subsequence and the second number of neighbors, and an approximation degree between the reference subsequence, which has the target subsequence as the neighbor thereof, in the second number of reference subsequences and the third number of neighbors thereof includes: calculating a reciprocal of an average value of Euclidean distances between the target subsequence and the second number of neighbors thereof, so as to determine the approximation degree between the target subsequence and the second number of neighbors of the target subsequence; calculating a reciprocal of an average value of Euclidean distances between each reference subsequence, which has the target subsequence as the neighbor thereof, in the second number of reference subsequences and the third number of neighbors; calculating an average value of reciprocals of average values corresponding to the respective reference subsequence, which has the target subsequence as the neighbor thereof, in the second number of reference subsequences, so as to determine the approximation degree between the reference subsequence, which has the target subsequence as the neighbor thereof, in the second number of reference subsequences and the third number of neighbors thereof; and determining whether the target subsequence is the abnormal subsequence, based on the reciprocal of the average value of the Euclidean distances between the target subsequence and the second number of neighbors thereof, and the average value of the reciprocals of the average values corresponding to the respective reference subsequence in the second number of reference subsequences, which have the target subsequence as the neighbor thereof, in the second number of reference subsequences. 7. The method of claim 6 , wherein the determining whether the target subsequence is the abnormal subsequence, based on the reciprocal of the average value of the Euclidean distances between the target subsequence and the second number of neighbors thereof, and the average value of the reciprocals of the average values corresponding to the respective reference subsequence, which has the target subsequence as the neighbor thereof, in the second number of reference subsequences includes: calculating a ratio between the average value of the reciprocals of the average values corresponding to the respective reference subsequence, which has the target subsequence as the neighbor thereof, in the second number of reference subsequences and the reciprocal of the average value of the Euclidean distances between the target subsequence and the second number of neighbors thereof; and determining that the target subsequence is the abnormal subsequence, in response to determining that the ratio is greater than a predetermined threshold value.

Assignees

Inventors

Classifications

  • based on qualitative trend analysis, e.g. system evolution · CPC title

  • Error or fault detection not based on redundancy (power supply failures G06F1/30; network fault management H04L41/06) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9547543B2 cover?
A method for detecting abnormal subsequences in data sequence includes constructing a hierarchical data structure of a target subsequence, each node in a bottommost layer of the data structure storing corresponding data of the target subsequence, and each node in a layer above the bottommost layer storing values based on data stored in corresponding nodes in a lower layer next to the layer abov…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G05B23/0232. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jan 17 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).