Dynamic graph anomaly detection framework and scalable system architecture

US9898604B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9898604-B2
Application numberUS-201514924550-A
CountryUS
Kind codeB2
Filing dateOct 27, 2015
Priority dateJun 21, 2013
Publication dateFeb 20, 2018
Grant dateFeb 20, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Machine generated event log data which includes events occurring over a window of time is received where each event includes a first node, a second node, and a timestamp. The events are aggregated into a plurality of aggregated graph snapshots. Communities within the plurality of aggregated graph snapshots are identified and community tracking links are determined between communities in the plurality of aggregated graph snapshots. A community that has an anomalous evolution in the plurality of aggregated graph snapshots compared to the evolution of other communities is identified based at least in part on the community tracking links. The communities are displayed where the display includes the community tracking links and identifies the community that has the anomalous evolution.

First claim

Opening claim text (preview).

What is claimed is: 1. A method, comprising: receiving, at a first segment and a second segment in a massively parallel processing (MPP) database system, machine generated event log data which includes one or more events occurring during a first window of time and one or more one or more events occurring during a first window of time, wherein each event includes a first node, a second node, and a timestamp; aggregating the events into a plurality of aggregated graph snapshots, including by: sending, from the first segment to the second segment, any machine generated event log data associated with events occurring during the second window of time; sending, from the second segment to the first segment, any machine generated event log data associated with events occurring during the first window of time; using a first hardware processor on the first segment to generate a first aggregated graph snapshot associated with the first window of time; and using a second hardware processor on the second segment to generate a second aggregated graph snapshot associated with the second window of time; identifying one or more communities within the plurality of aggregated graph snapshots, including by: identifying, on the first segment, one or more communities within the first aggregated graph snapshot; storing, on the first segment, the communities identified in the first aggregated graph snapshot; identifying, on the second segment, one or more communities within the second aggregated graph snapshot; and storing, on the second segment, the communities identified in the second aggregated graph snapshot; determining one or more community tracking links between communities in the plurality of aggregated graph snapshots, including by: sending, from the first segment to the second segment, the communities identified in the first aggregated graph snapshot; comparing, on the second segment, the communities identified in the first aggregated graph snapshot and the communities identified in the second aggregated graph snapshot in order to generate one or more community tracking links between the communities identified in the first aggregated graph snapshot and the communities identified in the second aggregated graph snapshot; and storing, on the second segment, the community tracking links between the communities identified in the first aggregated graph snapshot and the communities identified in the second aggregated graph snapshot; using a processor to identify, based at least in part on the community tracking links, a community that has an anomalous evolution in the plurality of aggregated graph snapshots compared to the evolution of other communities; and displaying the communities, wherein the display includes the community tracking links and identifies the community that has the anomalous evolution. 2. The method of claim 1 , wherein displaying further includes: displaying a legend which indicates one or more selected communities in a current aggregated graph snapshot; and displaying a backward navigation arrow and a forward navigation arrow associated with displaying one or more linked communities in a prior aggregated graph snapshot or a next aggregated graph snapshot, respectively, that are linked to the selected communities in the current aggregated graph snapshot. 3. The method of claim 1 , wherein displaying further includes: displaying a legend which indicates one or more selected communities in a current aggregated graph snapshot; and using a plurality of colors to foreshadow one or more vertices which are associated with one or more linked communities in a next aggregated graph snapshot that are linked to the selected communities in the current aggregated graph snapshot, wherein a different color is used for each linked community. 4. The method of claim 1 , wherein displaying further includes displaying, for one or more selected communities, one or more of the following characteristics: a Jaccard coefficient, a modified Jaccard coefficient, a size of a front community (F), a size of a current community, or a ratio of a size of a front community (F) to a size of a current community (C). 5. The method of claim 1 , wherein displaying further includes displaying a two-feature scatter plot, wherein a first axis of the two-feature scatter plot is associated with a modified Jaccard coefficient and a second axis of the two-feature scatter plot is associated with a ratio of a size of a front community (F) to a size of a current community (C). 6. A system, comprising: a processor; and a memory coupled with the processor, wherein the memory is configured to provide the processor with instructions which when executed cause the processor to: receive, at a first segment and a second segment in a massively parallel processing (MPP) database system, machine generated event log data which includes one or more events occurring during a first window of time and one or more one or more events occurring during a first window of time, wherein each event includes a first node, a second node, and a timestamp; aggregate the events into a plurality of aggregated graph snapshots, including by: sending, from the first segment to the second segment, any machine generated event log data associated with events occurring during the second window of time; sending, from the second segment to the first segment, any machine generated event log data associated with events occurring during the first window of time; generating, on the first segment, a first aggregated graph snapshot associated with the first window of time; and generating, on the second segment, a second aggregated graph snapshot associated with the second window of time; identify one or more communities within the plurality of aggregated graph snapshots, including by: identifying, on the first segment, one or more communities within the first aggregated graph snapshot; storing, on the first segment, the communities identified in the first aggregated graph snapshot; identifying, on the second segment, one or more communities within the second aggregated graph snapshot; and storing, on the second segment, the communities identified in the second aggregated graph snapshot; determine one or more community tracking links between communities in the plurality of aggregated graph snapshots, including by: sending, from the first segment to the second segment, the communities identified in the first aggregated graph snapshot; comparing, on the second segment, the communities identified in the first aggregated graph snapshot and the communities identified in the second aggregated graph snapshot in order to generate one or more community tracking links between the communities identified in the first aggregated graph snapshot and the communities identified in the second aggregated graph snapshot; and storing, on the second segment, the community tracking links between the communities identified in the first aggregated graph snapshot and the communities identified in the second aggregated graph snapshot; identify, based at least in part on the community tracking links, a community that has an anomalous evolution in the plurality of aggregated graph snapshots compared to the evolution of other communities; and display the communities, wherein the display includes the community tracking links and identifies the community that has the anomalous evolution. 7. The system of claim 6 , wherein displaying further includes: displaying a legend which indicates one or more selected communities in a current aggregated graph snapshot; and displaying a backward navigation arrow and a forward navigation arrow associated with displaying one or more linked communities in a prior aggregated graph snapshot or a next aggregated gr

Assignees

Inventors

Classifications

  • Traffic logging, e.g. anomaly detection · CPC title

  • Test or assess a computer or a system · CPC title

  • G06F21/56Primary

    Computer malware detection or handling, e.g. anti-virus arrangements · CPC title

  • involving long-term monitoring or reporting · CPC title

  • for detecting or protecting against malicious traffic · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9898604B2 cover?
Machine generated event log data which includes events occurring over a window of time is received where each event includes a first node, a second node, and a timestamp. The events are aggregated into a plurality of aggregated graph snapshots. Communities within the plurality of aggregated graph snapshots are identified and community tracking links are determined between communities in the plu…
Who is the assignee on this patent?
Emc Ip Holding Co Llc
What technology area does this patent fall under?
Primary CPC classification H04L63/1425. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Feb 20 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).