Generating problem signatures from snapshots of time series data

US9632859B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9632859-B2
Application numberUS-201514736514-A
CountryUS
Kind codeB2
Filing dateJun 11, 2015
Priority dateJun 11, 2015
Publication dateApr 25, 2017
Grant dateApr 25, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Software that generates statistical models of events impacting computer systems and uses those models to detect similar events in the future. The software performs the following operations: (i) receiving a snapshot of a first event impacting a computer system, where the snapshot includes a first set of values for a plurality of metrics occurring over a first time period corresponding to the first event; (ii) extracting a first set of feature vectors from the first set of values; (iii) generating a first statistical model representing the first event based, at least in part, on the extracted first set of feature vectors; and (iv) determining that a second event is similar to the first event by comparing the first statistical model to a second set of values for the plurality of metrics occurring over a second time period corresponding to the second event.

First claim

Opening claim text (preview).

What is claimed is: 1. A method comprising: receiving, by one or more processors, a snapshot of a first event impacting a computer system, where the snapshot includes a first set of values for a plurality of metrics occurring over a first time period corresponding to the first event; extracting, by one or more processors, a first set of feature vectors from the first set of values for the plurality of metrics occurring over the first time period; generating, by one or more processors, a first statistical model representing the first event based, at least in part, on the extracted first set of feature vectors; and determining, by one or more processors, that a second event is similar to the first event by comparing the first statistical model to a second set of values for the plurality of metrics occurring over a second time period corresponding to the second event. 2. The method of claim 1 , wherein the plurality of metrics and the first time period for the snapshot are selected by a human user. 3. The method of claim 1 , further comprising: generating, by one or more processors, a second statistical model representing the second event based, at least in part, on the second set of values; wherein determining that the second event is similar to the first event by comparing the first statistical model to the second set of values includes comparing the first statistical model to the second statistical model. 4. The method of claim 3 , wherein comparing the first statistical model to the second statistical model includes calculating a Kullback-Leibler distance. 5. The method of claim 1 , wherein determining that the second event is similar to the first event by comparing the first statistical model to the second set of values comprises: generating, by one or more processors, a similarity score for the second set of values, wherein the similarity score represents an amount of likelihood that the second set of values could result from the first statistical model; determining, by one or more processors, that the similarity score for the second set of values is above a predetermined threshold; and responsive to determining that the similarity score for the second set of values is above the predetermined threshold, determining, by one or more processors, that the second event is similar to the first event. 6. The method of claim 1 , wherein the comparing of the first statistical model to the second set of values utilizes a distance metric. 7. The method claim 1 , further comprising: informing a user, by one or more processors, that an event similar to the first event has occurred. 8. A computer program product comprising a computer readable storage medium, wherein the computer readable storage medium is not a transitory signal per se, the computer readable storage medium having stored thereon: first program instructions programmed to receive a snapshot of a first event impacting a computer system, where the snapshot includes a first set of values for a plurality of metrics occurring over a first time period corresponding to the first event; second program instructions programmed to extract a first set of feature vectors from the first set of values for the plurality of metrics occurring over the first time period; third program instructions programmed to generate a first statistical model representing the first event based, at least in part, on the extracted first set of feature vectors; and fourth program instructions programmed to determine that a second event is similar to the first event by comparing the first statistical model to a second set of values for the plurality of metrics occurring over a second time period corresponding to the second event. 9. The computer program product of claim 8 , wherein the plurality of metrics and the first time period for the snapshot are selected by a human user. 10. The computer program product of claim 8 , further comprising: fifth program instructions programmed to generate a second statistical model representing the second event based, at least in part, on the second set of values; wherein determining that the second event is similar to the first event by comparing the first statistical model to the second set of values includes comparing the first statistical model to the second statistical model. 11. The computer program product of claim 10 , wherein comparing the first statistical model to the second statistical model includes calculating a Kullback-Leibler distance. 12. The computer program product of claim 8 , wherein determining that the second event is similar to the first event by comparing the first statistical model to the second set of values comprises: generating, by one or more processors, a similarity score for the second set of values, wherein the similarity score represents an amount of likelihood that the second set of values could result from the first statistical model; determining, by one or more processors, that the similarity score for the second set of values is above a predetermined threshold; and responsive to determining that the similarity score for the second set of values is above the predetermined threshold, determining, by one or more processors, that the second event is similar to the first event. 13. The computer program product of claim 8 , wherein the comparing of the first statistical model to the second set of values utilizes a distance metric. 14. The computer program product of claim 8 , further comprising: fifth program instructions programmed to inform a user that an event similar to the first event has occurred. 15. A computer system comprising: a processor(s) set; and a computer readable storage medium; wherein: the processor set is structured, located, connected and/or programmed to run program instructions stored on the computer readable storage medium; and the program instructions include: first program instructions programmed to receive a snapshot of a first event impacting a computer system, where the snapshot includes a first set of values for a plurality of metrics occurring over a first time period corresponding to the first event; second program instructions programmed to extract a first set of feature vectors from the first set of values for the plurality of metrics occurring over the first time period; third program instructions programmed to generate a first statistical model representing the first event based, at least in part, on the extracted first set of feature vectors; and fourth program instructions programmed to determine that a second event is similar to the first event by comparing the first statistical model to a second set of values for the plurality of metrics occurring over a second time period corresponding to the second event. 16. The computer system of claim 15 , wherein the plurality of metrics and the first time period for the snapshot are selected by a human user. 17. The computer system of claim 15 , wherein the program instructions further include: fifth program instructions programmed to generate a second statistical model representing the second event based, at least in part, on the second set of values; wherein determining that the second event is similar to the first event by comparing the first statistical model to the second set of values includes comparing the first statistical model to the second statistical model. 18. The computer system of claim 17 , wherein comparing the first statistical model to the second statistical model includes calculating a Kullback-Leibler distance.

Assignees

Inventors

Classifications

  • for systems · CPC title

  • Performance evaluation by statistical analysis · CPC title

  • where the computing system is distributed, e.g. networked systems, clusters, multiprocessor systems (multiprogramming arrangements G06F9/46; allocation of resources G06F9/50) · CPC title

  • Performance evaluation by modeling · CPC title

  • by exceeding a count or rate limit, e.g. word- or bit count limit · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9632859B2 cover?
Software that generates statistical models of events impacting computer systems and uses those models to detect similar events in the future. The software performs the following operations: (i) receiving a snapshot of a first event impacting a computer system, where the snapshot includes a first set of values for a plurality of metrics occurring over a first time period corresponding to the fir…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F11/079. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 25 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).