Probability-distribution-based log-file analysis

US11048608B2 · US · B2

Patent metadata
Field	Value
Publication number	US-11048608-B2
Application number	US-201514660461-A
Country	US
Kind code	B2
Filing date	Mar 17, 2015
Priority date	Mar 17, 2015
Publication date	Jun 29, 2021
Grant date	Jun 29, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The current document is directed to systems, and methods incorporated within the systems, that carry out probability-distribution-based analysis of log-file entries. A monitoring subsystem within a distributed computer system uses probability-distribution-based analysis of log-file entries to detect changes in the state of the distributed computer system. A log-file-analysis subsystem within a distributed computer system uses probability-distribution-based analysis of log-file entries to identify subsets of log-file entries that predict anomalies and impending problems in the distributed computer system. In many implementations, a numerical comparison of probability distributions of log-file-entry types is used to detect state changes in the distributed computer system.
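The comparison described in the abstract can be illustrated with a minimal sketch. It assumes each log entry has already been reduced to an event-type label, that every interval contains at least one entry from the chosen vocabulary, and that the Jensen-Shannon divergence (base 2) is the numerical comparison. The function names `event_distribution`, `js_divergence`, and `monitor`, and the threshold value, are hypothetical and are not taken from the patent.

```python
import math
from collections import Counter


def event_distribution(event_types, vocabulary):
    """Relative frequency of each event type in one time interval."""
    counts = Counter(event_types)
    total = sum(counts[e] for e in vocabulary)
    if total == 0:
        # Assumption: an empty interval has no meaningful distribution.
        raise ValueError("interval contains no events from the vocabulary")
    return {e: counts[e] / total for e in vocabulary}


def js_divergence(p, q):
    """Jensen-Shannon divergence (base 2) of two distributions over the same keys."""
    # Mixture distribution M = (P + Q) / 2.
    m = {k: 0.5 * (p[k] + q[k]) for k in p}

    def kl(a):
        # Kullback-Leibler divergence KL(a || m); zero-probability terms contribute 0.
        return sum(a[k] * math.log2(a[k] / m[k]) for k in a if a[k] > 0)

    return 0.5 * kl(p) + 0.5 * kl(q)


def monitor(intervals, vocabulary, threshold):
    """Yield (interval index, divergence, alarm flag) for each interval after the first.

    Each interval is compared with the one immediately before it.
    """
    previous = None
    for i, events in enumerate(intervals):
        current = event_distribution(events, vocabulary)
        if previous is not None:
            d = js_divergence(previous, current)
            yield i, d, d > threshold
        previous = current
```

With base-2 logarithms the divergence lies between 0 (identical distributions) and 1 (no shared event types), so a threshold can be chosen on a fixed scale regardless of how many entries each interval contains.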

First claim

Opening claim text (preview).

The invention claimed is:
1. A log-file analysis subsystem within a computer system having one or more processors, one or more memories, and computer instructions, stored in one or more of the one or more memories that, when executed by one or more of the one or more processors, control the log-file analysis system to monitor a state of the computer system by repeatedly: generating, for one or more log files, each having multiple entries that are each associated with an event type, a probability distribution of all or a subset of the event types in the one or more log files for a time interval to represent the state of a monitored computer system for the time interval; storing the generated probability distribution in association with an indication of the time interval; and after generating and storing each probability distribution following generation and storing of an initial set of probability distributions, computing a divergence metric from the two most recently generated and stored probability distributions; and when the divergence metric is greater than a threshold value, raising an alarm to indicate, or displaying an indication of, a significant system-state change.
2. The log-file analysis subsystem of claim 1 wherein monitoring the state of the computer system by the log-file analysis system further includes: using the stored probability distributions collected over a first time interval spanning multiple shorter, secondary time intervals to generate a typical probability distribution for each of a set of time intervals selected from the multiple shorter, secondary time intervals; and at subsequent secondary time intervals, generating a probability distribution for the event types of log entries selected from the most recently completed secondary time interval, computing a Jensen-Shannon divergence metric for the probability distribution generated from the most recently completed secondary time interval and the typical probability distribution for the most recently completed secondary time interval, and when the Jensen-Shannon divergence metric is greater than a threshold value, raising an alarm to indicate, or displaying an indication of, a system-state change.
3. The log-file analysis subsystem of claim 1 wherein monitoring the state of the computer system by the log-file analysis system further includes: for each of a number of different subsets of the event types for which the log-file analysis subsystem has generated and stored probability distributions for different time intervals, computing a Jensen-Shannon divergence metric for the probability distributions for different pairs of time intervals, and computing a measure of the variance of the Jensen-Shannon divergence metrics computed for the probability distributions for different pairs of the time intervals; and selecting, as a basis for a monitoring fingerprint, a subset of the event types having the greatest computed variance.
4. A method that monitors a state of a distributed computer system that includes multiple, network interconnected discrete computer systems, each having one or more processors, one or more memories, and one or more data-storage devices, one or more of the discrete computer systems including computer instructions, stored in one or more of the one or more memories of the discrete computer system, that, when executed by one or more of the one or more processors, control the discrete computer system to carry out the method comprising: repeatedly generating, for one or more log files, each having multiple entries that are each associated with an event type, a probability distribution of all or a subset of the event types in the one or more log files for a time interval to represent the state of a monitored computer system for the time interval, storing the generated probability distribution in association with an indication of the time interval in one or more of one or more memories and/or data-storage devices, and after generating and storing each probability distribution following generation and storing of an initial set of probability distributions, computing a divergence metric from the two most recently generated and stored probability distributions, and when the divergence metric is greater than a threshold value, raising an alarm to indicate, or displaying an indication of, a system-state change.
5. The method of claim 4 wherein the divergence metric is the Jensen-Shannon divergence metric.
6. The method of claim 4 further including: using the stored probability distributions collected over a first time interval spanning multiple shorter, secondary time intervals to generate a typical probability distribution for each of a set of time intervals selected from the multiple shorter, secondary time intervals; and at subsequent secondary time intervals, generating a probability distribution for the event types of log entries selected from the most recently completed secondary time interval, computing a divergence metric for the probability distribution generated from the most recently completed secondary time interval and the typical probability distribution for the most recently completed secondary time interval, and when the divergence metric is greater than a threshold value, raising an alarm to indicate, or displaying an indication of, a system-state change.
7. The method of claim 6 wherein the divergence metric is the Jensen-Shannon divergence metric.
8. The method of claim 4 further including: for each of a number of different subsets of the event types for which the log-file analysis subsystem has generated and stored probability distributions for different time intervals, computing a divergence metric for the probability distributions for different pairs of time intervals, and computing a measure of the variance of the divergence metrics computed for the probability distributions for different pairs of the time intervals; and selecting, as a basis for a monitoring fingerprint, a subset of the event types having the greatest computed variance.
9. The method of claim 8 wherein the divergence metric is the Jensen-Shannon divergence metric.
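
The dependent claims describe two further steps: building a typical distribution for a recurring time interval, and choosing the event-type subset whose pairwise divergences vary the most as a monitoring fingerprint. The sketch below is one possible reading, not the patented implementation. It assumes that the typical distribution is a simple arithmetic mean of the stored distributions (the claims do not say how it is generated), that the variance measure is the population variance, and that every candidate subset has at least one entry in every interval. All function names are hypothetical.

```python
import math
from itertools import combinations
from statistics import pvariance


def js_divergence(p, q):
    """Jensen-Shannon divergence (base 2); keys missing from one side count as 0."""
    keys = set(p) | set(q)
    m = {k: 0.5 * (p.get(k, 0.0) + q.get(k, 0.0)) for k in keys}

    def kl(a):
        # KL(a || m); zero-probability terms contribute nothing.
        return sum(v * math.log2(v / m[k]) for k, v in a.items() if v > 0)

    return 0.5 * kl(p) + 0.5 * kl(q)


def typical_distribution(distributions):
    """Assumed baseline: the mean of distributions seen for the same recurring slot."""
    keys = set().union(*distributions)
    n = len(distributions)
    return {k: sum(d.get(k, 0.0) for d in distributions) / n for k in keys}


def restrict(counts, subset):
    """Distribution over only the event types in `subset`, renormalized to sum to 1."""
    total = sum(counts.get(e, 0) for e in subset)
    return {e: counts.get(e, 0) / total for e in subset}


def select_fingerprint(interval_counts, candidate_subsets):
    """Return the candidate subset whose pairwise JS divergences have the largest variance."""

    def divergence_variance(subset):
        dists = [restrict(c, subset) for c in interval_counts]
        divs = [js_divergence(a, b) for a, b in combinations(dists, 2)]
        return pvariance(divs)

    return max(candidate_subsets, key=divergence_variance)
```

In use, the distribution for the most recently completed interval would be compared against `typical_distribution(...)` for the matching slot, for example `js_divergence(typical, current) > threshold`, where `threshold` is a tuning value chosen by the operator.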

Assignees

VMware Inc

Inventors

Classifications

H04L41/142 · Primary
using statistical or mathematical methods · CPC title
G06F11/0709
in a distributed system consisting of a plurality of standalone computer nodes, e.g. clusters, client-server systems · CPC title
G06F11/0781
Error filtering or prioritizing based on a policy defined by the user or on a policy defined by a hardware/software module, e.g. according to a severity level · CPC title
G06F17/40
Data acquisition and logging (for input to computer G06F3/00) · CPC title
G06F11/0778
Dumping, i.e. gathering error/state information after a fault for later diagnosis · CPC title

Patent family

Related publications grouped by family.

View patent family 56924019

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11048608B2 cover?: The current document is directed to systems, and methods incorporated within the systems, that carry out probability-distribution-based analysis of log-file entries. A monitoring subsystem within a distributed computer system uses probability-distribution-based analysis of log-file entries to detect changes in the state of the distributed computer system. A log-file-analysis subsystem within a …
Who is the assignee on this patent?: VMware Inc
What technology area does this patent fall under?: Primary CPC classification H04L41/142. Mapped technology areas include Electricity.
When was this patent published?: Publication date Jun 29, 2021 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).