Identifying solutions to application execution problems in distributed computing environments

US10089169B2 · US · B2

Patent metadata
Field: Value
Publication number: US-10089169-B2
Application number: US-201615202803-A
Country: US
Kind code: B2
Filing date: Jul 6, 2016
Priority date: Feb 2, 2015
Publication date: Oct 2, 2018
Grant date: Oct 2, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An expert system extracts events associated with executing an application from log files generated by various topological resources in a distributed computing environment. The events are plotted as plot points on a time series graph. Patterns are identified in the plot points that are associated with application problems, along with the computing environment configurations both before the problem and after the problem was resolved. The difference in the configurations represents a corrective action for the application problem, and the expert system links the corrective action to the pattern. When a pattern repeats in conjunction with another application problem, the corrective action is identified as a possible solution to the new problem. A confidence level associated with the pattern/corrective action may be increased when a user accepts the corrective action and may be decreased when a user rejects the corrective action.
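
The flow described in the abstract can be illustrated with a minimal Python sketch. The log line format, the `Event` field names, and the flat key/value configuration model are assumptions made for illustration; the text on this page does not specify them.

```python
import re
from dataclasses import dataclass

# Hypothetical log line format (an assumption, not taken from the patent):
#   "<epoch-seconds> <resource> <log_type> <error_type>"
LINE_RE = re.compile(r"^(\d+)\s+(\S+)\s+(\S+)\s+(\S+)$")


@dataclass(frozen=True)
class Event:
    time: int        # time of occurrence (x-axis of the graph)
    resource: str    # topological resource (y-axis of the graph)
    log_type: str
    error_type: str


def extract_events(lines):
    """Parse events from log lines, skipping lines that do not match."""
    events = []
    for line in lines:
        m = LINE_RE.match(line.strip())
        if m:
            t, resource, log_type, error_type = m.groups()
            events.append(Event(int(t), resource, log_type, error_type))
    # Sort by time so the events read as a time series.
    return sorted(events, key=lambda e: e.time)


def config_diff(pre, post):
    """Return the changes needed to turn `pre` into `post`.

    Each entry maps a key to (old_value, new_value); None marks a key
    that is absent on that side. The diff stands in for the
    "corrective action" linked to a pattern.
    """
    changes = {}
    for key in pre.keys() | post.keys():
        old, new = pre.get(key), post.get(key)
        if old != new:
            changes[key] = (old, new)
    return changes
```

For example, calling `config_diff` on a configuration captured before a problem and one captured after its resolution yields only the settings that changed, which is the part an operator would need to reapply.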

First claim

Opening claim text (preview).

What is claimed is:

1. A method for identifying a solution to a problem condition associated with execution of an application, the method comprising: plotting each of a plurality of events associated with the execution of the application as a plot point on a graph having an x-axis representing a time of occurrence of each event and having a y-axis representing a topological resource associated with each event, the plot point representing a combination of an error type of each event and a log type of each event; identifying a pattern in the plot points, the pattern temporally associated with an earlier problem condition, the earlier problem condition associated with the execution of the application; identifying a second pattern in the plot points, the second pattern temporally associated with the problem condition; and identifying, by an expert system, a corrective action as the solution to the problem condition based on comparing the pattern and the second pattern.

2. The method of claim 1 further comprising: extracting, by the expert system, the plurality of events from a plurality of log files, the plurality of log files generated at a plurality of topological resources, each event identified by the error type, the log type, a time of occurrence, and the topological resource.

3. The method of claim 1, wherein the pattern starts at a start time and ends at an end time, the method further comprising: identifying a pre-problem configuration of a distributed computing environment before the start time and a post-problem configuration of the distributed computing environment after the end time; and linking, by the expert system, the corrective action to the pattern, the corrective action representing configuration changes needed to convert the pre-problem configuration to the post-problem configuration.

4. The method of claim 1, wherein the execution of the application utilizes a plurality of topological resources in a distributed computing environment.

5. The method of claim 1, further comprising: displaying, by the expert system, at least part of the graph to a user; displaying the corrective action to the user; receiving a response from the user, the response associated with the corrective action; and modifying, by the expert system and based on the received response, a confidence level associated with the corrective action as the solution to the problem condition.

6. The method of claim 1, further comprising: identifying, by the expert system, a second corrective action as a second solution to the problem condition; displaying, by the expert system, at least part of the graph to a user; displaying the corrective action and the second corrective action to the user; receiving a response from the user, the response indicating that the user accepts the corrective action and rejects the second corrective action; increasing a confidence level associated with the corrective action as the solution to the problem condition; and decreasing a second confidence level associated with the second corrective action as the solution to the problem condition.

7. A computer program product for identifying a solution to a problem condition associated with execution of an application, the computer program product comprising a computer readable storage medium having program instructions embodied therewith, the program instructions executable by a processor to perform a method comprising: plotting each of a plurality of events associated with the execution of the application as a plot point on a graph having an x-axis representing a time of occurrence of each event and having a y-axis representing a topological resource associated with each event, the plot point representing a combination of an error type of each event and a log type of each event; identifying a pattern in the plot points, the pattern temporally associated with an earlier problem condition, the earlier problem condition associated with the execution of the application; identifying a second pattern in the plot points, the second pattern temporally associated with the problem condition; and identifying, by an expert system, a corrective action as the solution to the problem condition based on comparing the pattern and the second pattern.

8. The computer program product of claim 7, wherein the method further comprises: extracting, by the expert system, the plurality of events from a plurality of log files, the plurality of log files generated at a plurality of topological resources, each event identified by the error type, the log type, a time of occurrence, and the topological resource.

9. The computer program product of claim 7, wherein the pattern starts at a start time and ends at an end time, and wherein the method further comprises: identifying a pre-problem configuration of a distributed computing environment before the start time and a post-problem configuration of the distributed computing environment after the end time; and linking, by the expert system, the corrective action to the pattern, the corrective action representing configuration changes needed to convert the pre-problem configuration to the post-problem configuration.

10. The computer program product of claim 7, wherein the execution of the application utilizes a plurality of topological resources in a distributed computing environment.

11. The computer program product of claim 7, wherein the method further comprises: displaying, by the expert system, at least part of the graph to a user; displaying the corrective action to the user; receiving a response from the user, the response associated with the corrective action; and modifying, by the expert system and based on the received response, a confidence level associated with the corrective action as the solution to the problem condition.

12. The computer program product of claim 7, wherein the method further comprises: identifying, by the expert system, a second corrective action as a second solution to the problem condition; displaying, by the expert system, at least part of the graph to a user; displaying the corrective action and the second corrective action to the user; receiving a response from the user, the response indicating that the user accepts the corrective action and rejects the second corrective action; increasing a confidence level associated with the corrective action as the solution to the problem condition; and decreasing a second confidence level associated with the second corrective action as the solution to the problem condition.

13. A computing system for identifying a solution to a problem condition associated with execution of an application, the computing system comprising: a memory; and a processor in communication with the memory, wherein the computing system is configured to perform a method, the method comprising: plotting each of a plurality of events associated with the execution of the application as a plot point on a graph having an x-axis representing a time of occurrence of each event and having a y-axis representing a topological resource associated with each event, the plot point representing a combination of an error type of each event and a log type of each event; identifying a pattern in the plot points, the pattern temporally associated with an earlier problem condition, the earlier problem condition associated with the execution of the application; identifying a second pattern in the plot points, the second pattern temporally associated with the problem condition; and identifying, by an expert system, a corrective action as the solution to the problem condition based on comparing the patter
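
Claims 1, 5, and 6 can be read together as a match-then-feedback loop. The Python sketch below is one hypothetical reading: a pattern is reduced to a set of (resource, log type, error type) combinations, similarity is Jaccard overlap, and confidence moves by a fixed step. None of these choices come from the claims, which do not say how patterns are compared or by how much a confidence level changes; `KnownFix`, `suggest`, and `record_feedback` are illustrative names only.

```python
from dataclasses import dataclass


@dataclass
class KnownFix:
    signature: frozenset      # set of (resource, log_type, error_type) tuples
    corrective_action: str
    confidence: float = 0.5   # assumed starting value


def signature(events):
    """Reduce (time, resource, log_type, error_type) plot points to an
    order-free signature. Dropping the time axis is a simplification."""
    return frozenset((r, lt, et) for (_time, r, lt, et) in events)


def jaccard(a, b):
    """Jaccard similarity of two sets; 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def suggest(new_events, known_fixes, threshold=0.5):
    """Return known fixes whose pattern resembles the new one, best first."""
    sig = signature(new_events)
    scored = []
    for fix in known_fixes:
        similarity = jaccard(sig, fix.signature)
        if similarity >= threshold:
            # Rank by similarity weighted by the stored confidence level.
            scored.append((similarity * fix.confidence, fix))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [fix for _score, fix in scored]


def record_feedback(fix, accepted, step=0.1):
    """Raise confidence on acceptance, lower it on rejection, clamped to [0, 1]."""
    delta = step if accepted else -step
    fix.confidence = min(1.0, max(0.0, fix.confidence + delta))
    return fix.confidence
```

In the scenario of claim 6, the caller would invoke `record_feedback(fix, True)` for the accepted corrective action and `record_feedback(other, False)` for the rejected one, so later rankings from `suggest` shift toward the action users actually chose.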

Classifications

  • the processing taking place on a specific hardware platform or in a specific software environment

  • Remedial or corrective actions (recovery from an exception in an instruction pipeline G06F9/3861; by retry G06F11/1402; for recovering from a failure of a protocol instance or entity H04L69/40)

  • Error or fault reporting or storing

  • Content or structure details of the error report, e.g. specific table structure, specific error fields

  • Error or fault detection not based on redundancy (power supply failures G06F1/30; network fault management H04L41/06)

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10089169B2 cover?
An expert system extracts events associated with executing an application from log files generated by various topological resources in a distributed computing environment. The events are plotted as plot points on a time series graph. Patterns are identified in the plot points that are associated with application problems, along with the computing environment configurations both before the probl…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F11/079. Mapped technology areas include Physics.
When was this patent published?
Publication date Oct 2, 2018 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).