Technology system auto-recovery and optimality engine and techniques

US2021286668A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2021286668-A1
Application numberUS-202117335531-A
CountryUS
Kind codeA1
Filing dateJun 1, 2021
Priority dateNov 27, 2018
Publication dateSep 16, 2021
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Disclosed are hardware and techniques for correcting computer process faults by identifying risk associated with correcting a computer process fault and computer processes that may depend on the corrected computer process. The interdependent computer processes in a network may be determined by evaluating a stream of process break flags from a monitoring component coupled to the network. Each computer process break flag in the stream of computer process break flags indicates a process fault detected by the monitoring component and is correlated to a corrective response. The break flag and the corrective response are assigned a risk. A risk matrix accounts for interdependencies between computer processes and identified corrective actions. A final response strategy that corrects the computer process faults is determined using the assigned risk and computer system interdependence. A runbook stores the final response strategy, which may be updated based on changing computer process interdependencies and assigned risk.

First claim

Opening claim text (preview).

What is claimed is: 1 . An apparatus, comprising: a memory storing programming code; and a triage processing component, coupled to the memory and, via a communication interface, to a monitoring component that monitors operation of computer processes of a network, operable to execute the stored programming code, that when executed causes the triage processing component to perform functions to: populate a risk assessment matrix with a break risk assessment value and a fix risk assessment value both assigned to one or more corrective actions identified as being able to correct a possible cause of a potential operational breakdown of a computer process; identify interdependency risk patterns in the risk assessment matrix populated with the break risk assessment value and the fix risk assessment value; obtain, from a runbook, a list of corrective actions correlated to a first process break flag, wherein the runbook includes a plurality of corrective actions that correct potential operational breakdowns of the computer process of the network and the first process break flag indicating a symptom of the potential operational breakdown of one or more computer-implemented processes; and generate, based on the identified interdependency risk patterns, a response strategy incorporating at least one corrective action from the list of corrective actions. 2 . The apparatus of claim 1 , wherein the identified interdependency risk patterns indicate risks related to each corrective action in the runbook and an effect of applying each corrective action on the computer process in the network. 3 . The apparatus of claim 1 , wherein the memory further comprises programming code that causes the triage processing component to perform further functions to: assign the break risk assessment value indicating a likelihood of occurrence of the potential operational breakdown of the computer process in the network; and assign a respective fix risk assessment value to each of the identified one or more corrective actions. 4 . The apparatus of claim 3 , wherein: the assigned break risk assessment value has a range from a value indicating the potential operational breakdown has a high likelihood of occurring to another value indicating the potential operation breakdown has a low likelihood of occurring; and the respective fix risk assessment value assigned to each of the identified corrective actions has a range from a value indicating the potential operational breakdown has a high likelihood of being fixed to a different value indicating the potential operation breakdown has a low likelihood of being fixed by the respective identified corrective action. 5 . The apparatus of claim 1 , wherein the memory further comprises programming code that causes the triage processing component to perform further functions to: assign an interdependency rating to each corrective action in the list of corrective actions, wherein the interdependency rating quantifies a level of interdependence of the computer process on other computer processes potentially affected by application of each of the one or more corrective action in the list of corrective actions; populate the risk assessment matrix with the assigned interdependency rating of each corrective action in the list of corrective actions; and evaluate the risk assessment matrix, based on the assigned interdependency rating of each corrective action in the list of corrective actions to one another. 6 . The apparatus of claim 1 , wherein the memory further comprises programming code that causes the triage processing component to perform further functions to: in response to an evaluation of the risk assessment matrix, flag a respective corrective action from the list of corrective actions as an optimal corrective action for use in the response strategy. 7 . The apparatus of claim 6 , wherein the memory further comprises programming code that causes the triage processing component to perform further functions to: apply the flagged respective corrective action to the computer process experiencing a process fault associated with a computer process break flag in the network. 8 . The apparatus of claim 1 , wherein the memory further comprises programming code that causes the triage processing component to perform further functions to: receive successive process break flags that follow previous process break flags from the monitoring circuit coupled to the triage processing component; generate, based on the process break flags, break risk assessment values and fix risk assessment values of the process break flags; populate a copy of the risk assessment matrix using the generated break risk assessment values and the generated fix risk assessment values to produce a revised risk assessment matrix; analyze the break risk assessment values and the fix risk assessment values of the previous process break flags in the populated risk assessment matrix with reference to the break risk assessment values and the fix risk assessment values of the successive process break flags in the revised risk assessment matrix; and update, based on results of the analysis of the break risk assessment values and the fix risk assessment values, the runbook to identify one corrective action in the list of corrective actions for implementing the one corrective action to fix the potential operational breakdown. 9 . A method, comprising: populating, by a triage processing component coupled to a plurality of monitoring circuits and a network environment, a risk assessment matrix with a break risk assessment value and a fix risk assessment value assigned to one or more corrective actions identified to correct a possible cause of a potential operational breakdown of a computer process; identifying interdependency risk patterns in the risk assessment matrix populated with the assigned break risk assessment value and the fix risk assessment value assigned for each of the one or more identified corrective actions; obtaining a list of corrective actions correlated to a first process break flag from a runbook, wherein the runbook includes a plurality of corrective actions that correct potential operational breakdowns of the computer process; and generating, based on the identified interdependency risk patterns, a response strategy incorporating at least one corrective action from the list of corrective actions. 10 . The method of claim 9 , further comprising: applying the at least one corrective action in the response strategy to the computer process experiencing a process fault associated with a computer process break flag in the network environment. 11 . The method of claim 9 , wherein the identified interdependency risk patterns indicate risks related to each corrective action in the runbook and an effect of applying each corrective action on the computer process in the network. 12 . The method of claim 9 , further comprising: assigning, by the triage component, the break risk assessment value indicating a likelihood of occurrence of the potential operational breakdown of the computer process; assigning a respective fix risk assessment value to each of the identified corrective actions; and populating the risk assessment matrix with the assigned break risk assessment value of the computer process to each of the identified corrective actions and the assigned fix risk assessment value to each of the identified corrective actions. 13 . The method of claim 9 , wherein: the assigned break risk assessment value has a range from a value indicating the potential operational breakdown has a high likelihood of occurring to another value indicat

Assignees

Inventors

Classifications

  • Storage of error reports, e.g. persistent data storage, storage using memory protection · CPC title

  • Error or fault detection not based on redundancy (power supply failures G06F1/30; network fault management H04L41/06) · CPC title

  • Root cause analysis, i.e. error or fault diagnosis (in a hardware test environment G06F11/22; in a software test environment G06F11/36) · CPC title

  • Remedial or corrective actions (recovery from an exception in an instruction pipeline G06F9/3861; by retry G06F11/1402; for recovering from a failure of a protocol instance or entity H04L69/40) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2021286668A1 cover?
Disclosed are hardware and techniques for correcting computer process faults by identifying risk associated with correcting a computer process fault and computer processes that may depend on the corrected computer process. The interdependent computer processes in a network may be determined by evaluating a stream of process break flags from a monitoring component coupled to the network. Each co…
Who is the assignee on this patent?
Capital One Services Llc
What technology area does this patent fall under?
Primary CPC classification G06F11/0793. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Sep 16 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).