Programmable diagnosis model for correlation of network events

US11533215B2 · US · B2

Patent metadata
Field               Value
Publication number  US-11533215-B2
Application number  US-202016821745-A
Country             US
Kind code           B2
Filing date         Mar 17, 2020
Priority date       Jan 31, 2020
Publication date    Dec 20, 2022
Grant date          Dec 20, 2022

Abstract

Official abstract text for this publication.

Network management techniques are described. A controller device of this disclosure manages a device group of a network. The controller device includes processing circuitry in communication with the memory, the processing circuitry being configured to receive, using a programmable diagnosis service executed by the processing circuitry, a programming input, to form, using the programmable diagnosis service, based on the programming input, a resource definition graph that models interdependencies between a plurality of resources supported by the device group, to detect, using the programmable diagnosis service, an event affecting a first resource of the plurality of resources, and to identify, using the programmable diagnosis service, based on the interdependencies modeled in the resource definition graph formed based on the programming input, a root cause event that caused the event affecting the first resource, the root cause event occurring at a second resource of the plurality of resources.
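
The flow in the abstract (programming input, dependency graph, detected event, root cause at a second resource) can be illustrated with a minimal Python sketch. It is not the patented implementation. The resource names, the mapping format used as the programming input, and the functions `build_graph`, `dependencies_of`, and `find_root_causes` are all hypothetical. The selection rule is also an assumption: among the resources the affected resource depends on, keep those that have an event of their own and no faulty dependency beneath them.

```python
from collections import deque

def build_graph(programming_input):
    """Turn a {resource: [dependencies]} mapping into a dependency graph."""
    graph = {}
    for resource, deps in programming_input.items():
        graph.setdefault(resource, set()).update(deps)
        for dep in deps:
            graph.setdefault(dep, set())  # make sure leaf resources exist as nodes
    return graph

def dependencies_of(graph, start):
    """Return every resource that `start` depends on, directly or transitively."""
    seen = set()
    queue = deque(graph.get(start, ()))
    while queue:
        node = queue.popleft()
        if node in seen:
            continue  # already visited; also guards against cycles
        seen.add(node)
        queue.extend(graph.get(node, ()))
    return seen

def find_root_causes(graph, affected, faulty):
    """Return faulty resources at or below `affected` that have no faulty dependency."""
    candidates = (dependencies_of(graph, affected) | {affected}) & faulty
    return sorted(c for c in candidates if not (dependencies_of(graph, c) & faulty))

# Hypothetical programming input: a service depends on a path, which depends
# on an interface, which depends on a forwarding component.
programming_input = {
    "vpn-service": ["lsp-1"],
    "lsp-1": ["interface-ge-0/0/1"],
    "interface-ge-0/0/1": ["pfe-0"],
}
graph = build_graph(programming_input)

# Resources that currently have a detected event (assumed for the example).
faulty = {"vpn-service", "lsp-1", "pfe-0"}
root_causes = find_root_causes(graph, "vpn-service", faulty)
```

Here the event on `vpn-service` (the first resource) is traced to `pfe-0` (the second resource), because `pfe-0` is the only faulty resource in the chain with no faulty dependency of its own.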

First claim

Opening claim text (preview).

What is claimed is:

1. A method of monitoring a device group of a network, the method comprising: receiving, by a programmable diagnosis service running on a controller device that manages the device group, a programming input; forming, by the programmable diagnosis service, based on the programming input, a resource definition graph that models interdependencies between a plurality of resources supported by the device group, the plurality of resources including services provided by devices of the device group and components of the devices of the device group, and the resource definition graph including interdependencies between the services and the components of the devices involved in providing the services; initializing, by the programmable diagnosis service, one or more telemetry rules that enable the controller device to perform one or more of: monitoring state information for one or more components of the device group; or instigating one or more alarms in response to detecting threshold events occurring within the plurality of resources, wherein initializing the one or more telemetry rules comprises configuring, by the programmable diagnosis service, based on the programming input, first cause-and-effect relationships between device resources supported by the device group, and second cause-and-effect relationships between service resources supported by the device group; detecting, by the programmable diagnosis service, using a detected packet loss between two or more of the devices of the device group, an event affecting a first resource of the plurality of resources; and identifying, by the programmable diagnosis service, based on the interdependencies modeled in the resource definition graph formed based on the programming input, a root cause event that caused the event affecting the first resource, the root cause event occurring at a second resource of the plurality of resources.

2. The method of claim 1, wherein identifying the root cause event that caused the event affecting the first resource comprises: applying, by the programmable diagnosis service, the resource definition graph to at least a subset of the plurality of resources to generate one or more inference rules with respect to the plurality of resources; and performing, by the programmable diagnosis service, a forward-chained root cause analysis (RCA) by applying the one or more inference rules to a plurality of events detected over the plurality of resources.

3. The method of claim 1, wherein forming the resource definition graph that models the interdependencies between the plurality of resources supported by the device group comprises applying, by the programmable diagnosis service, one or more temporal constraints to the modeled interdependencies.

4. The method of claim 3, wherein the one or more temporal constraints include a constraint according to which the event affecting the first resource occurs after the root cause event occurring at the second resource.

5. The method of claim 3, wherein the one or more temporal constraints include a constraint according to which the event affecting the first resource occurs before the root cause event occurring at the second resource.

6. The method of claim 1, wherein the plurality of resources comprises one or more network resources, the method further comprising configuring, by the programmable diagnosis service, at least a subset of the one or more network resources.

7. The method of claim 1, wherein the plurality of resources comprises one or more service resources, the method further comprising configuring, by the programmable diagnosis service, at least a subset of the one or more service resources.

8. The method of claim 1, wherein the plurality of resources comprises one or more device resources, the method further comprising configuring, by the programmable diagnosis service, at least a subset of the one or more device resources.

9. A controller device for managing a device group of a network, the controller device comprising: a network interface; a memory; and processing circuitry in communication with the memory, the processing circuitry being configured to: receive, using a programmable diagnosis service executed by the processing circuitry, a programming input; form, using the programmable diagnosis service, based on the programming input, a resource definition graph that models interdependencies between a plurality of resources supported by the device group, the plurality of resources including services provided by devices of the device group and components of the devices of the device group, and the resource definition graph including interdependencies between the services and the components of the devices involved in providing the services; initialize, using the programmable diagnosis service, one or more telemetry rules that enable the controller device to perform one or more of: monitoring state information for one or more components of the device group; or instigating one or more alarms in response to detecting threshold events occurring within the plurality of resources, wherein to initialize the one or more telemetry rules, the processing circuitry is configured to use the programmable diagnosis service to configure, based on the programming input, first cause-and-effect relationships between device resources supported by the device group, and second cause-and-effect relationships between service resources supported by the device group; detect, using the programmable diagnosis service, using a detected packet loss between two or more of the devices of the device group, an event affecting a first resource of the plurality of resources; and identify, using the programmable diagnosis service, based on the interdependencies modeled in the resource definition graph formed based on the programming input, a root cause event that caused the event affecting the first resource, the root cause event occurring at a second resource of the plurality of resources.

10. The controller device of claim 9, wherein to identify the root cause event that caused the event affecting the first resource, the processing circuitry is configured to: apply, using the programmable diagnosis service, the resource definition graph to at least a subset of the plurality of resources to generate one or more inference rules with respect to the plurality of resources; and perform, using the programmable diagnosis service, a forward-chained root cause analysis (RCA) by applying the one or more inference rules to a plurality of events detected over the plurality of resources.

11. The controller device of claim 9, wherein to form the resource definition graph that models the interdependencies between the plurality of resources supported by the device group, the processing circuitry is configured to apply, using the programmable diagnosis service, one or more temporal constraints to the modeled interdependencies.

12. The controller device of claim 11, wherein the one or more temporal constraints include a constraint according to which the event affecting the first resource occurs after the root cause event occurring at the second resource.

13. The controller device of claim 11, wherein the one or more temporal constraints include a constraint according to which the event affecting the first resource occurs before the root cause event occurring at the second resource.

14. The controller device of claim 9, wherein the plurality of resources comprises one or more network resources, and wherein the processing circuitry is further configured to use the programmable diagnosis service to configure a
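
The dependent claims on inference rules, forward-chained RCA, and temporal constraints can be read together as a small rule engine. The sketch below is a hedged illustration under stated assumptions, not the claimed implementation. The `Event` and `Rule` types, the resource names, the 30-second window, and the limit of one event per resource are all invented for the example. Each dependency edge becomes an inference rule of the form "an event on the cause resource explains an event on the effect resource that occurs no earlier and within the window", which models only the "effect after cause" ordering. Forward chaining then keeps combining derived cause-and-effect facts until no new fact appears.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Event:
    resource: str
    timestamp: float  # seconds; the unit is an assumption

@dataclass(frozen=True)
class Rule:
    cause: str        # resource where the causing event occurs
    effect: str       # resource where the resulting event occurs
    max_delay: float  # temporal constraint: effect follows cause within this window

def rules_from_graph(depends_on, max_delay=30.0):
    """Generate one inference rule per dependency edge (dependency causes dependent)."""
    return [Rule(cause=dep, effect=res, max_delay=max_delay)
            for res, deps in depends_on.items() for dep in deps]

def forward_chain(events, rules):
    """Derive (cause_event, effect_event) facts until a fixpoint is reached."""
    by_resource = {e.resource: e for e in events}  # assumes one event per resource
    causes = set()
    # Base facts: apply each graph-derived rule to the observed events.
    for rule in rules:
        cause = by_resource.get(rule.cause)
        effect = by_resource.get(rule.effect)
        if cause is None or effect is None:
            continue
        if 0 <= effect.timestamp - cause.timestamp <= rule.max_delay:
            causes.add((cause, effect))
    # Chaining: if a causes b and b causes d, then a causes d.
    changed = True
    while changed:
        changed = False
        for a, b in list(causes):
            for c, d in list(causes):
                if b == c and (a, d) not in causes:
                    causes.add((a, d))
                    changed = True
    return causes

def root_causes_of(event, causes):
    """Return events that cause `event` and are not themselves caused by anything."""
    caused = {effect for _, effect in causes}
    roots = [c for c, e in causes if e == event and c not in caused]
    return sorted(roots, key=lambda ev: ev.timestamp)

# Hypothetical dependency edges and observed events.
depends_on = {"vpn-service": ["lsp-1"], "lsp-1": ["interface-ge-0/0/1"]}
events = [
    Event("interface-ge-0/0/1", 100.0),
    Event("lsp-1", 102.0),
    Event("vpn-service", 105.0),
]
rules = rules_from_graph(depends_on)
causes = forward_chain(events, rules)
roots = root_causes_of(events[2], causes)
```

In this example the interface event at 100.0 is derived as the root cause of the `vpn-service` event at 105.0 through the intermediate `lsp-1` event. An event that arrives before its supposed cause fails the temporal check and produces no cause-and-effect fact.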

Assignees

Juniper Networks Inc

Inventors

Classifications

  • using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis · CPC title

  • using machine learning or artificial intelligence · CPC title

  • using filtering, e.g. reduction of information by using priority, element types, position or time · CPC title

  • Errors, e.g. transmission errors · CPC title

  • for initial configuration or provisioning, e.g. plug-and-play · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

Who is the assignee on this patent?
Juniper Networks Inc
What technology area does this patent fall under?
Primary CPC classification H04L41/0631. Mapped technology areas include Electricity.
When was this patent published?
Publication date: Dec 20, 2022 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).