Automated datacenter network failure mitigation
US-9025434-B2 · May 5, 2015 · US
US9722694B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9722694-B2 |
| Application number | US-201514852184-A |
| Country | US |
| Kind code | B2 |
| Filing date | Sep 11, 2015 |
| Priority date | Sep 11, 2015 |
| Publication date | Aug 1, 2017 |
| Grant date | Aug 1, 2017 |
Official abstract text for this publication.
Various techniques for managing communications backup for computer networks are disclosed herein. In one embodiment, a method includes detecting an abnormal operating condition at a primary network node, the primary network node being coupled to a computing device via a first optical connection between an optical switch and the primary network node. In response to the detected abnormal operation condition, the method includes prompting the optical switch to switch from the first optical connection to a second optical connection between the optical switch and a standby network node. The method further includes instructing the standby network node to facilitate communications with the computing device based on the replicated network configuration.
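The three steps in the abstract (detect, switch, instruct) can be sketched as follows. This is a minimal illustration only, not the patented implementation: the class names `NetworkNode` and `OpticalSwitch`, the `healthy` flag, and the `fail_over` function are all hypothetical, and the "replicated network configuration" is modeled as a plain dictionary copy.

```python
from dataclasses import dataclass, field


@dataclass
class NetworkNode:
    # Hypothetical stand-in for a primary or standby network node.
    name: str
    healthy: bool = True
    config: dict = field(default_factory=dict)
    active: bool = False


@dataclass
class OpticalSwitch:
    # Hypothetical switch: records which node the computing device reaches.
    connected_to: str


def fail_over(primary: NetworkNode, standby: NetworkNode, switch: OpticalSwitch) -> bool:
    """Move traffic to the standby node; return True if a failover happened."""
    # Step 1: detect an abnormal operating condition at the primary node.
    if primary.healthy:
        return False
    # Replicate the primary's configuration onto the standby (assumed to be a dict copy).
    standby.config = dict(primary.config)
    # Step 2: prompt the optical switch to use the second optical connection.
    switch.connected_to = standby.name
    # Step 3: instruct the standby node to carry the communications.
    standby.active = True
    primary.active = False
    return True


if __name__ == "__main__":
    primary = NetworkNode("primary-1", healthy=False, config={"vlan": 10}, active=True)
    standby = NetworkNode("standby-1")
    switch = OpticalSwitch(connected_to="primary-1")
    print(fail_over(primary, standby, switch), switch.connected_to)
```

In this sketch the configuration is copied before the optical path is moved, so the standby node is ready when traffic arrives; the abstract itself does not fix that ordering.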
Opening claim text (preview).
We claim:
1. A method for providing communications backup in a computer network that includes: multiple primary network nodes and a standby network node; multiple primary optical switches individually having: an input port coupled to one or more computing devices; a first output port coupled to one of the primary network nodes; and a second output port different from the first output port; a standby optical switch having multiple input ports individually coupled to the second output port of the primary optical switches and an output port coupled to the standby network node; wherein the method comprising: receiving, from the primary network nodes via the computer network, data representing operating parameters or a status indicator of the primary network nodes; determining, based on the received data, an abnormal operating condition exists at one of the primary network nodes; and in response to determining that the abnormal operation condition exists at the primary network node, switching one of the primary optical switches corresponding to the primary network node and the standby optical switch from a first optical connection to a second optical connection, wherein: the first optical connection is between the one or more computing devices and the primary network node via the input port and the first output port of the corresponding primary optical switch; and the second optical connection is between the one or more computing devices and the standby network node via the input port and the second output port of the corresponding primary optical switch, and one of the input ports and the output port of the standby optical switch.
2. The method of claim 1 wherein switching one of the primary optical switches corresponding to the primary network node and the standby optical switch includes: switching the primary optical switch from outputting via the first output port to outputting via the second output port; and switching the standby optical switch to connect the output port to one of the input ports coupled to the second output port of the primary optical switch.
3. The method of claim 1 wherein: the standby network node is a first standby network node; the output port of the standby optical switch is a first output port of the standby optical switch coupled to the first standby network node; the standby optical switch further includes a second output port coupled to a second standby network node of the computer network; the method further includes, in response to determining that the abnormal operation condition exists at the primary network node, selecting the first standby network node as a backup network node; and the second optical connection is between the one or more computing devices and the selected first network node via the input port and the second output port of the corresponding primary optical switch, one of first input ports and the output port of the standby optical switch.
4. The method of claim 1 wherein: The multiple primary network nodes include first and second primary network nodes; and determining an abnormal operating condition exists at one of the primary network nodes includes determining that an abnormal operating condition exists at both the first primary network node and the second primary network node; the method further includes selecting one of the first primary network node or the second primary network node based on operating profiles of the one or more computing device coupled to the first and second primary network nodes, respectively; and switching one of the primary optical switches corresponding to the primary network node and the standby optical switch includes switching one of the primary optical switches corresponding to the first or second primary network node to switch from being connected to the first primary network node or the second primary network node, respectively, to being connected to the standby optical switch.
5. The method of claim 1, further comprising in response to determining that the abnormal operation condition exists at the primary network node, configuring the standby network node using configuration information collected from the one of the primary network nodes such that the standby network node functions generally similarly as the one of the primary network nodes.
6. The method of claim 1, further comprising in response to determining that the abnormal operation condition exists at the primary network node, configuring the standby network node using configuration information collected from the one of the primary network nodes such that the standby network node functions generally similarly as the one of the primary network nodes, wherein the configuration information includes data representing parameters of port configuration and a routing table.
7. The method of claim 1, further comprising: in response to determining that the abnormal operation condition exists at the primary network node, configuring the standby network node using configuration information collected from the one of the primary network nodes such that the standby network node functions generally similarly as the one of the primary network nodes, wherein the configuration information includes data representing parameters of port configuration and a routing table; receiving a notification from the standby network node indicating configuration is completed successfully; in response to receiving the notification, switching one of the primary optical switches corresponding to the primary network node and the standby optical switch from the first optical connection to the second optical connection.
8. A method performed by a computing device in a computer network that includes: first and second primary network nodes and a standby network node; a first primary optical switch having an input port coupled to one or more first servers, a first output port coupled to the first primary network node, and a second output port; a second primary optical switch having an input port coupled to one or more second servers, a first output port coupled to the second primary network node and a second output port; and a standby optical switch having first and second input ports individually coupled to the second output port of the first and second primary optical switches, and an output port coupled to the standby network node, wherein the method comprising: receiving data representing operating parameters or a status indicator from the first and second primary network nodes; determining, based on the received data, whether at least one of the first or second primary network node has an abnormal operating condition; and in response to determining that the first primary network node has an abnormal operation condition, switching the first primary optical switch from outputting via the first output port to outputting via the second output port that is coupled to the first input port of the standby optical switch; and switching the standby optical switch to connect the first servers coupled to the input port of the first primary optical switch to the standby network node via the input port of the primary optical switch, the second output port of the primary optical switch, and the first input port of the standby optical switch.
9. The method of claim 8 wherein: the standby network node is a first standby network node; the output port of the standby optical switch is a first output port of the standby optical switch; the standby optical switch further includes a second output port coupled to a second standby network node; and the method further includes, in response to the detected abnormal operation condition of the first primary network node, selecting one of the first sta
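The switching topology recited in claims 1, 2, and 7 can be modeled in a few lines. The sketch below is an illustration under stated assumptions, not the patent's implementation: every class and function name is hypothetical, the `error_rate` threshold is an invented example of an "operating parameter", and picking the first abnormal node is a simplification (claim 4 instead selects based on operating profiles).

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class PrimaryOpticalSwitch:
    """1x2 switch: output 1 goes to the primary node, output 2 to the standby switch."""
    primary_node: str
    active_output: int = 1


@dataclass
class StandbyOpticalSwitch:
    """Nx1 switch: input i is wired to output 2 of primary switch i."""
    selected_input: Optional[int] = None


@dataclass
class StandbyNode:
    config: Dict[str, object] = field(default_factory=dict)

    def configure(self, config: Dict[str, object]) -> bool:
        # Copy port configuration and routing table (claim 6); the return
        # value stands in for the "configuration completed" notification.
        self.config = dict(config)
        return True


def is_abnormal(status: Dict[str, object], max_error_rate: float = 0.01) -> bool:
    # Hypothetical rule: node is down, or its error rate exceeds a threshold.
    return (not status.get("up", False)) or status.get("error_rate", 0.0) > max_error_rate


def mitigate(statuses: Dict[str, Dict[str, object]],
             configs: Dict[str, Dict[str, object]],
             primary_switches: List[PrimaryOpticalSwitch],
             standby_switch: StandbyOpticalSwitch,
             standby_node: StandbyNode) -> Optional[int]:
    """Fail over the first abnormal primary node; return its switch index or None."""
    if standby_switch.selected_input is not None:
        return None  # the single standby node is already in use
    for i, sw in enumerate(primary_switches):
        if is_abnormal(statuses[sw.primary_node]):
            # Claim 7 ordering: configure first, switch only after success.
            if not standby_node.configure(configs[sw.primary_node]):
                return None
            sw.active_output = 2               # claim 2, first step
            standby_switch.selected_input = i  # claim 2, second step
            return i
    return None
```

Keeping the standby switch's selection as a single index reflects that the one standby node can back up only one primary node at a time in this model.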
Route discovery packet · CPC title
Arrangements for fault recovery · CPC title
Configuration by using pre-existing information, e.g. using templates or copying from other elements · CPC title
Switch and router aspects · CPC title
by dynamic selection of recovery network elements, e.g. replacement by the most appropriate element after failure · CPC title