Rebooting infiniband clusters
US-2015248298-A1 · Sep 3, 2015 · US
US9769016B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9769016-B2 |
| Application number | US-201113092460-A |
| Country | US |
| Kind code | B2 |
| Filing date | Apr 22, 2011 |
| Priority date | Jun 7, 2010 |
| Publication date | Sep 19, 2017 |
| Grant date | Sep 19, 2017 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
One embodiment of the present invention provides a switch system. The switch includes a port that couples to a server hosting a number of virtual machines. The switch also includes a link tracking module. During operation, the link tracking module determines that reachability to at least one end host coupled to a virtual cluster switch of which the switch is a member is disrupted. The link tracking module then determines that at least one virtual machine coupled to the port is affected by the disrupted reachability, and communicates to the server hosting the affected virtual machine about the disrupted reachability.
Opening claim text (preview).
What is claimed is: 1. A switch, comprising: one or more ports; a link tracking module configured to: determine a first failure which affects reachability to the switch via at least one of the ports; determine that the first failure affects reachability to at least one virtual machine hosted in a computing device distinct from the switch; and construct a notification packet comprising information regarding the first failure, wherein the notification packet is destined to a hypervisor running the virtual machine in the computing device to allow the hypervisor to reconfigure the virtual machine. 2. The switch of claim 1 , wherein the link tracking module is further configured to monitor at least a link coupled to the switch, the port of the switch, or both. 3. The switch of claim 1 , wherein the link tracking module is further configured to construct a second notification packet comprising affected reachability information associated with the first failure, wherein the second notification packet is destined to a second hypervisor residing in a second computing device distinct from the switch. 4. The switch of claim 1 , further comprising a routing module configured to update network topology information and corresponding reachability information upon detecting the first failure. 5. The switch of claim 1 , wherein the switch is a member of a fabric switch; wherein the fabric switch includes a number of member switches; and wherein the fabric switch is controlled as a single logical switch. 6. A first computing system, comprising: a processor; and a non-transitory computer-readable storage medium storing instructions which when executed by the processor causes the processor to perform a method, the method comprising: determining a failure which affects reachability to the first computing system via at least one port of the first computing system; determining that the first failure affects reachability to at least one virtual machine hosted in a second computing system distinct from the first computing system; and constructing a notification packet comprising information regarding the first failure, wherein the notification packet is destined to a hypervisor running the virtual machine in the second computing system to allow the hypervisor to reconfigure the virtual machine. 7. The first computing system of claim 6 , wherein the method further comprises monitoring at least a link coupled to the first computing system, the port of the first computing system, or both. 8. The first computing system of claim 6 , wherein the method further comprises constructing a second notification packet comprising affected reachability information associated with the first failure, wherein the second notification packet is destined to a second hypervisor residing in a third computing system distinct from the first computing system. 9. The first computing system of claim 6 , wherein the method further comprises updating network topology information and corresponding reachability information upon detecting the first failure. 10. The first computing system of claim 6 , wherein the computing system is a member of a fabric switch; wherein the fabric switch includes a number of member switches; and wherein the fabric switch is controller as a single logical switch. 11. The first computing system of claim 6 , wherein the method further comprises associating a respective local port of the first computing system as an egress port for the notification packet. 12. A computer-executable method, comprising: determining, by a computer, a first failure which affects reachability to a switch via at least one port of the switch; determining that the first failure affects reachability to at least one virtual machine hosted in a computing device distinct from the switch; and constructing a notification packet comprising information regarding the first failure, wherein the notification packet is destined to a hypervisor running the virtual machine in the computing device to allow the hypervisor to reconfigure the virtual machine. 13. The method of claim 12 , further comprising monitoring at least a link coupled to the switch, the port of the switch, or both. 14. The method of claim 12 , further comprising constructing a second notification packet comprising affected reachability information associated with the first failure, wherein the second notification packet is destined to a second hypervisor residing in a second computing device distinct from the switch. 15. The method of claim 12 , further comprising updating network topology information and corresponding reachability information upon detecting the first failure. 16. The method of claim 12 , wherein the switch is a member of a fabric switch; wherein the fabric switch includes a number of member switches; and wherein the fabric switch is controlled as a single logical switch. 17. A switch means, comprising: one or more port means; a link tracking means for: determining a first failure which affects reachability to the switch means via at least one of the port means; determining that the first failure affects reachability to at least one virtual machine means hosted in a computing device means distinct from the switch means; and constructing a notification packet comprising information regarding the first failure, wherein the notification packet is destined to a hypervisor means running the virtual machine means in the computing device means to allow the hypervisor means to reconfigure the virtual machine means.
Localisation of faults · CPC title
Error detection · CPC title
Virtual LANs, VLANs, e.g. virtual private networks [VPN] (LAN interconnection over a bridge based backbone H04L12/462; encapsulation techniques H04L12/4633; routing of packets H04L45/00; packet switches H04L49/00; virtual private networks for security H04L63/0272) · CPC title
of virtual routers · CPC title
Virtual switches · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.