System and method for supporting automatic disabling of degraded links in an InfiniBand (IB) network

US9900293B2 · US · B2

Patent metadata
Publication number: US-9900293-B2
Application number: US-201213488250-A
Country: US
Kind code: B2
Filing date: Jun 4, 2012
Priority date: Jun 3, 2011
Publication date: Feb 20, 2018
Grant date: Feb 20, 2018

Abstract

Official abstract text for this publication.

A system and method can support automatic disabling of degraded links in an InfiniBand (IB) network. At least one node in a fabric can monitor one or more local ports of the at least one node for one or more error states associated with a link at the at least one node, wherein the link is connected to a local port of the at least one node. The at least one node further allows a subnet manager to observe the one or more error states associated with the link at the at least one node, and allows the subnet manager to set the link in a basic state if the observed error states exceed a threshold. In this basic state, the link allows only SMP traffic and prevents data traffic and non-SMP based management traffic.
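
The threshold rule in the abstract can be illustrated with a minimal Python sketch. It is not the patent's implementation: the names `LinkState`, `Traffic`, `Link`, and `evaluate_link` are hypothetical, and the error count is passed in as a plain integer rather than read from real port counters.

```python
from enum import Enum


class LinkState(Enum):
    NORMAL = "normal"  # data and SMP traffic allowed
    BASIC = "basic"    # only SMP traffic allowed


class Traffic(Enum):
    DATA = "data"
    SMP = "smp"
    OTHER_MGMT = "non-SMP management"


class Link:
    """Hypothetical model of a link attached to a local port."""

    def __init__(self, port):
        self.port = port
        self.state = LinkState.NORMAL

    def allows(self, traffic):
        # In the basic state only SMP traffic passes; otherwise everything does.
        if self.state is LinkState.BASIC:
            return traffic is Traffic.SMP
        return True


def evaluate_link(link, observed_errors, threshold):
    """Set the link to the basic state if observed errors exceed the threshold."""
    if observed_errors > threshold:
        link.state = LinkState.BASIC
    return link.state


link = Link(port=1)
evaluate_link(link, observed_errors=11, threshold=10)
```

After the call above the link is in the basic state, so `link.allows(Traffic.SMP)` holds while data and non-SMP management traffic are refused. The subnet manager can therefore still reach the node behind a degraded link without routing data over it.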

First claim

Opening claim text (preview).

What is claimed is:

1. A method for supporting automatic disabling of degraded links in a fabric network that includes a subnet manager (SM) and a plurality of network nodes, the method comprising:
    monitoring, via a daemon process on a first network node in the fabric network, a local port of the first network node for one or more errors, wherein the one or more errors are associated with a link that connects the first network node and a second network node in the fabric network, wherein the link is in a first state and is configured to transfer both normal data and subnet management packet (SMP) data;
    in response to the one or more errors exceeding a configurable threshold, performing, by the daemon process, a disabling operation on the link to put the link into a second state, wherein the link in the second state is configured to transfer only SMP data;
    recording, by the daemon process, the state of the link in a persistent storage, wherein the recorded state is configured to prevent the link from being automatically enabled when the first network node is reset; and
    sending, via the link, one or more SMP messages to the subnet manager, wherein the one or more management data messages indicate that the link is disabled;
    wherein the subnet manager, upon receiving the one or more management data messages, operates to use the link to perform further discovery in the second network node, invoke the daemon process to initiate a test on the link, determine that errors at the local port of the first network node have dropped below the configurable threshold, and change the link from the second state to the first state; and
    wherein the daemon process, in response to the state change of the link from the second state to the first state, operates to remove the recorded state from the persistence storage when the first network node is reset.

2. The method according to claim 1, further comprising: requesting, via the subnet manager, the disabled link to be enabled.

3. The method according to claim 1, further comprising: observing, via the subnet manager, the one or more errors using subnet management packet (SMP) methods.

4. The method according to claim 1, further comprising: including, via the subnet manager, the enabled link in a subnet topology.

5. The method according to claim 1, wherein the fabric network is an InfiniBand network that includes a plurality of subnets.

6. The method according to claim 1, wherein the fabric network specifies one or more policies for defining a link in a non-routable state for observing errors.

7. The method according to claim 1, where an automated logic is used to determine whether the one or more errors exceeds the configurable threshold.

8. A non-transitory machine readable storage medium having instructions stored thereon that when executed cause a system to perform the steps comprising:
    monitoring, via a daemon process on a first network node in the fabric network, a local port of the first network node for one or more errors, wherein the one or more errors are associated with a link that connects the first network node and a second network node in the fabric network, wherein the link is in a first state and is configured to transfer both normal data and subnet management packet (SMP) data;
    in response to the one or more errors exceeding a configurable threshold, performing, by the daemon process, a disabling operation on the link to put the link into a second state, wherein the link in the second state is configured to transfer only SMP data;
    recording, by the daemon process, the state of the link in a persistent storage, wherein the recorded state is configured to prevent the link from being automatically enabled when the first network node is reset; and
    sending, via the link, one or more SMP messages to the subnet manager, wherein the one or more management data messages indicate that the link is disabled;
    wherein the subnet manager, upon receiving the one or more management data messages, operates to use the link to perform further discovery in the second network node, invoke the daemon process to initiate a test on the link, determine that errors at the local port of the first network node have dropped below the configurable threshold, and change the link from the second state to the first state; and
    wherein the daemon process, in response to the state change of the link from the second state to the first state, operates to remove the recorded state from the persistence storage when the first network node is reset.

9. The non-transitory machine readable storage medium according to claim 8, wherein the fabric network is an InfiniBand network that includes a plurality of subnets.

10. The non-transitory machine readable storage medium according to claim 8, wherein the network node operates to request the disabled link to be enabled.

11. The non-transitory machine readable storage medium according to claim 8, wherein the subnet manager observes the one or more errors using subnet management packet (SMP) methods.

12. The non-transitory machine readable storage medium according to claim 8, wherein the subnet manager operates to include the enabled link in a subnet topology.

13. The non-transitory machine readable storage medium according to claim 8, wherein the fabric network specifies one or more policies for defining a link in a non-routable state for observing errors.

14. A system for supporting automatic disabling of degraded links in a network, comprising:
    a computer one or more microprocessors;
    a fabric network executing on the computer, wherein the fabric network includes a first network node and a second network node, wherein the first network node includes a daemon process configured to monitor a local port of the network node for one or more errors associated with a link that connects the first network node and the second network node, wherein the link is in a first state and is configured to transfer both normal data and subnet management packet (SMP) data, perform a disabling operation on the link to put the link into a second state, in response to the one or more errors exceeding a configurable threshold, wherein the link in the second state is configured to transfer only SMP data, and record the state of the link in a persistent storage, wherein the recorded state is configured to prevent the link from being automatically enabled when the network node is reset; and
    a subnet manager in the fabric network, wherein the subnet manager operates to receive, via the link, one or more management data messages from the first network node, wherein the one or more management data messages indicate that the link is disabled;
    wherein the managing node, upon receiving the one or more management data messages, operates to use the link to perform further discovery in the second network node, invoke the daemon process to initiate a test on the link, determine that errors at the local port of the first network node have dropped below the configurable threshold, and change the link from the second state to the first state; and
    wherein the daemon process, in response to the state change of the link from the second state to the first state, operates to remove the recorded state from the persistence storage when the first network node is reset.

15. The system according to claim 14, wherein the network node operates to request the disabled link to be enabled.

16. The system according to claim 14, wherein the subnet manager operates to include the enabled link in a subnet topology.

17. The system a
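
The persistence steps recited in the independent claims (record the disabled state, keep the link disabled across a reset, clear the record once the subnet manager has restored the link) can be sketched as follows. This is an illustrative Python model only: the `LinkDaemon` class, its method names, and the JSON file used as "persistent storage" are all assumptions, and treating "dropped below the threshold" as a strict `<` comparison is an interpretation, not something the claims specify.

```python
import json
import tempfile
from pathlib import Path


class LinkDaemon:
    """Hypothetical daemon that tracks which local ports are limited to SMP traffic."""

    def __init__(self, store, threshold):
        self.store = Path(store)      # assumed persistent storage: a JSON file
        self.threshold = threshold    # configurable error threshold
        # Reloading on start models a node reset: recorded ports stay disabled.
        self.smp_only = set(self._load())

    def _load(self):
        if self.store.exists():
            return json.loads(self.store.read_text())
        return []

    def _save(self):
        self.store.write_text(json.dumps(sorted(self.smp_only)))

    def report_errors(self, port, errors):
        """Disable the link and record it if errors exceed the threshold."""
        if errors > self.threshold and port not in self.smp_only:
            self.smp_only.add(port)
            self._save()
            return True   # caller would now notify the subnet manager via SMP
        return False

    def test_passes(self, errors_during_test):
        """Link test requested by the subnet manager (strict '<' is an assumption)."""
        return errors_during_test < self.threshold

    def enable(self, port):
        """Called after the subnet manager restores the link; drop the record."""
        self.smp_only.discard(port)
        self._save()


with tempfile.TemporaryDirectory() as tmp:
    path = Path(tmp) / "disabled_links.json"
    daemon = LinkDaemon(path, threshold=10)
    daemon.report_errors(port=1, errors=50)

    # Simulated reset: a new daemon instance reads the persisted record.
    after_reset = LinkDaemon(path, threshold=10)
    survived_reset = 1 in after_reset.smp_only

    # The subnet manager's test passes, so the link is enabled and the record removed.
    if after_reset.test_passes(errors_during_test=0):
        after_reset.enable(1)
    cleared_after_enable = 1 not in LinkDaemon(path, threshold=10).smp_only
```

Writing the record before reporting to the subnet manager means a reset between the two steps cannot silently bring a degraded link back into service.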

Assignees

Inventors

Classifications

  • involving additional secure or trusted devices, e.g. TPM, smartcard, USB or software token (network architectures or network communication protocols for supporting authentication of entities using an additional device in a packet data network H04L63/0853)

  • using passwords (cryptographic mechanisms or cryptographic arrangements for entity authentication using a predetermined code H04L9/3226)

  • using challenge-response

  • based on the identity of the terminal or configuration, e.g. MAC address, hardware or software configuration or device fingerprint

  • wherein the sending and receiving network entities apply asymmetric encryption, i.e. different keys for encryption and decryption (cryptographic mechanisms or cryptographic arrangements for public-key encryption H04L9/30)

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9900293B2 cover?
A system and method can support automatic disabling of degraded links in an InfiniBand (IB) network. At least one node in a fabric can monitor one or more local ports of the at least one node for one or more error states associated with a link at the at least one node, wherein the link is connected to a local port of the at least one node. The at least one node further allows a subnet manager t…
Who is the assignee on this patent?
Johnsen Bjørn Dag, Moxnes Dag Georg, Hodoba Predrag, and 1 more
What technology area does this patent fall under?
Primary CPC classification H04L63/0876. Mapped technology areas include Electricity.
When was this patent published?
Publication date Feb 20, 2018 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).