Continuing operation of a quorum based system after failures

US10049011B2 · US · B2

Patent metadata
Field: Value
Publication number: US-10049011-B2
Application number: US-201615144950-A
Country: US
Kind code: B2
Filing date: May 3, 2016
Priority date: May 3, 2016
Publication date: Aug 14, 2018
Grant date: Aug 14, 2018


Abstract

Official abstract text for this publication.

A processor-implemented method for continuing operation of a quorum based system is provided. The method detects a loss of quorum. A plurality of speculative configurations is created, whereby each speculative configuration is isolated from other speculative configurations in the quorum based system. Each speculative configuration continues to order requests during the creation of speculative configurations. The method selects and starts one of the plurality of speculative configurations as a new operational configuration. Ordered requests continue to the new operational configuration. The original configuration of the quorum based system is restarted in response to the plurality of speculative configurations not being isolated.
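The first two steps of the abstract (detecting a loss of quorum, then grouping the surviving nodes into isolated speculative configurations) can be illustrated with a minimal sketch. It is not the patented implementation. It assumes that quorum means a strict majority of the original configuration and that "isolated" means no network link connects two groups; neither definition is stated on this page. The names `majority_quorum`, `speculative_configurations`, and `quorum_lost` are hypothetical.

```python
from typing import Dict, List, Set


def majority_quorum(total: int) -> int:
    """Smallest strict majority of `total` processes (assumed quorum rule)."""
    return total // 2 + 1


def speculative_configurations(links: Dict[str, Set[str]]) -> List[Set[str]]:
    """Group functioning nodes into mutually unreachable sets.

    `links` maps each functioning node to the peers it can still reach.
    Failed nodes are simply absent from the mapping. A link reported by
    either endpoint is treated as bidirectional (an assumption).
    """
    # Build a symmetric adjacency restricted to functioning nodes.
    adjacency: Dict[str, Set[str]] = {node: set() for node in links}
    for node, peers in links.items():
        for peer in peers:
            if peer in adjacency:
                adjacency[node].add(peer)
                adjacency[peer].add(node)

    seen: Set[str] = set()
    groups: List[Set[str]] = []
    for start in sorted(adjacency):
        if start in seen:
            continue
        # Depth-first walk collects one connected component.
        group: Set[str] = set()
        stack = [start]
        while stack:
            node = stack.pop()
            if node in group:
                continue
            group.add(node)
            stack.extend(adjacency[node] - group)
        seen |= group
        groups.append(group)
    return groups


def quorum_lost(original_size: int, groups: List[Set[str]]) -> bool:
    """True when no surviving group reaches the assumed majority quorum."""
    needed = majority_quorum(original_size)
    return all(len(group) < needed for group in groups)


# Example: a five-node system where n4 and n5 failed and n3 is cut off.
links = {"n1": {"n2"}, "n2": {"n1"}, "n3": set()}
groups = speculative_configurations(links)
lost = quorum_lost(5, groups)
```

In this example the surviving nodes split into two candidate configurations, neither of which holds three of the five original processes, so the sketch reports a loss of quorum. Because the groups are connected components, they share no members and no links, which is the sense of isolation assumed here.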

First claim

Opening claim text (preview).

What is claimed is:

1. A processor-implemented method for continuing operation of a quorum based system comprising: detecting partitioning of the quorum based system as a result of a loss of quorum after one or more of a network failure, a software process failure, and a server hardware failure; creating a plurality of speculative configurations of one or more functioning nodes of the quorum based system in response to the detected partitioning, wherein each speculative configuration is isolated from other speculative configurations in the quorum based system, and wherein each speculative configuration continues to order requests while the plurality of speculative configurations is created; selecting and starting one of the plurality of speculative configurations as a new operational configuration, wherein the ordered requests are added to the new operational configuration; and restarting an original configuration of the quorum based system, in response to the plurality of speculative configurations not being isolated, wherein restarting the original configuration comprises: restarting a number of processes, wherein the number of processes is a minimum quorum of the original configuration; synchronizing state information among the restarted number of processes, wherein the state information includes notifying the restarted number of processes that the original configuration is no longer active; altering quorum definitions of the restarted number of processes and the processes in the new operational configuration to the original configuration; and restarting and integrating any remaining failed processes into the original configuration.

2. The method of claim 1, wherein the starting further comprises: selecting the new operational configuration from the plurality of speculative configurations; stopping all processes belonging to the original configuration of the quorum based system but not listed in the new operational configuration; and altering quorum definitions of all of the processes belonging to the original configuration listed in the new operational configuration, wherein all of the processes belonging to the original configuration are removed from the original configuration and added to the new operational configuration.

3. The method of claim 1, wherein the configurations of the plurality of speculative configurations are automatically created based on a plurality of partitioning rules.

4. The method of claim 3, wherein based on one of the plurality of partitioning rules, the speculative configuration having a most processes is the new operational configuration.

5. The method of claim 3, wherein based on one of the plurality of partitioning rules, the new operational configuration is the speculative configuration having a highest priority.

6. The method of claim 3, wherein the new operational configuration is selected based on the plurality of partitioning rules, or is manually selected from a display of available speculative configurations.

7. A computer program product for continuing operation of a quorum based system comprising: a plurality of agents embodied on a computer readable storage medium, the plurality of agents including program instructions executable by a processor, the program instructions comprising: program instructions to detect partitioning of the quorum based system as a result of a loss of quorum after one or more of a network failure, a software process failure, and a server hardware failure; program instructions to create a plurality of speculative configurations of one or more functioning nodes of the quorum based system in response to the detected partitioning, wherein each speculative configuration is isolated from other speculative configurations in the quorum based system, and wherein each speculative configuration continues to order requests while the plurality of speculative configurations is created; program instructions to select and start one of the plurality of speculative configurations as a new operational configuration, wherein the ordered requests are added to the new operational configuration; and program instructions to restart an original configuration of the quorum based system, in response to the plurality of speculative configurations not being isolated, wherein the program instructions to restart the original configuration comprise: program instructions to restart a number of processes, wherein the number of processes is a minimum quorum of the original configuration; program instructions to synchronize state information among the restarted number of processes, wherein the state information includes notifying the restarted number of processes that the original configuration is no longer active; program instructions to alter quorum definitions of the restarted number of processes and the processes in the new operational configuration to the original configuration; and program instructions to restart and integrating any remaining failed processes into the original configuration.

8. The computer program product of claim 7, wherein the program instructions to start further comprise: program instructions to select the new operational configuration from the plurality of speculative configurations; program instructions to stop all processes belonging to the original configuration of the quorum based system but not listed in the new operational configuration; and program instructions to alter quorum definitions of all of the processes belonging to the original configuration listed in the new operational configuration, wherein all of the processes belonging to the original configuration are removed from the original configuration and added to the new operational configuration.

9. The computer program product of claim 7, wherein the configurations of the plurality of speculative configurations are automatically created based on a plurality of partitioning rules.

10. The computer program product of claim 9, wherein based on one of the plurality of partitioning rules, the speculative configuration having a most processes is the new operational configuration.

11. The computer program product of claim 9, wherein based on one of the plurality of partitioning rules, the new operational configuration is the speculative configuration having a highest priority.

12. A computer system for continuing operation of a quorum based system comprising: one or more processors, one or more computer-readable storage devices, and a plurality of program instructions stored on at least one of the one or more storage devices for execution by at least one of the one or more processors, the plurality of program instructions comprising: program instructions to detect partitioning of the quorum based system as a result of a loss of quorum after one or more of a network failure, a software process failure, and a server hardware failure; program instructions to create a plurality of speculative configurations of one or more functioning nodes of the quorum based system in response to the detected partitioning, wherein each speculative configuration is isolated from other speculative configurations in the quorum based system, and wherein each speculative configuration continues to order requests while the plurality of speculative configurations is created; program instructions to select and start one of the plurality of speculative configurations as a new operational configuration, wherein the ordered requests continue to the new operational configuration; and program instructions to restart an original configuration of the quorum based system, in response to the plurality of speculative configurations not being isolated, wherein
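The two partitioning rules named in the dependent claims (most processes, highest priority) and the start steps of claim 2 (stop processes outside the chosen configuration, then alter the quorum definitions of those inside it) can be sketched as follows. This is an illustrative reading of the claim language, not the patented implementation. The `SpeculativeConfig` type, the integer `priority` field (larger meaning preferred), and the representation of a quorum definition as a member list are all assumptions introduced for the example.

```python
from dataclasses import dataclass
from typing import Dict, FrozenSet, List, Tuple


@dataclass(frozen=True)
class SpeculativeConfig:
    name: str
    processes: FrozenSet[str]
    priority: int = 0  # hypothetical field: larger value means preferred


def select_by_most_processes(configs: List[SpeculativeConfig]) -> SpeculativeConfig:
    """Rule in the style of claims 4 and 10: pick the largest configuration.

    Ties resolve to the earliest candidate, a choice made for this sketch.
    """
    return max(configs, key=lambda c: len(c.processes))


def select_by_priority(configs: List[SpeculativeConfig]) -> SpeculativeConfig:
    """Rule in the style of claims 5 and 11: pick the highest priority."""
    return max(configs, key=lambda c: c.priority)


def start_new_configuration(
    original: FrozenSet[str], chosen: SpeculativeConfig
) -> Tuple[List[str], Dict[str, List[str]]]:
    """Return (processes to stop, new quorum definition per kept process).

    Processes of the original configuration that are not listed in the
    chosen configuration are stopped; the rest get a quorum definition
    that names only members of the new operational configuration.
    """
    stopped = sorted(original - chosen.processes)
    members = sorted(original & chosen.processes)
    quorum_definitions = {process: members for process in members}
    return stopped, quorum_definitions


# Example: five original processes split into two speculative configurations.
original = frozenset({"p1", "p2", "p3", "p4", "p5"})
config_a = SpeculativeConfig("A", frozenset({"p1", "p2"}), priority=1)
config_b = SpeculativeConfig("B", frozenset({"p3"}), priority=5)

largest = select_by_most_processes([config_a, config_b])
preferred = select_by_priority([config_a, config_b])
stopped, quorum_definitions = start_new_configuration(original, largest)
```

The two rules can disagree, as they do here: the size rule picks configuration A while the priority rule picks B. Claim 6 also allows a manual choice from a display of available configurations, which this sketch does not model.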

Assignees

Inventors

Classifications

  • Eliminating the failing redundant component · CPC title

  • G06F11/182 · Primary

    based on mutual exchange of the output between redundant processing components · CPC title

  • Active fault masking without idle spares · CPC title

  • G06F11/142 · Primary

    Reconfiguring to eliminate the error (group management mechanisms in a peer-to-peer network H04L67/1044) · CPC title

  • Real-time · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10049011B2 cover?
A processor-implemented method for continuing operation of a quorum based system is provided. The method detects a loss of quorum. A plurality of speculative configurations is created, whereby each speculative configuration is isolated from other speculative configurations in the quorum based system. Each speculative configuration continues to order requests during the creation of speculative …
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F11/182. Mapped technology areas include Physics.
When was this patent published?
Publication date Aug 14, 2018 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).