File-based cluster-to-cluster replication recovery

US9697092B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9697092-B2
Application numberUS-201514836999-A
CountryUS
Kind codeB2
Filing dateAug 27, 2015
Priority dateAug 27, 2015
Publication dateJul 4, 2017
Grant dateJul 4, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Resiliency in file replication scenarios via maintenance of multiple replication queues with differing susceptibility to various sources of data loss. A plurality of requests are received at a first file system location to write to one or more file system objects at the first file system location. Details about the plurality of requests are recorded in at least two replication queues which differ, at least in part, as to types of failures which will cause data loss. Responsive to detection of data loss in one of the replication queues, details about the plurality of requests are retrieved from another of the replication queues.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method comprising: receiving, at a first file system location, a plurality of requests to write to one or more file system objects at the first file system location; recording details about the plurality of requests in both a primary and a secondary replication queue; and responsive to detection of data loss in the primary replication queue, retrieving the details about the plurality of requests from the secondary replication queue; wherein: both the primary and the secondary replication queues are at the first file system location; and the primary and secondary queues differ, at least in part, as to types of failures which will cause data loss. 2. The computer-implemented method of claim 1 further comprising: responsive to receiving the plurality of requests, writing to a first copy of the one or more file system objects at the first file system location; and responsive to the retrieved details, replicating results of the plurality of write requests to a second file system location by writing to a second copy of the one or more file system objects at the second file system location. 3. The computer-implemented method of claim 2 wherein: the details recorded in the secondary replication queue identify which file system object(s) are affected by the plurality of requests; and the results of the plurality of write requests are replicated to the second file system location by transferring the entirety of the file system objects affected by the plurality of write requests from the first file system location to the second file system location. 4. The computer-implemented method of claim 2 wherein: the details recorded in the secondary replication queue include change data for the file system objects affected by the plurality of write requests; and the results of the plurality of write requests are replicated to the second file system location by transferring only changed portions of the file system objects affected by the plurality of write requests from the first file system location to the second file system location. 5. The computer-implemented method of claim 1 wherein the details recorded in the secondary replication queue survive a restart of the first file system location. 6. The computer-implemented method of claim 1 wherein: the primary replication queue is an in-memory data structure stored in volatile storage at the first file system location; and the secondary replication queue is an inode stored in non-volatile storage at the first file system location. 7. The computer-implemented method of claim 1 wherein the first file system location is a first node of a clustered file system and the second file system location is a second node of the clustered file system. 8. A computer program product comprising a non-transitory computer readable storage medium having stored thereon: first instructions executable by a device to cause the device to receive, at a first file system location, a plurality of requests to write to one or more file system objects at the first file system location; second instructions executable by the device to cause the device to record details about the plurality of requests in both a primary and a secondary replication queue; and third instructions executable by the device to cause the device to retrieve, responsive to detection of data loss in the primary replication queue, the details about the plurality of requests from the secondary replication queue; wherein: both the primary and the secondary replication queues are at the first file system location; and the primary and secondary queues differ, at least in part, as to types of failures which will cause data loss. 9. The product of claim 8 wherein the non-transitory computer readable storage medium has further stored thereon: fourth instructions executable by the device to cause the device to write, responsive to receiving the plurality of requests, to a first copy of the one or more file system objects at the first file system location; and fifth instructions executable by the device to cause the device to replicate, responsive to the retrieved details, results of the plurality of write requests to a second file system location by writing to a second copy of the one or more file system objects at the second file system location. 10. The product of claim 9 wherein: the details recorded in the secondary replication queue include change data for the file system objects affected by the plurality of write requests; and the results of the plurality of write requests are replicated to the second file system location by transferring only changed portions of the file system objects affected by the plurality of write requests from the first file system location to the second file system location. 11. The product of claim 8 wherein the details recorded in the secondary replication queue survive a restart of the first file system location. 12. The product of claim 8 wherein: the primary replication queue is an in-memory data structure stored in volatile storage at the first file system location; and the secondary replication queue is an inode stored in non-volatile storage at the first file system location. 13. The product of claim 8 wherein the first file system location is a first node of a clustered file system and the second file system location is a second node of the clustered file system. 14. A computer system comprising: a processor(s) set; and a computer readable storage medium; wherein: the processor set is structured, located, connected and/or programmed to execute instructions stored on the computer readable storage medium; the instructions include: first instructions executable by a device to cause the device to receive, at a first file system location, a plurality of requests to write to one or more file system objects at the first file system location; second instructions executable by the device to cause the device to record details about the plurality of requests in both a primary and a secondary replication queue; and third instructions executable by the device to cause the device to retrieve, responsive to detection of data loss in the primary replication queue, the details about the plurality of requests from the secondary replication queue; both the primary and the secondary replication queues are at the first file system location; and the primary and secondary queues differ, at least in part, as to types of failures which will cause data loss. 15. The system of claim 14 wherein the instructions further include: fourth instructions executable by the device to cause the device to write, responsive to receiving the plurality of requests, to a first copy of the one or more file system objects at the first file system location; and fifth instructions executable by the device to cause the device to replicate, responsive to the retrieved details, results of the plurality of write requests to a second file system location by writing to a second copy of the one or more file system objects at the second file system location. 16. The system of claim 15 wherein: the details recorded in the secondary replication queue identify which file system object(s) are affected by the plurality of requests; and the results of the plurality of write requests are replicated to the second file system location by transferring the entirety of the file system objects affected by the plurality of write requests from the first file system location to the second file system location. 17. The system of

Assignees

Inventors

Classifications

  • the resynchronized component or unit being a persistent storage device (re-synchronization of failed mirror storage G06F11/2082; rebuild or reconstruction of parity RAID storage G06F11/1008) · CPC title

  • Database-specific techniques · CPC title

  • Redundant storage or storage space (G06F11/2056 takes precedence) · CPC title

  • Resetting or repowering · CPC title

  • involving logging of persistent data for recovery · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9697092B2 cover?
Resiliency in file replication scenarios via maintenance of multiple replication queues with differing susceptibility to various sources of data loss. A plurality of requests are received at a first file system location to write to one or more file system objects at the first file system location. Details about the plurality of requests are recorded in at least two replication queues which diff…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F11/1471. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 04 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).