Write performance in fault-tolerant clustered storage systems

US9916201B2 · US · B2

Patent metadata
Field               Value
Publication number  US-9916201-B2
Application number  US-201514811040-A
Country             US
Kind code           B2
Filing date         Jul 28, 2015
Priority date       Dec 19, 2012
Publication date    Mar 13, 2018
Grant date          Mar 13, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Embodiments relate to supporting transaction data committed to a stable storage. Committed data in the cluster is stored in the persistent cache layer and replicated and stored in the cache layer of one or more secondary nodes. One copy is designated as a master copy and all other copies are designated as replicas, with an exclusive write lock assigned to the master and a shared write lock extended to each replica. An acknowledgement of receiving the data is communicated following confirmation that the data has been replicated to each node designated to receive a replica. Managers and a director are provided to support management of the master copy and the replicas within the file system, including invalidation of replicas, fault tolerance associated with failure of a node holding a master copy, recovery from a failed node, recovery of the file system from a power failure, and transferring master and replica copies within the file system.
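
The write path summarized in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `Lock`, `CachedRange`, `Node`, and `synchronous_write` are hypothetical, a dictionary stands in for each node's persistent cache layer, and lock handling is reduced to a tag on each cached entry. The sketch only shows the ordering described above, in which the acknowledgement is returned after every designated replica holder has confirmed the data.

```python
from dataclasses import dataclass, field
from enum import Enum


class Lock(Enum):
    EXCLUSIVE = "exclusive"  # held by the master copy
    SHARED = "shared"        # extended to each replica


@dataclass
class CachedRange:
    data: bytes
    is_master: bool
    lock: Lock


@dataclass
class Node:
    name: str
    up: bool = True
    # Stand-in for the node's persistent (non-volatile) cache layer.
    cache: dict = field(default_factory=dict)

    def store(self, key: str, entry: CachedRange) -> bool:
        """Store an entry and report whether this node confirmed it."""
        if not self.up:
            return False
        self.cache[key] = entry
        return True


def synchronous_write(primary: Node, secondaries: list, key: str, data: bytes) -> bool:
    """Return True (the acknowledgement) only after every designated
    replica holder has confirmed receipt of the data."""
    if not primary.store(key, CachedRange(data, True, Lock.EXCLUSIVE)):
        return False
    confirmations = [
        node.store(key, CachedRange(data, False, Lock.SHARED))
        for node in secondaries
    ]
    # Assumption: at least one secondary node is required for an acknowledgement.
    return bool(confirmations) and all(confirmations)
```

In this sketch, a write to a primary with two healthy secondaries is acknowledged, while the same write is not acknowledged if any designated secondary is down. How a real system would retry or choose a different replica holder is outside what the abstract states.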

First claim

Opening claim text (preview).

We claim:

1. A computer program product comprising a computer readable hardware storage device having program code embodied therewith, the program code executable by a processing unit to: temporarily hold committed data in distributed non-volatile memory of nodes in a cluster; in response to receiving a synchronous write transaction in a file system, place data associated with the received write transaction in a page cache layer and replicate the received data within the page cache layer of one or more remote nodes in the cluster; distinguish between a master copy and a replica of the received data; and invalidate each replica on the one or more remote nodes in response to flushing the master copy to persistent storage.

2. The computer program product of claim 1, wherein the program code to distinguish between the master copy and replica of the received data further comprising code to apply an existing cache policy to the master copy of the received data.

3. The computer program product of claim 1, wherein the master copy is a first byte range in the cache layer.

4. The computer program product of claim 3, further comprising the master copy having an exclusive cluster-wide write lock on the first byte range.

5. The computer program product of claim 1, wherein the replica is a second byte range in the cache layer.

6. The computer program product of claim 5, further comprising the replica having a shared cluster-wide write lock in the second byte range.

7. The computer program product of claim 1, further comprising in response to recovery of the cluster from a power failure, program code to recover data from non-volatile memory content in each node, and identify master and replica copies from a characteristic of a data byte range, and validate master and replica copies by re-acquiring cluster-wide write locks.

8. A system comprising: a page cache layer integrated with a stable memory layer in a file system to temporarily hold committed data in distributed non-volatile memory of nodes in a cluster, each node having a processing unit in communication with memory; in response to receiving a synchronous write transaction in a file system, place data associated with the received write transaction in a page cache layer and replicate the received data within the page cache layer of one or more remote nodes in the cluster; distinguish between a master copy and a replica of the received data; and invalidate each replica on the one or more remote nodes in response to flushing the master copy to persistent storage.

9. The system of claim 8, further comprising code to apply an existing cache policy to the master copy of the received data.

10. The system of claim 8, wherein the master copy is a first byte range in the cache layer.

11. The system of claim 10, further comprising the master copy having an exclusive cluster-wide write lock on the first byte range.

12. The system of claim 8, wherein the replica is a second byte range in the cache layer.

13. The system of claim 12, further comprising the replica having a shared cluster-wide write lock in the second byte range.

14. The system of claim 8, further comprising in response to recovery of the cluster from a power failure, recover data from non-volatile memory content in each node, and identify master and replica copies from a characteristic of a data byte range, and validate master and replica copies by re-acquiring cluster-wide write locks.
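
Two steps in the claims, invalidating replicas when the master copy is flushed (claims 1 and 8) and re-acquiring locks after a power failure (claims 7 and 14), can be sketched in Python as follows. This is an illustration under stated assumptions, not the claimed program code: `Entry`, `Node`, `flush_master`, and `recover_after_power_failure` are hypothetical names, the `role` field stands in for the unspecified "characteristic of a data byte range", and dictionaries model non-volatile memory and persistent storage.

```python
from dataclasses import dataclass, field


@dataclass
class Entry:
    data: bytes
    role: str  # "master" or "replica"; hypothetical per-range characteristic


@dataclass
class Node:
    name: str
    # Models non-volatile memory whose content survives a power failure.
    nvram: dict = field(default_factory=dict)


def flush_master(owner: Node, remotes: list, key: str, disk: dict) -> None:
    """Write the master copy to persistent storage, then invalidate replicas."""
    entry = owner.nvram[key]
    if entry.role != "master":
        raise ValueError("only the master copy is flushed")
    disk[key] = entry.data
    for node in remotes:
        node.nvram.pop(key, None)  # invalidate the replica, if present


def recover_after_power_failure(nodes: list) -> dict:
    """Rebuild a lock table from the surviving non-volatile memory content.

    Each master copy re-acquires an exclusive write lock and each replica
    a shared write lock, identified here by the hypothetical role tag.
    """
    locks = {}
    for node in nodes:
        for key, entry in node.nvram.items():
            mode = "exclusive" if entry.role == "master" else "shared"
            locks.setdefault(key, []).append((node.name, mode))
    return locks
```

The sketch leaves the master copy in the owner's cache after a flush, since the claims only require that the replicas be invalidated; whether the master is then evicted would fall to the existing cache policy mentioned in claims 2 and 9.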

Assignees

IBM

Inventors

Classifications

  • the problem or solution involving locking · CPC title

  • Physics · mapped topic

  • using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9916201B2 cover?
Embodiments relate to supporting transaction data committed to a stable storage. Committed data in the cluster is stored in the persistent cache layer and replicated and stored in the cache layer of one or more secondary nodes. One copy is designated as a master copy and all other copies are designated as replica, with an exclusive write lock assigned to the master and a shared write lock exten…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F11/1415. Mapped technology areas include Physics.
When was this patent published?
Published Mar 13, 2018 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).