Parallel checksumming of data chunks of a shared data object using a log-structured file system

US9436722B1 · US · B1

Patent metadata
Field               Value
Publication number  US-9436722-B1
Application number  US-201313799264-A
Country             US
Kind code           B1
Filing date         Mar 13, 2013
Priority date       Mar 13, 2013
Publication date    Sep 6, 2016
Grant date          Sep 6, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Checksum values are generated and used to verify the data integrity. A client executing in a parallel computing system stores a data chunk to a shared data object on a storage node in the parallel computing system. The client determines a checksum value for the data chunk; and provides the checksum value with the data chunk to the storage node that stores the shared object. The data chunk can be stored on the storage node with the corresponding checksum value as part of the shared object. The storage node may be part of a Parallel Log-Structured File System (PLFS), and the client may comprise, for example, a Log-Structured File System client on a compute node or burst buffer. The checksum value can be evaluated when the data chunk is read from the storage node to verify the integrity of the data that is read.
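The write and read paths summarized in the abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the abstract does not name a checksum algorithm, so CRC-32 (`zlib.crc32`) is an assumption, and `StorageNode`, `StoredChunk`, `write_chunk`, `read_chunk`, and `ChecksumError` are hypothetical names, with an in-memory dictionary standing in for the storage node.

```python
import zlib
from dataclasses import dataclass


class ChecksumError(Exception):
    """Raised when a chunk read back does not match its stored checksum."""


@dataclass(frozen=True)
class StoredChunk:
    offset: int      # logical offset of the chunk within the shared object
    data: bytes      # chunk payload
    checksum: int    # checksum supplied by the client at write time


class StorageNode:
    """Hypothetical in-memory stand-in for the storage node."""

    def __init__(self):
        self._chunks = {}

    def put(self, offset, data, checksum):
        # The chunk is stored together with the checksum the client provided.
        self._chunks[offset] = StoredChunk(offset, data, checksum)

    def get(self, offset):
        return self._chunks[offset]


def write_chunk(node, offset, data):
    # Client side: compute the checksum as the chunk is written
    # (CRC-32 is an assumed algorithm) and send both to the storage node.
    checksum = zlib.crc32(data)
    node.put(offset, data, checksum)
    return checksum


def read_chunk(node, offset):
    # On read: recompute the checksum and compare it with the stored value.
    stored = node.get(offset)
    if zlib.crc32(stored.data) != stored.checksum:
        raise ChecksumError(f"checksum mismatch for chunk at offset {offset}")
    return stored.data
```

A caller would use `write_chunk(node, 0, b"...")` followed by `read_chunk(node, 0)`; the read either returns verified data or raises `ChecksumError`. Correcting a mismatched chunk, which the dependent claims mention, is outside this sketch.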

First claim

Opening claim text (preview).

What is claimed is:

1. A method performed by a Log-Structured File System client executing on one or more of a compute node and a burst buffer node in a parallel computing system for storing a data chunk generated by said parallel computing system to a shared data object on a storage node in said parallel computing system, comprising: processing said data chunk generated by said parallel computing system using said Log-Structured File System client, wherein said Log-Structured File System client executes on one or more of said compute node and said burst buffer node in said parallel computing system; determining, by said Log-Structured File System client, a checksum value for said data chunk as said data chunk is written by said Log-Structured File System client to said storage node, wherein a plurality of other Log-Structured File System clients determine a plurality of checksums for a corresponding plurality of other data chunks of said shared data object in parallel; and providing, by said Log-Structured File System client, said checksum value with said data chunk to said storage node for storage as part of said shared object, wherein said storage node stores said data chunk with said corresponding checksum value as part of said shared object with said plurality of checksums for said corresponding plurality of other data chunks of said shared data object, and wherein said shared object is shared by a plurality of distributed processes in said parallel computing system.

2. The method of claim 1, wherein said storage node comprises a Log-Structured File server.

3. The method of claim 1, wherein said storage node is part of a log structured virtual parallel file system.

4. The method of claim 1, wherein said storage node is part of a Parallel Log-Structured File System (PLFS).

5. The method of claim 1, further comprising the step of evaluating said checksum value when said data chunk is read from said storage node.

6. The method of claim 5, further comprising the step of recomputing said checksum value when said data chunk is read from said storage node and comparing said recomputed checksum value to said stored checksum value.

7. The method of claim 6, further comprising the step of providing a verified data chunk if said corresponding checksum values match.

8. The method of claim 6, further comprising the step of correcting said data chunk if said corresponding checksum values do not match.

9. A tangible machine-readable recordable storage medium for storing a data chunk generated by a parallel computing system to a shared data object on a storage node in said parallel computing system, wherein one or more software programs when executed by one or more processing devices implement the steps of the method of claim 1.

10. A compute node apparatus in a parallel computing system for executing a Log-Structured File System client that stores a data chunk generated by said parallel computing system to a shared data object on a storage node in said parallel computing system, said compute node apparatus comprising: a memory; and at least one hardware device operatively coupled to the memory and configured to: process said data chunk generated by said parallel computing system using said Log-Structured File System client, wherein said Log-Structured File System client executes on one or more of said compute node and said burst buffer node in said parallel computing system; determine, by said Log-Structured File System client, a checksum value for said data chunk as said data chunk is written by said Log-Structured File System client to said storage node, wherein a plurality of other Log-Structured File System clients determine a plurality of checksums for a corresponding plurality of other data chunks of said shared data object in parallel; and provide, by said Log-Structured File System client, said checksum value with said data chunk to said storage node for storage as part of said shared object, wherein said storage node stores said data chunk with said corresponding checksum value as part of said shared object with said plurality of checksums for said corresponding plurality of other data chunks of said shared data object, and wherein said shared object is shared by a plurality of distributed processes in said parallel computing system.

11. The apparatus of claim 10, wherein said storage node comprises a Log-Structured File server.

12. The apparatus of claim 10, wherein said apparatus comprises one or more of a compute node and a burst buffer node.

13. The apparatus of claim 10, wherein said at least one hardware device is further configured to evaluate said checksum value when said data chunk is read from said storage node.

14. The apparatus of claim 13, wherein said at least one hardware device is further configured to recompute said checksum value when said data chunk is read from said storage node and comparing said recomputed checksum value to said stored checksum value.

15. The apparatus of claim 14, wherein said at least one hardware device is further configured to provide a verified data chunk if said corresponding checksum values match.

16. The apparatus of claim 14, wherein said at least one hardware device is further configured to correct said data chunk if said corresponding checksum values do not match.

17. A method performed by a Log-Structured File System server node in a parallel computing system for storing a data chunk as a shared object, comprising: receiving, by said Log-Structured File System server, said data chunk with a corresponding checksum value from a Log-Structured File System client on a compute node in said parallel computing system, wherein said data chunk is generated by said parallel computing system using said Log-Structured File System client, and wherein said corresponding checksum value is generated by said Log-Structured File System client as said data chunk is written by said Log-Structured File System Client, wherein a plurality of other Log-Structured File System Clients determine a plurality of checksums for a corresponding plurality of other data chunks of said shared data object in parallel; and storing, by said Log-Structured File System server, said received data chunk and said corresponding checksum value to a shared data object on said storage node, wherein said storage node stores said data chunk with said corresponding checksum value as part of said shared object with said plurality of checksums for said corresponding plurality of other data chunks of said shared data object, and wherein said shared object is shared by a plurality of distributed processes in said parallel computing system.

18. A tangible machine-readable recordable storage medium for storing a data chunk as a shared object in a parallel computing system, wherein one or more software programs when executed by one or more processing devices implement the steps of the method of claim 17.

19. A Log-Structured File System server apparatus in a parallel computing system for storing a data chunk as a shared object, said storage node apparatus comprising: a memory; and at least one hardware device operatively coupled to the memory and configured to: receive, by said Log-Structured File System server, said data chunk with a corresponding checksum value from a Log-Structured File System client on a compute node in said parallel computing system, wherein said data chunk is generated by said parallel computing system using Log-Structured File System client, and wherein said corresponding checksum value is generated by said
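The parallel aspect recited in claims 1 and 17, where several clients each checksum their own chunk of one shared object and a server stores each chunk with its checksum, might look roughly like the sketch below. It rests on several assumptions: threads stand in for separate Log-Structured File System clients on compute or burst buffer nodes, CRC-32 stands in for the unspecified checksum, and `SharedObjectServer`, `client_write`, and `parallel_write` are hypothetical names. The append-only list is only a loose analogy to a log-structured layout.

```python
import zlib
from concurrent.futures import ThreadPoolExecutor
from threading import Lock


class SharedObjectServer:
    """Hypothetical server holding one shared object as an append-only log."""

    def __init__(self):
        self._log = []        # records of (logical_offset, data, checksum)
        self._lock = Lock()   # serializes appends from concurrent clients

    def store(self, offset, data, checksum):
        # Server side: keep the received chunk together with its checksum.
        with self._lock:
            self._log.append((offset, data, checksum))

    def read_all(self):
        # Reassemble the object in logical order, verifying every chunk.
        # Assumes the chunks are contiguous and do not overlap.
        out = bytearray()
        for offset, data, checksum in sorted(self._log, key=lambda r: r[0]):
            if zlib.crc32(data) != checksum:
                raise ValueError(f"checksum mismatch at offset {offset}")
            out += data
        return bytes(out)


def client_write(server, offset, data):
    # Client side: checksum the chunk as it is written, then send both.
    server.store(offset, data, zlib.crc32(data))


def parallel_write(server, payload, chunk_size, workers=4):
    # Split the payload into chunks; each worker plays the role of one client.
    chunks = [(i, payload[i:i + chunk_size])
              for i in range(0, len(payload), chunk_size)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # list() forces completion and surfaces any worker exception.
        list(pool.map(lambda c: client_write(server, *c), chunks))
```

Because each client checksums only its own chunk, the checksum work is spread across the writers rather than concentrated on the server; chunks may arrive in any order, and the stored logical offset is what lets `read_all` reassemble them.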

Assignees

EMC Corp; Los Alamos National Security LLC

Inventors

Classifications

  • in which an application is distributed across nodes in the network (software deployment G06F8/60; multiprogramming arrangements G06F9/46) · CPC title

  • Physics · mapped topic

  • Protocols for data compression, e.g. ROHC · CPC title

  • Parallel file systems, i.e. file systems supporting multiple processors · CPC title

  • Ensuring data consistency and integrity · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9436722B1 cover?
Checksum values are generated and used to verify the data integrity. A client executing in a parallel computing system stores a data chunk to a shared data object on a storage node in the parallel computing system. The client determines a checksum value for the data chunk; and provides the checksum value with the data chunk to the storage node that stores the shared object. The data chunk can b…
Who is the assignee on this patent?
EMC Corp; Los Alamos National Security LLC
What technology area does this patent fall under?
Primary CPC classification G06F17/30371. Mapped technology areas include Physics.
When was this patent published?
Publication date Sep 6, 2016 (kind code B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).