Accelerated erasure coding for storage systems

US2017288704A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2017288704-A1
Application numberUS-201615281172-A
CountryUS
Kind codeA1
Filing dateSep 30, 2016
Priority dateMar 30, 2016
Publication dateOct 5, 2017
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for generating coded fragments comprises receiving data to be encoded, splitting the data into a plurality of data fragments, identifying a first group of data fragments from among the plurality of data fragments using a coding matrix, summing the data fragments within the first group of data fragments to generate a first group sum, and using the first group sum to calculate at least a portion of two or more coded fragments.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method comprising: receiving data to be encoded; splitting the data into a plurality of data fragments; identifying a first group of data fragments from among the plurality of data fragments using a coding matrix; summing the data fragments within the first group of data fragments to generate a first group sum; and using the first group sum to calculate at least a portion of two or more coded fragments. 2 . The method of claim 1 wherein the coding matrix comprises a binary coding matrix, wherein identifying the first group of data fragments from among the plurality of data fragments comprises counting a number of 1's within rows of the coding matrix. 3 . The method of claim 2 wherein summing the data fragments within the first group of data fragments comprising XOR'ing the data fragments within the first group of data fragments. 4 . The method of claim 2 further comprising: updating two or more rows of the coding matrix to set 1's to 0's, wherein each of the two or more updated rows of the coding matrix is associated with a corresponding one of the two or more coded fragments. 5 . The method of claim 4 further comprising: identifying a second group of data fragments from among the plurality of data fragments using the updated coding matrix; summing the data fragments within the second group of data fragments to generate a second group sum; and using the second group sum to calculate at least a portion of two or more coded fragments. 6 . The method of claim 1 wherein identifying the first group of data fragments comprises using a greedy algorithm. 7 . The method of claim 6 wherein identifying a group of the data fragments using a greedy algorithm comprises: generating two or more groups of data fragments each having the same number of data fragments; for each of the two or more groups of data fragments, counting a number of coded fragments whose calculations include the sum of the data fragments within the group; and identifying the first group of data fragments as the group of data fragments from among the two or more groups of data fragments having the highest count. 8 . The method of claim 1 further comprising: storing the plurality of data fragments and the two or more coded fragments across multiple nodes of a distributed storage system. 9 . A system comprising: one or more processors; a volatile memory; and a non-volatile memory storing computer program code that when executed on the processor causes execution across the one or more processors of a process operable to perform the operations of: receiving data to be encoded; splitting the data into a plurality of data fragments; identifying a first group of data fragments from among the plurality of data fragments using a coding matrix; summing the data fragments within the first group of data fragments to generate a first group sum; and using the first group sum to calculate at least a portion of two or more coded fragments. 10 . The system of claim 9 wherein the coding matrix comprises a binary coding matrix, wherein the computer program code causes execution of a process operable to identify the first group of data fragments from among the plurality of data fragments by counting a number of 1's within rows of the coding matrix. 11 . The system of claim 10 wherein the computer program code causes execution of a process operable to sum the data fragments within the first group of data fragments by XOR'ing the data fragments within the first group of data fragments. 12 . The system of claim 10 wherein the computer program code causes execution of a process further operable to: update two or more rows of the coding matrix to set 1's to 0's, wherein each of the two or more updated rows of the coding matrix is associated with a corresponding one of the two or more coded fragments. 13 . The system of claim 12 wherein the computer program code causes execution of a process further operable to: identify a second group of data fragments from among the plurality of data fragments using the updated coding matrix; sum the data fragments within the second group of data fragments to generate a second group sum; and use the second group sum to calculate at least a portion of two or more coded fragments. 14 . The system of claim 9 wherein the computer program code causes execution of a process operable to identify the first group of data fragments using a greedy algorithm. 15 . The system of claim 14 wherein the computer program code causes execution of a process operable to identify a group of the data fragments using a greedy algorithm by: generating two or more groups of data fragments each having the same number of data fragments; for each of the two or more groups of data fragments, counting a number of coded fragments whose calculations include the sum of the data fragments within the group; and identifying the first group of data fragments as the group of data fragments from among the two or more groups of data fragments having the highest count. 16 . The system of claim 9 wherein the computer program code causes execution of a process further operable to: store the plurality of data fragments and the two or more coded fragments across multiple nodes of a distributed storage system. 17 . A computer program product tangibly embodied in a non-transitory computer-readable medium, the computer-readable medium storing program instructions that are executable to: receive data to be encoded; split the data into a plurality of data fragments; identify a first group of data fragments from among the plurality of data fragments using a coding matrix; sum the data fragments within the first group of data fragments to generate a first group sum; and use the first group sum to calculate at least a portion of two or more coded fragments.

Assignees

Inventors

Classifications

  • Specific encoding aspects, e.g. encoding by means of decoding · CPC title

  • Product codes · CPC title

  • H03M13/373Primary

    with erasure correction and erasure determination, e.g. for packet loss recovery or setting of erasures for the decoding of Reed-Solomon codes · CPC title

  • Reed-Solomon codes · CPC title

  • using codes or arrangements adapted for a specific type of error (G06F11/1048 takes precedence) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2017288704A1 cover?
A method for generating coded fragments comprises receiving data to be encoded, splitting the data into a plurality of data fragments, identifying a first group of data fragments from among the plurality of data fragments using a coding matrix, summing the data fragments within the first group of data fragments to generate a first group sum, and using the first group sum to calculate at least a…
Who is the assignee on this patent?
Emc Ip Holding Co Llc
What technology area does this patent fall under?
Primary CPC classification H03M13/2909. Mapped technology areas include Electricity.
When was this patent published?
Publication date Thu Oct 05 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).