Copying data based on overwritten probabilities

US11662937B2 · US · B2

Patent metadata
Field: Value
Publication number: US-11662937-B2
Application number: US-202117518976-A
Country: US
Kind code: B2
Filing date: Nov 4, 2021
Priority date: Apr 20, 2021
Publication date: May 30, 2023
Grant date: May 30, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Techniques for replicating data involve: determining overwritten probabilities of a plurality of data blocks of a source storage device. The techniques further involve: classifying the plurality of data blocks as cold data blocks or hot data blocks based on the overwritten probabilities in the plurality of data blocks. The techniques further involve: replicating at least a portion of the cold data blocks to a target storage device prior to the hot data blocks. Accordingly, transmission resources used to re-replicate overwritten data during an initial replication period can be reduced, thereby achieving efficient data replication.
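The ordering described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the `Block` type, the `HOT_THRESHOLD` cut-off of 0.5, and the function names are hypothetical, and the overwritten probability of each block is assumed to have been estimated already.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Block:
    block_id: int
    # Assumed to be estimated elsewhere; a value in [0, 1].
    overwrite_probability: float


# Hypothetical cut-off; the abstract does not specify how the split is made.
HOT_THRESHOLD = 0.5


def classify(blocks, threshold=HOT_THRESHOLD):
    """Split blocks into (cold, hot) so every hot block has a higher
    overwritten probability than every cold block."""
    cold = [b for b in blocks if b.overwrite_probability < threshold]
    hot = [b for b in blocks if b.overwrite_probability >= threshold]
    return cold, hot


def replication_order(blocks, threshold=HOT_THRESHOLD):
    """Return the blocks with all cold blocks ahead of the hot blocks."""
    cold, hot = classify(blocks, threshold)
    return cold + hot


blocks = [Block(0, 0.9), Block(1, 0.1), Block(2, 0.6), Block(3, 0.2)]
order = [b.block_id for b in replication_order(blocks)]
print(order)  # [1, 3, 0, 2]
```

Sending blocks 1 and 3 first means the blocks most likely to change (0 and 2) are copied as late as possible, which is the stated way of reducing re-replication traffic during the initial replication period.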

First claim

Opening claim text (preview).

The invention claimed is:

1. A method for replicating data, comprising: determining overwritten probabilities of a plurality of data blocks in a source storage device, each respective data block including an overwritten probability, of the overwritten probabilities, indicating a probability of the respective data block being overwritten in the source storage device before the plurality of data blocks are all replicated to a target storage device; classifying the plurality of data blocks as cold data blocks or hot data blocks based on the overwritten probabilities of the plurality of data blocks, overwritten probabilities of the hot data blocks being higher than overwritten probabilities of the cold data blocks; and replicating at least a portion of the cold data blocks to the target storage device prior to the hot data blocks.

2. The method according to claim 1, wherein replicating at least the portion of the cold data blocks to the target storage device prior to the hot data blocks comprises at least one of the following: replicating the hot data blocks after replicating all of the cold data blocks to the target storage device; and replicating the hot data blocks after replicating a portion of the cold data blocks to the target storage device.

3. The method according to claim 1, wherein determining the overwritten probabilities comprises determining each of the overwritten probabilities based on at least one of the following: a read and write history of a data block; a type of data block; and an overwritten probability of a neighboring data block.

4. The method according to claim 3, wherein each of the plurality of data blocks comprises at least one data unit, and determining each of the overwritten probabilities comprises: determining a possibility of the at least one data unit, of the respective data block, being overwritten before the plurality of data blocks are all replicated to the target storage device; and determining the overwritten probability of the respective data block based at least on the possibility of the at least one data unit of the respective data block.

5. The method according to claim 4, wherein determining the possibility of the at least one data unit of the respective data block comprises determining the possibility based on at least one of the following: a read and write history of a data unit; a type of data unit; and a possibility of a neighboring data unit being overwritten.

6. The method according to claim 1, further comprising: when replicating the plurality of data blocks to the target storage device, re-replicating the plurality of data blocks in the source storage device, which are overwritten after being replicated to the target storage device, to the target storage device.

7. An electronic device, comprising: a processor; and a memory coupled to the processor, the memory having instructions stored therein, and the instructions, when executed by the processor, causing the device to execute actions comprising: determining overwritten probabilities of a plurality of data blocks in a source storage device, each respective data block including an overwritten probability, of the overwritten probabilities, indicating a probability of the respective data block being overwritten in the source storage device before the plurality of data blocks are all replicated to a target storage device; classifying the plurality of data blocks as cold data blocks or hot data blocks based on the overwritten probabilities of the plurality of data blocks, overwritten probabilities of the hot data blocks being higher than overwritten probabilities of the cold data blocks; and replicating at least a portion of the cold data blocks to the target storage device prior to the hot data blocks.

8. The device according to claim 7, wherein replicating at least the portion of the cold data blocks to the target storage device prior to the hot data blocks comprises at least one of the following: replicating the hot data blocks after replicating all of the cold data blocks to the target storage device; and replicating the hot data blocks after replicating a portion of the cold data blocks to the target storage device.

9. The device according to claim 7, wherein determining the overwritten probabilities comprises determining each of the overwritten probabilities based on at least one of the following: a read and write history of a data block; a type of data block; and an overwritten probability of a neighboring data block.

10. The device according to claim 9, wherein each of the plurality of data blocks comprises at least one data unit, and determining each of the overwritten probabilities comprises: determining a possibility of the at least one data unit, of the respective data block, being overwritten before the plurality of data blocks are all replicated to the target storage device; and determining the overwritten probability of the respective data block based at least on the possibility of the at least one data unit of the respective data block.

11. The device according to claim 10, wherein determining the possibility of the at least one data unit of the respective data block comprises determining the possibility based on at least one of the following: a read and write history of a data unit; a type of data unit; and a possibility of a neighboring data unit being overwritten.

12. The device according to claim 7, wherein the actions further comprise: when replicating the plurality of data blocks to the target storage device, re-replicating the plurality of data blocks in the source storage device, which are overwritten after being replicated to the target storage device, to the target storage device.

13. A computer program product having a non-transitory computer readable medium which stores a set of instructions to replicate data; the set of instructions, when carried out by computerized circuitry, causing the computerized circuitry to perform a method of: determining overwritten probabilities of a plurality of data blocks in a source storage device, each respective data block including an overwritten probability, of the overwritten probabilities, indicating a probability of the respective data block being overwritten in the source storage device before the plurality of data blocks are all replicated to a target storage device; classifying the plurality of data blocks as cold data blocks or hot data blocks based on the overwritten probabilities of the plurality of data blocks, overwritten probabilities of the hot data blocks being higher than overwritten probabilities of the cold data blocks; and replicating at least a portion of the cold data blocks to the target storage device prior to the hot data blocks.
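The dependent claims (3 to 6 and their device counterparts) mention estimating a block's overwritten probability from its data units and re-replicating blocks that change after being copied. The following Python sketch shows one possible reading, under loudly stated assumptions: the claims do not give any formula, so the write-frequency heuristic in `unit_possibility`, the neighbor weight of 0.25, the independence assumption in `block_probability`, and the simulated write schedule are all hypothetical choices made only for illustration.

```python
def unit_possibility(recent_writes, window, neighbor_possibility=0.0,
                     neighbor_weight=0.25):
    """Hypothetical heuristic: blend a unit's own write frequency
    (writes per observation window) with a neighboring unit's possibility."""
    own = min(1.0, recent_writes / window) if window > 0 else 0.0
    return (1.0 - neighbor_weight) * own + neighbor_weight * neighbor_possibility


def block_probability(unit_possibilities):
    """Assumed combination rule: the block counts as overwritten if at least
    one of its units is overwritten, treating units as independent."""
    p_none = 1.0
    for p in unit_possibilities:
        p_none *= 1.0 - p
    return 1.0 - p_none


def replicate(source, order, writes_during_copy):
    """Copy blocks in `order`, then re-replicate blocks overwritten after
    they were copied. `writes_during_copy` maps a step index to a
    (block_id, new_data) write that hits the source right after that step.
    Returns the target contents and the total number of block transfers."""
    target, dirty, transfers = {}, set(), 0
    for step, block_id in enumerate(order):
        target[block_id] = source[block_id]
        transfers += 1
        if step in writes_during_copy:
            written_id, data = writes_during_copy[step]
            source[written_id] = data
            if written_id in target:  # already copied, so the copy is stale
                dirty.add(written_id)
    for block_id in sorted(dirty):  # re-replication pass
        target[block_id] = source[block_id]
        transfers += 1
    return target, transfers


data = {0: "a", 1: "b", 2: "c"}
writes = {1: (0, "A")}  # block 0 (the hot block) is overwritten after step 1

_, hot_first = replicate(dict(data), [0, 1, 2], writes)
_, cold_first = replicate(dict(data), [1, 2, 0], writes)
print(hot_first, cold_first)  # 4 3
```

In this toy run, copying the hot block last avoids one re-replication transfer, because the overwrite lands before the block has been sent.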

Assignees

Inventors

Classifications

  • G06F3/065 · Primary

    Replication mechanisms · CPC title

  • in relation to data integrity, e.g. data losses, bit errors · CPC title

  • In-line storage system · CPC title

  • Management of blocks · CPC title

  • in relation to response time · CPC title

Patent family

Related publications grouped by family.

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11662937B2 cover?
Techniques for replicating data involve: determining overwritten probabilities of a plurality of data blocks of a source storage device. The techniques further involve: classifying the plurality of data blocks as cold data blocks or hot data blocks based on the overwritten probabilities in the plurality of data blocks. The techniques further involve: replicating at least a portion of the cold d…
Who is the assignee on this patent?
EMC IP Holding Co LLC, EMC IP Holding Company LLP
What technology area does this patent fall under?
Primary CPC classification G06F3/065. Mapped technology areas include Physics.
When was this patent published?
Publication date May 30, 2023 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).