Method, device and computer program product for data backup

US10983718B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10983718-B2
Application numberUS-201916285200-A
CountryUS
Kind codeB2
Filing dateFeb 26, 2019
Priority dateOct 26, 2018
Publication dateApr 20, 2021
Grant dateApr 20, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Embodiments of the present disclosure relate to method, device and computer program product for data backup. In accordance with embodiments of the present disclosure, if a file is migrated between a first machine and a second machine, a data migration module compares the file with files already stored in the second machine, so as to only migrate modified and/or added data chunks in the files to the second machine. In this way, the performance of data migration has been improved.

First claim

Opening claim text (preview).

We claim: 1. A computer-implemented method, comprising: receiving a request to migrate a file from a first machine to a second machine, the file comprising a plurality of data chunks; determining, for a first data chunk of the plurality of data chunks, whether an identification of the first data chunk has been included in metadata associated with the migration, the metadata indicating information of a data chunk which has been previously migrated from the first machine to the second machine; in response to determining that the identification of the first data chunk is included in the metadata, determining a first position of the first data chunk in the second machine without migrating the first data chunk; and controlling migration of the plurality of data chunks based on the determination and the first position of the first data chunk. 2. The method of claim 1 , wherein controlling migration of the data chunks based on the determination comprises: in response to determining that the identification of the first data chunk fails to be included in the metadata, migrating the first data chunk to the second machine; and adding information of the first data chunk into the metadata. 3. The method of claim 1 , wherein controlling migration of the data chunk based on the determination comprises: in response to determining that the identification of the first data chunk is included in the metadata, determining a size of the first data chunk without migrating the first data chunk; and transmitting, to the second machine, an indication of the first position and the size of the first data chunk. 4. The method of claim 3 , further comprising: updating the metadata with a timestamp associated with the migration of the first data chunk. 5. The method of claim 3 , further comprising: in response to determining that an identification of a second data chunk has been included in metadata associated with the migration, determining a second position of the second data chunk in the second machine and a size of the second data chunk; determining, based on the first position, the second position and the size of the second data chunk, whether the first and the second data chunks are contiguous; in response to determining that the first and the second data chunks are contiguous, transmitting, to the second machine, an indication of the second position and a sum of sizes of the first and the second data chunks. 6. The method of claim 1 , wherein the identification is a hash value of the first data chunk, wherein determining whether the identification of the first data chunk has been included in metadata associated with the migration comprises: acquiring portions of a plurality of hash values of a further plurality of data chunks in the metadata; dividing the metadata into a plurality of subsets based on the acquired portions of the further plurality of hash values; acquiring a portion of the hash value of the first data chunk; selecting a target subset from the plurality of subsets based on a portion of the identification; comparing the identification with hash values of data chunks in the target subset; and in response to a data chunk matching with the identification being in the target subset, determining whether the identification of the first data chunk has been included in metadata associated with the migration. 7. An electronic device, comprising: at least one processing unit; and at least memory coupled to the at least one processing unit and storing machine-executable instructions, the instructions, when implemented by the at least one processing unit, causing the device to perform acts, comprising: receiving a request to migrate a file from a first machine to a second machine, the file comprising a plurality of data chunks; determining, for a first data chunk of the plurality of data chunks, whether an identification of the first data chunk has been included in metadata associated with the migration, the metadata indicating information of a data chunk which has been previously migrated from the first machine to the second machine; in response to determining that the identification of the first data chunk is included in the metadata, determining a first position of the first data chunk in the second machine without migrating the first data chunk; and controlling migration of the plurality of data chunks based on the determination and the first position of the first data chunk. 8. The electronic device of claim 7 , wherein controlling migration of the data chunk based on the determination comprises: in response to determining that the identification of the first data chunk fails to be included in the metadata, migrating the first data chunk to the second machine; and adding information of the first data chunk into the metadata. 9. The electronic device of claim 7 , wherein controlling migration of the data chunk based on the determination comprises: in response to determining that the identification of the first data chunk is included in the metadata, determining a size of the first data chunk without migrating the first data chunk; and transmitting, to the second machine, an indication of the first position and the size of the first data chunk. 10. The electronic device of claim 9 , wherein the actions include: updating the metadata with a timestamp associated with the migration of the first data chunk. 11. The electronic device of claim 9 , wherein the actions further include: in response to determining that an identification of a second data chunk has been included in metadata associated with the migration, determining a second position of the second data chunk in the second machine and a size of the second data chunk; determining, based on the first position, the second position and the size of the second data chunk, whether the first and the second data chunks are contiguous; in response to determining that the first and the second data chunks are contiguous, transmitting, to the second machine, an indication of the second position and a sum of sizes of the first and the second data chunks. 12. The electronic device of claim 7 , wherein the identification is a hash value of the first data chunk, wherein determining whether the identification of the first data chunk has been included in metadata associated with the migration comprises: acquiring portions of a plurality of hash values of a further plurality of data chunks in the metadata; dividing the metadata into a plurality of subsets based on the acquired portions of the further plurality of hash values; acquiring a portion of the hash value of the first data chunk; selecting a target subset from the plurality of subsets based on a portion of the identification; comparing the identification with hash values of data chunks in the target subset; and in response to a data chunk matching with the identification being in the target subset, determining whether the identification of the first data chunk has been included in metadata associated with the migration. 13. A computer program product, tangibly stored on a non-transitory computer-readable medium and including machine executable instructions, the machine-executable instructions, when executed, causing a machine to: receive a request to migrate a file from a first machine to a second machine, the file comprising a plurality of data chunks; determine, for a first data chunk of the plurality of data chunks, whether an identification of the first data chunk has been included in metadata associated with the migration, the metadata indicating information of a data chunk which has been previously migrated from the fir

Assignees

Inventors

Classifications

  • Backup scheduling policy · CPC title

  • In-line storage system · CPC title

  • G06F3/065Primary

    Replication mechanisms · CPC title

  • in relation to data integrity, e.g. data losses, bit errors · CPC title

  • G06F3/0647Primary

    Migration mechanisms · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10983718B2 cover?
Embodiments of the present disclosure relate to method, device and computer program product for data backup. In accordance with embodiments of the present disclosure, if a file is migrated between a first machine and a second machine, a data migration module compares the file with files already stored in the second machine, so as to only migrate modified and/or added data chunks in the files to…
Who is the assignee on this patent?
Emc Ip Holding Co Llc
What technology area does this patent fall under?
Primary CPC classification G06F11/1461. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 20 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).