Optimized tender processing of hash-based replicated data

US10496668B1 · US · B1

Patent metadata
FieldValue
Publication numberUS-10496668-B1
Application numberUS-201615195109-A
CountryUS
Kind codeB1
Filing dateJun 28, 2016
Priority dateJun 28, 2016
Publication dateDec 3, 2019
Grant dateDec 3, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system, computer program product, and computer-executable method of optimizing replication between a first Content Addressable Storage (CAS) system and a second CAS system, the system, computer program product, and computer-executable method including analyzing the first CAS system and the second CAS system to determine an amount of differences between the first CAS system and the second CAS system, upon determining that the amount of differences are below a threshold amount, sending a set of replication information to the second CAS system, wherein the set of information includes a first address and a first hash for a portion of data stored on the first CAS system, analyzing, at the second CAS system, the set of replication information to determine whether to request that the first CAS system send data associated with the set of replication information; upon a negative determination, sending the data associated with the set of replication information.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-executable method of optimizing replication between a first Content Addressable Storage (CAS) system and a second CAS system, the computer-executable method comprising: analyzing the first CAS system and the second CAS system to determine an amount of differences between the first CAS system and the second CAS system; upon determining that the amount of differences is below a threshold amount, sending a set of replication information to the second CAS system, wherein the set of information includes a first address and a first hash for a portion of data stored on the first CAS system; analyzing, at the second CAS system, the set of replication information to determine whether to request that the first CAS system send data associated with the set of replication information, wherein the analyzing comprises determining, at the second CAS system, hash handles associated with the first address and the first hash; upon a negative determination, sending the data associated with the set of replication information. 2. The computer-executable method of claim 1 , wherein the analyzing comprises: creating, at the second CAS system, a first hash handle associated with the first hash received from the first CAS system; searching a hash metadata table (HMD), at the second CAS system, to determine whether the first hash handle exists; upon a negative determination, returning a failure signal to the first CAS system. 3. The computer-executable method of claim 1 , wherein the analyzing comprises: creating, at the second CAS system, a first hash handle associated with the first hash received from the first CAS system; determining whether the first hash handle exists on the second CAS system; upon a positive determination, comparing the first hash handle with a second hash handle, wherein the second hash handle is associated with a second address within an address to hash handle (A2H) table on the second CAS system; upon determining that the first hash handle and the second hash handle are identical, returning a success signal to the first CAS system. 4. The computer-executable method of claim 3 , wherein the analyzing further comprises: upon determining that the first hash handle and the second hash handle are not identical, updating the second handle with the first hash handle; and returning a success signal to the first CAS system. 5. The computer-executable method of claim 1 , further comprising: upon a positive determination, sending a second set of replication information. 6. A system, comprising: a distributed data storage system including a first Content Addressable Storage (CAS) system and a second CAS system; and computer-executable program logic encoded in memory of one or more computers enabled to optimize replication between the first CAS system and the second CAS system, wherein the computer-executable program logic is configured for the execution of: analyzing the first CAS system and the second CAS system to determine an amount of differences between the first CAS system and the second CAS system; upon determining that the amount of differences is below a threshold amount, sending a set of replication information to the second CAS system, wherein the set of information includes a first address and a first hash for a portion of data stored on the first CAS system; analyzing, at the second CAS system, the set of replication information to determine whether to request that the first CAS system send data associated with the set of replication information, wherein the analyzing comprises determining, at the second CAS system, hash handles associated with the first address and the first hash; upon a negative determination, sending the data associated with the set of replication information. 7. The system of claim 6 , wherein the analyzing comprises: creating, at the second CAS system, a first hash handle associated with the first hash received from the first CAS system; searching a hash metadata table (HMD), at the second CAS system, to determine whether the first hash handle exists; upon a negative determination, returning a failure signal to the first CAS system. 8. The system of claim 6 , wherein the analyzing comprises: creating, at the second CAS system, a first hash handle associated with the first hash received from the first CAS system; determining whether the first hash handle exists on the second CAS system; upon a positive determination, comparing the first hash handle with a second hash handle, wherein the second hash handle is associated with a second address within an address to hash handle (A2H) table on the second CAS system; upon determining that the first hash handle and the second hash handle are identical, returning a success signal to the first CAS system. 9. The system of claim 8 , wherein the analyzing further comprises: upon determining that the first hash handle and the second hash handle are not identical, updating the second handle with the first hash handle; and returning a success signal to the first CAS system. 10. The system of claim 6 , wherein the computer-executable program logic is further configured for the execution of: upon a positive determination, sending a second set of replication information. 11. A computer program product for optimizing replication between a first Content Addressable Storage (CAS) system and a second CAS system, the computer program product comprising: a non-transitory computer readable medium encoded with computer-executable code, the code configured to enable the execution of: analyzing the first CAS system and the second CAS system to determine an amount of differences between the first CAS system and the second CAS system; upon determining that the amount of differences is below a threshold amount, sending a set of replication information to the second CAS system, wherein the set of information includes a first address and a first hash for a portion of data stored on the first CAS system; analyzing, at the second CAS system, the set of replication information to determine whether to request that the first CAS system send data associated with the set of replication information, wherein the analyzing comprises determining, at the second CAS system, hash handles associated with the first address and the first hash; upon a negative determination, sending the data associated with the set of replication information. 12. The computer program product of claim 11 , wherein the analyzing comprises: creating, at the second CAS system, a first hash handle associated with the first hash received from the first CAS system; searching a hash metadata table (HMD), at the second CAS system, to determine whether the first hash handle exists; upon a negative determination, returning a failure signal to the first CAS system. 13. The computer program product of claim 11 , wherein the analyzing comprises: creating, at the second CAS system, a first hash handle associated with the first hash received from the first CAS system; determining whether the first hash handle exists on the second CAS system; upon a positive determination, comparing the first hash handle with a second hash handle, wherein the second hash handle is associated with a second address within an address to hash handle (A2H) table on the second CAS system; upon determining that the first hash handle and the second hash handle are identical, returning a success signal to the first CAS system. 14. The computer program product of claim 13 , wherein the analyzing further comprises: upon determining that the first hash handle and the second

Assignees

Inventors

Classifications

  • G06F16/273Primary

    Asynchronous replication or reconciliation · CPC title

  • Hash tables · CPC title

  • G06F16/27Primary

    Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10496668B1 cover?
A system, computer program product, and computer-executable method of optimizing replication between a first Content Addressable Storage (CAS) system and a second CAS system, the system, computer program product, and computer-executable method including analyzing the first CAS system and the second CAS system to determine an amount of differences between the first CAS system and the second CAS …
Who is the assignee on this patent?
Emc Corp, Emc Ip Holding Co Llc
What technology area does this patent fall under?
Primary CPC classification G06F16/273. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 03 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).