Replication to the cloud

US10545987B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10545987-B2
Application numberUS-201414577110-A
CountryUS
Kind codeB2
Filing dateDec 19, 2014
Priority dateDec 19, 2014
Publication dateJan 28, 2020
Grant dateJan 28, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems, methods, and computer readable storage mediums for replicating data to the cloud. A first dataset is replicated from a first storage subsystem to the cloud, with only the changes made to the first dataset since a previous replication event being replicated to the cloud. Additionally, the data may be deduplicated and compressed before being replicated from the first storage subsystem to the cloud. In some cases, the first dataset may be replicated to the cloud in a stream format which is not directly usable and then, at a later point in time, loaded onto a second storage subsystem from the cloud.

First claim

Opening claim text (preview).

What is claimed is: 1. A storage system for replicating data to a cloud-based server, the system configured to: by a storage controller, take a first snapshot of a local dataset stored on one or more storage devices, take a second snapshot of the local dataset; wherein the second snapshot includes only changes made to the local dataset since the first snapshot was taken; deduplicate and compress the second snapshot; and send to the cloud-based server, the deduplicated and compressed second snapshot in a stream of data including first data that references second data within the stream, the stream further including third data that references previously replicated data already stored. 2. The storage system as recited in claim 1 , wherein the cloud-based server is configured to apply the deduplicated and compressed data to a remote dataset. 3. The storage system as recited in claim 1 , wherein the system is further configured to send an identifier (ID) to the cloud-based server, wherein the ID is associated with the second snapshot of the local dataset. 4. The storage system as recited in claim 1 , wherein the system is further configured to encrypt the deduplicated and compressed second snapshot prior to sending the deduplicated and compressed second snapshot to the cloud-based server. 5. The storage system as recited in claim 1 , wherein the cloud-based server is further configured to: select, from a plurality of systems, a second system for restoring the remote dataset responsive to an indication of failure of the local dataset; and send the stream to the second system for restoration. 6. A method comprising: performing, by a storage controller: taking a first snapshot of a local dataset stored on one or more storage devices, taking a second snapshot of the local dataset; wherein the second snapshot includes only changes made to the local dataset since the first snapshot was taken; deduplicating and compressing the second snapshot; and sending to the cloud-based server, the deduplicated and compressed second snapshot in a stream of data including first data that references second data within the stream, the stream further including third data that references previously replicated data already stored. 7. The method as recited in claim 6 , further comprising applying the deduplicated and compressed data to a remote dataset at the cloud-based server. 8. The method as recited in claim 6 , further comprising sending an identifier (ID) to the cloud-based server, wherein the ID is associated with the second snapshot of the local dataset. 9. The method as recited in claim 6 , further comprising encrypting the deduplicated and compressed second snapshot prior to sending the deduplicated and compressed snapshot to the cloud-based server. 10. The method as recited in claim 6 , further comprising: selecting, from a plurality of storage systems, a second storage system by the cloud-based server for restoring the remote dataset responsive to an indication of failure of the local dataset; and sending the stream to the second storage system from the cloud-based server for restoration. 11. A non-transitory computer readable storage medium storing program instructions, wherein the program instructions are executable by a processor to: by a storage controller: take a first snapshot of a local dataset stored on one or more storage devices, take a second snapshot of the local dataset; wherein the second snapshot includes only changes made to the local dataset since the first snapshot was taken; deduplicate and compress the second snapshot; and send to the cloud-based server, the deduplicated and compressed second snapshot in a stream of data including first data that references second data within the stream, the stream further including third data that references previously replicated data already stored. 12. The non-transitory computer readable storage medium as recited in claim 11 , wherein the program instructions are further executable by a processor to apply the deduplicated and compressed data to a remote dataset at the cloud-based server. 13. The non-transitory computer readable storage medium as recited in claim 11 , wherein the program instructions are further executable by a processor to send an identifier (ID) to the cloud-based server, wherein the ID is associated with the second snapshot of the local dataset. 14. The non-transitory computer readable storage medium as recited in claim 11 , wherein the program instructions are further executable by a processor to: select, from a plurality of storage systems, a second storage system by the cloud-based server for restoring the remote dataset responsive to an indication of failure of the local dataset; and send the stream to the second storage system from the cloud-based server for restoration.

Assignees

Inventors

Classifications

  • De-duplication implemented within the file system, e.g. based on file segments (de-duplication techniques in storage systems for the management of data blocks G06F3/0641) · CPC title

  • Using snapshots, i.e. a logical point-in-time copy of the data · CPC title

  • using compression, e.g. sparse files · CPC title

  • Management of the backup or restore process · CPC title

  • using de-duplication of the data · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10545987B2 cover?
Systems, methods, and computer readable storage mediums for replicating data to the cloud. A first dataset is replicated from a first storage subsystem to the cloud, with only the changes made to the first dataset since a previous replication event being replicated to the cloud. Additionally, the data may be deduplicated and compressed before being replicated from the first storage subsystem to…
Who is the assignee on this patent?
Pure Storage Inc
What technology area does this patent fall under?
Primary CPC classification G06F16/27. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jan 28 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).