Replication to the cloud

US2016196320A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2016196320-A1
Application numberUS-201414577110-A
CountryUS
Kind codeA1
Filing dateDec 19, 2014
Priority dateDec 19, 2014
Publication dateJul 7, 2016
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems, methods, and computer readable storage mediums for replicating data to the cloud. A first dataset is replicated from a first storage subsystem to the cloud, with only the changes made to the first dataset since a previous replication event being replicated to the cloud. Additionally, the data may be deduplicated and compressed before being replicated from the first storage subsystem to the cloud. In some cases, the first dataset may be replicated to the cloud in a stream format which is not directly usable and then, at a later point in time, loaded onto a second storage subsystem from the cloud.

First claim

Opening claim text (preview).

What is claimed is: 1 . A system comprising: a first storage subsystem comprising a local dataset; and a cloud-based server comprising a remote dataset; wherein the first storage subsystem is configured to: identify a change to the local dataset; deduplicate and compress data associated with the change; and send the deduplicated and compressed data to the cloud-based server; wherein the cloud-based server is configured to: receive the deduplicated and compressed data; and store an identification of said change. 2 . The system as recited in claim 1 , wherein the cloud-based server is configured to apply said change indicated by the deduplicated and compressed data to the remote dataset. 3 . The system as recited in claim 1 , wherein the first storage subsystem is further configured to: take a snapshot of the local dataset, wherein the snapshot includes only changes made to the local dataset since a previous snapshot was taken; deduplicate and compress the snapshot; and send the deduplicated and compressed snapshot to the cloud-based server. 4 . The system as recited in claim 3 , wherein the first storage subsystem is further configured to send a medium identifier (ID) to the cloud-based server, wherein the medium ID is associated with the snapshot of the local dataset. 5 . The system as recited in claim 3 , wherein the first storage subsystem is further configured to encrypt the deduplicated and compressed snapshot prior to sending the deduplicated and compressed snapshot to the cloud-based server. 6 . The system as recited in claim 3 , wherein the first storage subsystem is further configured to send the deduplicated and compressed snapshot as a stream of tuples, wherein a first tuple references a second tuple within the stream of tuples, and wherein a third tuple references previously replicated data. 7 . The system as recited in claim 6 , wherein the cloud-based server is further configured to: select, from a plurality of storage subsystems, a second storage subsystem for restoring the remote dataset responsive to an indication of failure of the local dataset; and send the stream of tuples to the second storage subsystem for restoration. 8 . A method comprising: identifying a change to a local dataset at a first storage subsystem; deduplicating and compressing data associated with the change; sending the deduplicated and compressed data to a cloud-based server; receiving the deduplicated and compressed data at the cloud-based server; and storing an identification of said change at the cloud-based server. 9 . The method as recited in claim 8 , further comprising applying said change indicated by the deduplicated and compressed data to a remote dataset at the cloud-based server. 10 . The method as recited in claim 8 , further comprising: taking a snapshot of the local dataset at the first storage subsystem, wherein the snapshot includes only changes made to the local dataset since a previous snapshot was taken; deduplicating and compressing the snapshot; and sending the deduplicated and compressed snapshot to the cloud-based server. 11 . The method as recited in claim 10 , further comprising sending a medium identifier (ID) to the cloud-based server, wherein the medium ID is associated with the snapshot of the local dataset. 12 . The method as recited in claim 10 , further comprising encrypting the deduplicated and compressed snapshot prior to sending the deduplicated and compressed snapshot to the cloud-based server. 13 . The method as recited in claim 10 , further comprising sending the deduplicated and compressed snapshot as a stream of tuples, wherein a first tuple references a second tuple within the stream of tuples, and wherein a third tuple references previously replicated data. 14 . The method as recited in claim 13 , further comprising: selecting, from a plurality of storage subsystems, a second storage subsystem by the cloud-based server for restoring the remote dataset responsive to an indication of failure of the local dataset; and sending the stream of tuples to the second storage subsystem from the cloud-based server for restoration. 15 . A non-transitory computer readable storage medium storing program instructions, wherein the program instructions are executable by a processor to: identify a change to a local dataset at a first storage subsystem; deduplicate and compress data associated with the change; send the deduplicated and compressed data to a cloud-based server; receive the deduplicated and compressed data at the cloud-based server; and store an identification of said change at the cloud-based server. 16 . The non-transitory computer readable storage medium as recited in claim 15 , wherein the program instructions are further executable by a processor to apply said change indicated by the deduplicated and compressed data to a remote dataset at the cloud-based server. 17 . The non-transitory computer readable storage medium as recited in claim 15 , wherein the program instructions are further executable by a processor to: take a snapshot of the local dataset at the first storage subsystem, wherein the snapshot includes only changes made to the local dataset since a previous snapshot was taken; deduplicate and compress the snapshot; and send the deduplicated and compressed snapshot to the cloud-based server. 18 . The non-transitory computer readable storage medium as recited in claim 17 , wherein the program instructions are further executable by a processor to send a medium identifier (ID) to the cloud-based server, wherein the medium ID is associated with the snapshot of the local dataset. 19 . The non-transitory computer readable storage medium as recited in claim 17 , wherein the program instructions are further executable by a processor to send the deduplicated and compressed snapshot as a stream of tuples, wherein a first tuple references a second tuple within the stream of tuples, and wherein a third tuple references previously replicated data. 20 . The non-transitory computer readable storage medium as recited in claim 19 , wherein the program instructions are further executable by a processor to: select, from a plurality of storage subsystems, a second storage subsystem by the cloud-based server for restoring the remote dataset responsive to an indication of failure of the local dataset; and send the stream of tuples to the second storage subsystem from the cloud-based server for restoration.

Assignees

Inventors

Classifications

  • Hardware arrangements for backup · CPC title

  • by selection of backup contents · CPC title

  • De-duplication implemented within the file system, e.g. based on file segments (de-duplication techniques in storage systems for the management of data blocks G06F3/0641) · CPC title

  • Replication mechanisms · CPC title

  • Distributed or networked storage systems, e.g. storage area networks [SAN], network attached storage [NAS] · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2016196320A1 cover?
Systems, methods, and computer readable storage mediums for replicating data to the cloud. A first dataset is replicated from a first storage subsystem to the cloud, with only the changes made to the first dataset since a previous replication event being replicated to the cloud. Additionally, the data may be deduplicated and compressed before being replicated from the first storage subsystem to…
Who is the assignee on this patent?
Pure Storage Inc
What technology area does this patent fall under?
Primary CPC classification G06F16/27. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Jul 07 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).