What technology area does this patent fall under?

Primary CPC classification G06F11/1453. Mapped technology areas include Physics.

When was this patent published?

Publication date: Jan 14, 2016 (kind code A1). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Multi stream deduplicated backup of collaboration server data

US2016011945A1 · US · A1

Patent metadata
Field	Value
Publication number	US-2016011945-A1
Application number	US-201514859152-A
Country	US
Kind code	A1
Filing date	Sep 18, 2015
Priority date	Dec 19, 2012
Publication date	Jan 14, 2016
Grant date	—

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title: What the patent document calls the invention.
Abstract: A short plain-language summary of the technical disclosure.
Assignees and inventors: Who owns or filed the patent and who is credited as inventor.
Key dates: Filing, priority, publication, and grant dates set the timeline.
First independent claim: The legal scope of protection; read this for what is actually claimed.
CPC / IPC classifications: Technology tags used to group this patent with similar filings.
Citations and related patents: Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Techniques to back up collaboration server data are disclosed. An indication to begin backup of a collaboration server dataset is received. An associated directory is walked in a prescribed order to divide the dataset into a prescribed number of approximately equal-sized subsets. A separate subset-specific thread is used to back up the subsets in parallel. In some embodiments in which the collaboration data is stored in multiple volumes, a volume-based approach is used to back up the volumes in parallel, e.g., one volume per thread. In some embodiments, transaction logs are backed up in parallel with volumes of collaboration data.
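The partitioning step in the abstract can be illustrated with a minimal Python sketch. It is not taken from the patent: the function names (`walk_in_order`, `split_into_subsets`, `back_up_subset`, `parallel_backup`) are hypothetical, sorted traversal is assumed as the "prescribed order", and the per-subset backup is a placeholder that only counts files.

```python
import os
from concurrent.futures import ThreadPoolExecutor


def walk_in_order(root):
    """Yield (path, size) for every file under root in a deterministic order.

    Assumption: sorted traversal stands in for the "prescribed order".
    """
    for dirpath, dirnames, filenames in os.walk(root):
        dirnames.sort()  # in-place sort fixes the order in which os.walk descends
        for name in sorted(filenames):
            path = os.path.join(dirpath, name)
            yield path, os.path.getsize(path)


def split_into_subsets(files, n):
    """Split an ordered list of (path, size) into n contiguous subsets
    whose total sizes are approximately equal."""
    total = sum(size for _, size in files)
    subsets = [[] for _ in range(n)]
    running = 0  # bytes assigned so far
    for path, size in files:
        # Pick the subset that the current byte offset falls into.
        idx = min(n - 1, running * n // total) if total else 0
        subsets[idx].append(path)
        running += size
    return subsets


def back_up_subset(index, paths):
    """Placeholder for the real per-subset backup work (hypothetical)."""
    return index, len(paths)


def parallel_backup(files, n):
    """Back up n subsets in parallel, one thread per subset."""
    subsets = split_into_subsets(files, n)
    with ThreadPoolExecutor(max_workers=n) as pool:
        return list(pool.map(back_up_subset, range(n), subsets))
```

For example, `parallel_backup(list(walk_in_order(root)), 4)` would divide the files under a hypothetical `root` directory among four threads. Because the walk order is deterministic, an unchanged file tends to land in the same subset on a later run, which is what lets each subset be compared against its counterpart from a prior backup.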

First claim

Opening claim text (preview).

What is claimed is:
1. A method of backing up data, comprising: receiving an indication associated with a backup of a collaboration server dataset; dividing the dataset into a prescribed number of subsets; and using a plurality of separate threads to back up the subsets in parallel, wherein the plurality of threads are configured to provide data included in the corresponding subset to a corresponding de-duplicating backup process instance that is configured to perform de-duplication processing with respect to the subset and a corresponding subset associated with a prior backup.
2. The method of claim 1, wherein the corresponding subset was determined based at least in part on a prescribed order according to which the dataset is divided.
3. The method of claim 1, wherein the corresponding subset was determined by walking an associated directory in the prescribed order at a prior time with which the prior backup is associated.
4. The method of claim 3, wherein the associated directory is walked in the same prescribed order in a subsequent backup operation.
5. The method of claim 1, wherein each de-duplicating backup process instance uses a corresponding local cache located at the collaboration server.
6. The method of claim 5, wherein each de-duplicating backup process instance determines whether the corresponding local cache stores a file included in the subset corresponding to the thread.
7. The method of claim 1, wherein the prescribed number of subsets corresponds to a configured maximum number of concurrent threads used to back up the subsets in parallel.
8. The method of claim 1, further comprising generating for each subset a corresponding list of files.
9. The method of claim 8, further comprising streaming each corresponding list of files to a corresponding thread.
10. The method of claim 1, wherein the de-duplicating backup process instances corresponding to the plurality of threads have associated therewith a corresponding de-duplicating backup process instance-specific local cache.
11. The method of claim 10, wherein the de-duplicating backup process instances corresponding to the plurality of threads are configured to store in the associated corresponding de-duplicating backup process instance-specific local cache a hash value representative of data that has been transferred by the de-duplicating backup process instance to a remote backup server.
12. A system to back up collaboration server data, comprising: a memory or other storage device configured to store a directory associated with a collaboration server dataset; and a processor configured to: receive an indication associated with a backup of a collaboration server dataset; divide the dataset into a prescribed number of subsets; and use a plurality of separate threads to back up the subsets in parallel, wherein the plurality of threads are configured to provide data included in the corresponding subset to a corresponding de-duplicating backup process instance that is configured to perform de-duplication processing with respect to the subset and a corresponding subset associated with a prior backup.
13. The system of claim 12, wherein the corresponding subset was determined based at least in part on a prescribed order according to which the dataset is divided.
14. The system of claim 12, wherein the corresponding subset was determined by walking an associated directory in the prescribed order at a prior time with which the prior backup is associated.
15. The system of claim 14, wherein the associated directory is walked in the same prescribed order in a subsequent backup operation.
16. The system of claim 12, wherein each de-duplicating backup process instance uses a corresponding local cache located at the collaboration server.
17. The system of claim 16, wherein each de-duplicating backup process instance determines whether the corresponding local cache stores a file included in the subset corresponding to the thread.
18. The system of claim 9, wherein the prescribed number of subsets corresponds to a configured maximum number of concurrent threads used to back up the subsets in parallel.
19. The system of claim 9, wherein the processor is further configured to generate for each subset a corresponding list of files and to stream each corresponding list of files to a corresponding thread with which that list of files is associated.
20. A computer program product to back up data, the computer program product being embodied in a non-transitory computer readable storage medium and comprising computer instructions for: receiving an indication associated with a backup of a collaboration server dataset; dividing the dataset into a prescribed number of subsets; and using a plurality of separate threads to back up the subsets in parallel, wherein the plurality of threads are configured to provide data included in the corresponding subset to a corresponding de-duplicating backup process instance that is configured to perform de-duplication processing with respect to the subset and a corresponding subset associated with a prior backup.
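Claims 10 and 11 describe a local cache, specific to each de-duplicating backup process instance, that stores hash values for data already transferred to a remote backup server. A rough Python sketch of that idea follows. The class name `DedupBackupInstance`, the use of SHA-256, and the in-memory `sent` list standing in for the remote server are all assumptions made for illustration; the claims say only "a hash value".

```python
import hashlib


class DedupBackupInstance:
    """Hypothetical stand-in for one de-duplicating backup process instance."""

    def __init__(self):
        self.local_cache = set()  # hashes of data this instance has already sent
        self.sent = []            # stands in for transfers to a remote backup server

    def back_up(self, items):
        """Back up an iterable of (name, data_bytes) pairs.

        Returns the number of items actually transferred.
        """
        transferred = 0
        for name, data in items:
            digest = hashlib.sha256(data).hexdigest()  # SHA-256 is an assumption
            if digest in self.local_cache:
                continue  # identical data was sent earlier; skip the transfer
            self.sent.append(name)
            self.local_cache.add(digest)
            transferred += 1
        return transferred
```

In this sketch, each thread would own one instance and keep it across backups. A second run over the same subset then transfers only the items whose content changed, which is why the claims tie each subset to its counterpart from the prior backup.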

Assignees

EMC Corp

Inventors

Classifications

G06F11/1464 · for networked environments
G06F11/1453 (Primary) · using de-duplication of the data
G06F2201/84 · Using snapshots, i.e. a logical point-in-time copy of the data
G06F16/172 · Caching, prefetching or hoarding of files
G06F11/1458 · Management of the backup or restore process

Patent family

Related publications grouped by family.

View patent family 54290356

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2016011945A1 cover?: Techniques to back up collaboration server data are disclosed. An indication to begin backup of a collaboration server dataset is received. An associated directory is walked in a prescribed order to divide the dataset into a prescribed number of approximately equal-sized subsets. A separate subset-specific thread is used to back up the subsets in parallel. In some embodiments in which the collabo…
Who is the assignee on this patent?: EMC Corp