Gold image library management system to reduce backup storage and bandwidth utilization

US11513904B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11513904-B2
Application numberUS-202017124957-A
CountryUS
Kind codeB2
Filing dateDec 17, 2020
Priority dateDec 17, 2020
Publication dateNov 29, 2022
Grant dateNov 29, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Reducing backup data by providing a data protection target for storing content data from clients running operating system and applications, and a common data protection target (CDPT) separate from the data protection target for storing Gold image data for the operating system and applications. During backup, user content data is copied from the client to the data protection target, and Gold image data is copied to the CDPT. It is also referenced in the DP target to prevent redundant storage. During backup, the CDPT is queried to determine if the Gold image exists, and if so, the DP target does not store the Gold image data in the DP target, but uses the reference to indicate the location of the Gold image data corresponding to the backed up content data. During a restore, the restore stream is built from the DP target and CDPT to combine user content data and Gold image data.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method comprising: providing a data protection (DP) target for storing user content data generated by one or more data sources deployed as clients running one or more operating system and application programs; providing a common data protection target (CDPT) accessible to but separate from the data protection target for storing structural data for the one or more operating system and application programs, wherein the structural data comprises operating system and application data defined by a manufacturer and different from the user content data; copying, during a backup operation of a client, the user content data is copied from the client to the DP target and copying the structural data to the CDPT; referencing the structural data in the DP target to prevent redundant storage of the structural data in the DP target; querying the CDPT to determine if the structural data for the client already exists in the CDPT, and if so, causing the DP target to not store the structural data in the DP target and instead use the reference to indicate the CDPT as the location of the structural data corresponding to the backed up user content data, and if not, using the structural data stored in the DP target, wherein the reference comprises a DP target ID field comprising a unique identifier for each DP target storing the user content data; and defining, first, a local DP target chunk for the DP target having the user content data, a hash value, and a size value, second, a CDPT target chunk for the CDPT target having the structural data, a hash value, a size value, and the DP target ID field, third, a remote DP target chunk for the CDPT relative to the DP target and comprising the hash value, a binary index value, and an identifier for the CDPT (CDPT ID), wherein each entry DP target ID field refers to a specific DP target device that references the CDPT target chunk, and further wherein the DP target ID field contains a DP target ID either zero or one time exactly, and refers to a local chunk or a remote chunk on the CDPT device, wherein a local chunk has a non-zero size field and chunk data. 2. The method of claim 1 further comprising, during a restore operation, building a restore stream from the DP target and the CDPT to combine the user content data and the corresponding structural data for the restored data set. 3. The method of claim 1 wherein, if the size field is zero, there is no data portion and the CDPT ID references the remote CDPT where the chunk data resides. 4. A method of storing structural data for operating systems and applications separately from user content data to reduce redundant storage of data in backup systems, comprising: storing content data generated by use of the operating systems and applications in one or more data protection (DP) targets; storing the structural data in a dedicated, centralized common data protection target (CDPT), wherein the structural data is stored as data chunks; augmenting each data chunk in the CDPT with a data structure that has a list of identifiers for each DP target that refers to any CDPT chunk one or more times; and checking, upon a backup operation, by each DP target whether a backed up data chunk is present in the CDPT, and if not, storing it the data chunk in the DP target, otherwise if the data is present in the CDPT, causing the chunk on the DP target to contain only the chunk's hash, an identifier of the CDPT containing the data, and a size of zero; notifying the CDPT by the DP target that the chunk is being used; providing the ID of the DP target for addition by the CDPT to the chunk on the CDPT; adding, to a DP target ID list, the CDPT ID from the chunk on the CDPT; removing a data portion from a local DPT chunk; and adding the CDPT ID and setting a size field to zero. 5. The method of claim 4 further comprising performing a restore operation by: examining a metadata catalog for a client being restored; iterating though all data chunks by hash to build a restore stream; and retrieving chunks present in the CDPT for addition to the restore stream. 6. A system comprising: a data protection (DP) target for storing user content data generated by one or more data sources deployed as clients running one or more operating system and application programs; a common data protection target (CDPT) accessible to but separate from the data protection target for storing structural data for the one or more operating system and application programs, wherein the structural data comprises operating system and application data defined by a manufacturer and different from the user content data; and a first server component copying, during a backup operation of a client, the user content data is copied from the client to the DP target and copying the structural data to the CDPT, and referencing the structural data in the DP target to prevent redundant storage of the structural data in the DP target; a second server component querying the CDPT to determine if the structural data for the client already exists in the CDPT, and if so, causing the DP target to not store the structural data in the DP target and instead use the reference to indicate the CDPT as the location of the structural data corresponding to the backed up user content data, and if not, using the structural data stored in the DP target, wherein the reference comprises a DP target ID field comprising a unique identifier for each DP target storing the user content data; and a third server component, defining a CDPT target chunk for the CDPT target and comprising the structural data, a hash value, a size value, and the DP target ID field, and defining a remote DP target chunk for the CDPT relative to the DP target and comprising the a hash value, a binary index value, and an identifier for the CDPT (CDPT ID), wherein each entry DP target ID field refers to a specific DP target device that references the CDPT target chunk, and wherein the DP target ID field will contain a DP target ID either zero or one time exactly, and will refer to a local chunk or a remote chunk on the CDPT device, wherein a local chunk has a non-zero size field and chunk data. 7. The system of claim 6 further comprising a restoration component building a restore stream from the DP target and the CDPT to combine the user content data and the corresponding structural data for the restored data set.

Assignees

Inventors

Classifications

  • Backup restoration techniques · CPC title

  • for networked environments · CPC title

  • using de-duplication of the data · CPC title

  • Using snapshots, i.e. a logical point-in-time copy of the data · CPC title

  • Aggregation; Duplicate elimination · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11513904B2 cover?
Reducing backup data by providing a data protection target for storing content data from clients running operating system and applications, and a common data protection target (CDPT) separate from the data protection target for storing Gold image data for the operating system and applications. During backup, user content data is copied from the client to the data protection target, and Gold ima…
Who is the assignee on this patent?
Emc Ip Holding Co Llc
What technology area does this patent fall under?
Primary CPC classification G06F11/1453. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Nov 29 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 10 related publications on this page (citations in our corpus or others sharing the same primary CPC).