Storage system and computer system
US-2021326046-A1 · Oct 21, 2021 · US
US11513904B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11513904-B2 |
| Application number | US-202017124957-A |
| Country | US |
| Kind code | B2 |
| Filing date | Dec 17, 2020 |
| Priority date | Dec 17, 2020 |
| Publication date | Nov 29, 2022 |
| Grant date | Nov 29, 2022 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Reducing backup data by providing a data protection target for storing content data from clients running operating system and applications, and a common data protection target (CDPT) separate from the data protection target for storing Gold image data for the operating system and applications. During backup, user content data is copied from the client to the data protection target, and Gold image data is copied to the CDPT. It is also referenced in the DP target to prevent redundant storage. During backup, the CDPT is queried to determine if the Gold image exists, and if so, the DP target does not store the Gold image data in the DP target, but uses the reference to indicate the location of the Gold image data corresponding to the backed up content data. During a restore, the restore stream is built from the DP target and CDPT to combine user content data and Gold image data.
Opening claim text (preview).
What is claimed is: 1. A computer-implemented method comprising: providing a data protection (DP) target for storing user content data generated by one or more data sources deployed as clients running one or more operating system and application programs; providing a common data protection target (CDPT) accessible to but separate from the data protection target for storing structural data for the one or more operating system and application programs, wherein the structural data comprises operating system and application data defined by a manufacturer and different from the user content data; copying, during a backup operation of a client, the user content data is copied from the client to the DP target and copying the structural data to the CDPT; referencing the structural data in the DP target to prevent redundant storage of the structural data in the DP target; querying the CDPT to determine if the structural data for the client already exists in the CDPT, and if so, causing the DP target to not store the structural data in the DP target and instead use the reference to indicate the CDPT as the location of the structural data corresponding to the backed up user content data, and if not, using the structural data stored in the DP target, wherein the reference comprises a DP target ID field comprising a unique identifier for each DP target storing the user content data; and defining, first, a local DP target chunk for the DP target having the user content data, a hash value, and a size value, second, a CDPT target chunk for the CDPT target having the structural data, a hash value, a size value, and the DP target ID field, third, a remote DP target chunk for the CDPT relative to the DP target and comprising the hash value, a binary index value, and an identifier for the CDPT (CDPT ID), wherein each entry DP target ID field refers to a specific DP target device that references the CDPT target chunk, and further wherein the DP target ID field contains a DP target ID either zero or one time exactly, and refers to a local chunk or a remote chunk on the CDPT device, wherein a local chunk has a non-zero size field and chunk data. 2. The method of claim 1 further comprising, during a restore operation, building a restore stream from the DP target and the CDPT to combine the user content data and the corresponding structural data for the restored data set. 3. The method of claim 1 wherein, if the size field is zero, there is no data portion and the CDPT ID references the remote CDPT where the chunk data resides. 4. A method of storing structural data for operating systems and applications separately from user content data to reduce redundant storage of data in backup systems, comprising: storing content data generated by use of the operating systems and applications in one or more data protection (DP) targets; storing the structural data in a dedicated, centralized common data protection target (CDPT), wherein the structural data is stored as data chunks; augmenting each data chunk in the CDPT with a data structure that has a list of identifiers for each DP target that refers to any CDPT chunk one or more times; and checking, upon a backup operation, by each DP target whether a backed up data chunk is present in the CDPT, and if not, storing it the data chunk in the DP target, otherwise if the data is present in the CDPT, causing the chunk on the DP target to contain only the chunk's hash, an identifier of the CDPT containing the data, and a size of zero; notifying the CDPT by the DP target that the chunk is being used; providing the ID of the DP target for addition by the CDPT to the chunk on the CDPT; adding, to a DP target ID list, the CDPT ID from the chunk on the CDPT; removing a data portion from a local DPT chunk; and adding the CDPT ID and setting a size field to zero. 5. The method of claim 4 further comprising performing a restore operation by: examining a metadata catalog for a client being restored; iterating though all data chunks by hash to build a restore stream; and retrieving chunks present in the CDPT for addition to the restore stream. 6. A system comprising: a data protection (DP) target for storing user content data generated by one or more data sources deployed as clients running one or more operating system and application programs; a common data protection target (CDPT) accessible to but separate from the data protection target for storing structural data for the one or more operating system and application programs, wherein the structural data comprises operating system and application data defined by a manufacturer and different from the user content data; and a first server component copying, during a backup operation of a client, the user content data is copied from the client to the DP target and copying the structural data to the CDPT, and referencing the structural data in the DP target to prevent redundant storage of the structural data in the DP target; a second server component querying the CDPT to determine if the structural data for the client already exists in the CDPT, and if so, causing the DP target to not store the structural data in the DP target and instead use the reference to indicate the CDPT as the location of the structural data corresponding to the backed up user content data, and if not, using the structural data stored in the DP target, wherein the reference comprises a DP target ID field comprising a unique identifier for each DP target storing the user content data; and a third server component, defining a CDPT target chunk for the CDPT target and comprising the structural data, a hash value, a size value, and the DP target ID field, and defining a remote DP target chunk for the CDPT relative to the DP target and comprising the a hash value, a binary index value, and an identifier for the CDPT (CDPT ID), wherein each entry DP target ID field refers to a specific DP target device that references the CDPT target chunk, and wherein the DP target ID field will contain a DP target ID either zero or one time exactly, and will refer to a local chunk or a remote chunk on the CDPT device, wherein a local chunk has a non-zero size field and chunk data. 7. The system of claim 6 further comprising a restoration component building a restore stream from the DP target and the CDPT to combine the user content data and the corresponding structural data for the restored data set.
Backup restoration techniques · CPC title
for networked environments · CPC title
using de-duplication of the data · CPC title
Using snapshots, i.e. a logical point-in-time copy of the data · CPC title
Aggregation; Duplicate elimination · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.