Data storage space management
US-9830111-B1 · Nov 28, 2017 · US
US11928030B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11928030-B2 |
| Application number | US-202016835657-A |
| Country | US |
| Kind code | B2 |
| Filing date | Mar 31, 2020 |
| Priority date | Mar 31, 2020 |
| Publication date | Mar 12, 2024 |
| Grant date | Mar 12, 2024 |
Official abstract text for this publication.
A method includes creating a deduplicated universal share (US) of data objects, which in turn includes receiving a US of the data objects, deduplicating the US, wherein deduplicating the US includes: hashing segments of the US to generate respective US segment fingerprints; comparing US segment fingerprints to fingerprints for respective segments held in deduplication storage in order to identify segments in the deduplication storage that equate to the US segments, respectively, of the US; storing identifiers that directly or indirectly identify locations, respectively, of the segments, respectively, in the deduplication storage that equate to the US segments, respectively, of the US. After creating the deduplicated universal share, a deduplicated backup of the US is created without reassembling the US from segments held in the deduplication storage, the creating the deduplicated backup including: creating a list that comprises copies of the stored identifiers, and storing the list.
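The flow in the abstract can be illustrated with a minimal Python sketch. Assumptions not taken from the patent: fixed-size segments, SHA-256 as the hash, the fingerprint itself serving as the stored identifier, and an in-memory dictionary standing in for deduplication storage. The names `DedupStore`, `ingest`, and `backup_from_share` are hypothetical.

```python
import hashlib

SEGMENT_SIZE = 4  # hypothetical; tiny so the demo shows a duplicate segment


def fingerprint(segment: bytes) -> str:
    """Hash one segment to produce its fingerprint (SHA-256 is an assumption)."""
    return hashlib.sha256(segment).hexdigest()


class DedupStore:
    """In-memory stand-in for deduplication storage: fingerprint -> segment."""

    def __init__(self) -> None:
        self.segments: dict[str, bytes] = {}

    def ingest(self, data: bytes) -> list[str]:
        """Deduplicate `data` and return the identifiers of its segments."""
        identifiers = []
        for start in range(0, len(data), SEGMENT_SIZE):
            segment = data[start:start + SEGMENT_SIZE]
            fp = fingerprint(segment)
            # Compare against fingerprints already held; store only new segments.
            if fp not in self.segments:
                self.segments[fp] = segment
            identifiers.append(fp)
        return identifiers


def backup_from_share(share_identifiers: list[str]) -> list[str]:
    """Create the backup as a copy of the identifier list; no segment is read."""
    return list(share_identifiers)


data = b"abcdabcdefgh"
store = DedupStore()
share_ids = store.ingest(data)
backup_ids = backup_from_share(share_ids)
restored = b"".join(store.segments[i] for i in backup_ids)
```

In this sketch the backup step touches only identifiers, which is the point of creating the backup "without reassembling" the share: segment data is read only if a restore is later requested.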
Opening claim text (preview).
What is claimed is:
1. A method comprising: creating a deduplicated copy of data objects, the creating comprising: receiving a copy of the data objects; deduplicating the copy, wherein deduplicating the copy comprises: hashing segments of the copy to generate respective copy segment fingerprints; comparing the copy segment fingerprints to fingerprints for respective segments held in deduplication storage in order to identify segments in the deduplication storage that equate to segments, respectively, of the copy; creating and storing a first list of identifiers that directly or indirectly identify locations, respectively, of the segments, respectively, in the deduplication storage that equate to segments, respectively, of the copy; after creating the deduplicated copy, creating a deduplicated backup copy of the data objects without reassembling the copy from segments held in the deduplication storage, the creating the deduplicated backup copy comprising: creating and storing a second list that comprises a plurality of entries, each entry corresponding to a respective data object of the data objects, wherein each entry comprises a respective plurality of identifiers copied from the first list; creating and storing a catalog that maps each entry in the second list to its respective data object of the data objects.
2. The method of claim 1 wherein the data objects comprise files of a filesystem.
3. The method of claim 2 further comprising an act of reassembling the copy using segments held in the deduplication storage, the second list, and the catalog.
4. The method of claim 3 further comprising an act of replicating the reassembled copy to a storage device.
5. The method of claim 1 wherein the copy of the data objects is received at a storage device via standard interfaces thereto via a customized script executing on a production server.
6. The method of claim 1 wherein the identifiers are stored in the first list in a format that is different from a format of the second list of identifiers.
7. A non-transitory computer-readable media comprising executable instructions, wherein a method is implemented in response to executing the instructions, the method comprising: creating a deduplicated copy of data objects, the creating comprising: receiving a copy of the data objects; deduplicating the copy, wherein deduplicating the copy comprises: hashing segments of the copy to generate respective copy segment fingerprints; comparing the copy segment fingerprints to fingerprints for respective segments held in deduplication storage in order to identify segments in the deduplication storage that equate to segments, respectively, of the copy; creating and storing a first list of identifiers that directly or indirectly identify locations, respectively, of the segments, respectively, in the deduplication storage that equate to segments, respectively, of the copy; after creating the deduplicated copy, creating a deduplicated backup copy of the data objects without reassembling the copy from segments held in the deduplication storage, the creating the deduplicated backup copy comprising: creating and storing a second list that comprises a plurality of entries, each entry corresponding to a respective data object of the data objects, wherein each entry comprises a respective plurality of identifiers copied from the first list; creating and storing a catalog that maps each entry in the second list to its respective data object of the data objects.
8. The non-transitory computer-readable media of claim 7 wherein the data objects comprise files of a filesystem.
9. The non-transitory computer-readable media of claim 8 wherein the method further comprises an act of reassembling the copy using segments held in the deduplication storage, the second list, and the catalog.
10. The non-transitory computer-readable media of claim 9 wherein the method further comprises an act of replicating the reassembled copy to a storage device.
11. The non-transitory computer-readable media of claim 7 wherein the copy of the data objects is received at a storage device via standard interfaces thereto via a customized script executing on a production server.
12. The non-transitory computer-readable media of claim 7 wherein the identifiers are stored in the first list in a format that is different from a format of the second list of identifiers.
13. A computer system comprising: one or more processors; a computer-readable storage medium coupled to the one or more processors; and a plurality of instructions, encoded in the computer-readable storage medium, wherein a method is implemented in response to executing the instructions, the method comprising: creating a deduplicated copy of data objects, the creating comprising: receiving a copy of the data objects; deduplicating the copy, wherein deduplicating the copy comprises: hashing segments of the copy to generate respective copy segment fingerprints; comparing the copy segment fingerprints to fingerprints for respective segments held in deduplication storage in order to identify segments in the deduplication storage that equate to segments, respectively, of the copy; creating and storing a first list of identifiers that directly or indirectly identify locations, respectively, of the segments, respectively, in the deduplication storage that equate to segments, respectively, of the copy; after creating the deduplicated copy, creating a deduplicated backup copy of the data objects without reassembling the copy from segments held in the deduplication storage, the creating the deduplicated backup copy comprising: creating and storing a second list that comprises a plurality of entries, each entry corresponding to a respective data object of the data objects, wherein each entry comprises a respective plurality of identifiers copied from the first list; creating and storing a catalog that maps each entry in the second list to its respective data object of the data objects.
14. The computer system of claim 13 wherein the method further comprises an act of reassembling the copy using segments held in the deduplication storage, the second list, and the catalog.
15. The computer system of claim 13 wherein the method further comprises an act of replicating the reassembled copy to a storage device.
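As an illustration of the second list and catalog recited in the independent claims, and of the reassembly in claims 3, 9, and 14, here is a hedged Python sketch. The data layout is an assumption, not the patent's: the first list is modeled as flat `(object name, identifier)` pairs, the second list as one entry of identifiers per object, and the catalog as a mapping from entry index to object name. The function names `ingest`, `make_backup`, and `restore`, the 4-byte segment size, and the use of SHA-256 are all hypothetical.

```python
import hashlib


def ingest(objects: dict[str, bytes], store: dict[str, bytes],
           segment_size: int = 4) -> list[tuple[str, str]]:
    """Deduplicate objects into `store`; return a flat first list of
    (object name, segment identifier) pairs."""
    first_list = []
    for name, data in objects.items():
        for start in range(0, len(data), segment_size):
            segment = data[start:start + segment_size]
            seg_id = hashlib.sha256(segment).hexdigest()
            store.setdefault(seg_id, segment)  # keep one copy per fingerprint
            first_list.append((name, seg_id))
    return first_list


def make_backup(first_list: list[tuple[str, str]]):
    """Build the second list (one entry per object) and the catalog
    (entry index -> object name) by copying identifiers only."""
    second_list: list[list[str]] = []
    catalog: dict[int, str] = {}
    index_of: dict[str, int] = {}  # helper lookup, not part of the backup
    for name, seg_id in first_list:
        if name not in index_of:
            index_of[name] = len(second_list)
            catalog[index_of[name]] = name
            second_list.append([])
        second_list[index_of[name]].append(seg_id)
    return second_list, catalog


def restore(store: dict[str, bytes], second_list, catalog) -> dict[str, bytes]:
    """Reassemble each object from stored segments, the second list, and the catalog."""
    return {catalog[i]: b"".join(store[s] for s in entry)
            for i, entry in enumerate(second_list)}


objects = {"a.txt": b"hello world!", "b.txt": b"hello there!"}
store: dict[str, bytes] = {}
first_list = ingest(objects, store)
second_list, catalog = make_backup(first_list)
restored = restore(store, second_list, catalog)
```

The two lists deliberately use different shapes here, loosely mirroring claims 6 and 12, in which the first list's format differs from that of the second list.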
using de-duplication of the data · CPC title
by selection of backup contents · CPC title
Backup scheduling policy · CPC title
for networked environments · CPC title
Backup restoration techniques · CPC title