De-duplication with partitioning advice and automation
US-9213715-B2 · Dec 15, 2015 · US
US9256612B1 · US · B1
| Field | Value |
|---|---|
| Publication number | US-9256612-B1 |
| Application number | US-201313915072-A |
| Country | US |
| Kind code | B1 |
| Filing date | Jun 11, 2013 |
| Priority date | Jun 11, 2013 |
| Publication date | Feb 9, 2016 |
| Grant date | Feb 9, 2016 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A computer-implemented method for managing references in deduplicating data systems may include (1) identifying a first instance of a data segment stored within a deduplicating data system that reduces redundant data storage by storing at least two data objects such that each of the two data objects references the first instance of the data segment, (2) identifying an additional data object to be stored by the deduplicating data system that includes the data segment, (3) determining whether a reference limit associated with the first instance of the data segment has been reached, and (4) storing, based at least in part on determining that the reference limit associated with the first instance of the data segment has been reached, a second instance of the data segment within the deduplicating data system. Various other methods, systems, and computer-readable media are also disclosed.
Opening claim text (preview).
What is claimed is: 1. A computer-implemented method for managing references in deduplicating data systems, at least a portion of the method being performed by a computing device comprising at least one processor, the method comprising: identifying a first instance of a data segment stored within a deduplicating data system that reduces redundant data storage by storing at least two data objects such that each of the two data objects references the first instance of the data segment; identifying an additional data object to be stored by the deduplicating data system, wherein the additional data object comprises the data segment; determining that an age limit of the first instance of the data segment has been reached, wherein the age limit of the first instance of the data segment: limits the amount of time during which the first instance of the data segment is available to be referenced; and indicates an age of the first instance of the data segment after which the first instance of the data segment cannot be referenced by any additional data objects that are later stored to the deduplicating data system; preventing storage-space fragmentation within the deduplicating data system by storing, based at least in part on determining that the age limit of the first instance of the data segment has been reached, a second instance of the data segment within the deduplicating data system that is referenced by the additional data object while also retaining the first instance of the data segment within the deduplicating data system. 2. The computer-implemented method of claim 1 , wherein: the first instance of the data segment is stored within a first container that is stored within the deduplicating data system; the first container also stores a plurality of additional data segments; the age limit of the first instance of the data segment comprises an age limit of the first container that indicates an age of the first container after which data segments stored within the first container cannot be referenced by any additional data objects that are later stored to the deduplicating data system; preventing storage-space fragmentation within the deduplicating data system by storing the second instance of the data segment within the deduplicating data system comprises storing the second instance of the data segment within a second container within the deduplicating data system that is separate and distinct from the first container. 3. The computer-implemented method of claim 1 , wherein: the first instance of the data segment is stored within a first container that is stored within the deduplicating data system; the first container also stores a plurality of additional data segments; the age limit of the first instance of the data segment comprises an age limit of the first container that indicates an age of the first container after which the first container cannot be referenced by any additional data objects that are later stored to the deduplicating data system; preventing storage-space fragmentation within the deduplicating data system by storing the second instance of the data segment within the deduplicating data system comprises storing the second instance of the data segment within a second container within the deduplicating data system that is separate and distinct from the first container. 4. The computer-implemented method of claim 1 , wherein preventing storage-space fragmentation within the deduplicating data system by storing the second instance of the data segment within the deduplicating data system comprises storing the second instance of the data segment within a container within the deduplicating data system that is separate and distinct from the container within the deduplicating data system within which the first instance of the data segment is stored. 5. The computer-implemented method of claim 1 , further comprising: determining whether any data objects within the deduplicating data system reference the first instance of the data segment; deleting the first instance of the data segment based at least in part on determining that no data objects within the deduplicating data system reference the first instance of the data segment. 6. The computer-implemented method of claim 1 , further comprising: detecting that the first instance of the data segment has become corrupt; using the second instance of the data segment to repair the first instance of the data segment. 7. The computer-implemented method of claim 1 , further comprising: detecting that the second instance of the data segment has become corrupt; using the first instance of the data segment to repair the second instance of the data segment. 8. A system for managing references in deduplicating data systems, the system comprising: an identifying module that identifies: a first instance of a data segment stored within a deduplicating data system that reduces redundant data storage by storing at least two data objects such that each of the two data objects references the first instance of the data segment; an additional data object to be stored by the deduplicating data system, wherein the additional data object comprises the data segment; a determining module that determines that an age limit of the first instance of the data segment has been reached, wherein the age limit of the first instance of the data segment: limits the amount of time during which the first instance of the data segment is available to be referenced; and indicates an age of the first instance of the data segment after which the first instance of the data segment cannot be referenced by any additional data objects that are later stored to the deduplicating data system; a storing module that prevents storage-space fragmentation within the deduplicating data system by storing, based at least in part on determining that the age limit of the first instance of the data segment has been reached, a second instance of the data segment within the deduplicating data system that is referenced by the additional data object while also retaining the first instance of the data segment within the deduplicating data system; at least one processor that executes the identifying module, the determining module, and the storing module. 9. The system of claim 8 , wherein: the first instance of the data segment is stored within a first container that is stored within the deduplicating data system; the first container also stores a plurality of additional data segments; the age limit of the first instance of the data segment comprises an age limit of the first container that indicates an age of the first container after which data segments stored within the first container cannot be referenced by any additional data objects that are later stored to the deduplicating data system; the storing module stores the second instance of the data segment within the deduplicating data system by storing the second instance of the data segment within a second container within the deduplicating data system that is separate and distinct from the first container. 10. The system of claim 8 , wherein: the first instance of the data segment is stored within a first container that is stored within the deduplicating data system; the first container also stores a plurality of additional data segments; the age limit of the first instance of the data segment comprises an age limit of the first container that indicates an age of the first container after which the first container cannot be referenced by any additional data objects that are later stored to the deduplicating data system; the storing module stores the second instance of the data segment within the deduplicating data system
Physics · mapped topic
De-duplication implemented within the file system, e.g. based on file segments (de-duplication techniques in storage systems for the management of data blocks G06F3/0641) · CPC title
Details of free space management performed by the file system (saving storage space on storage systems G06F3/0608; management of blocks in storage devices G06F3/064) · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.