Systems and methods for managing references in deduplicating data systems

US9256612B1 · US · B1

Patent metadata
FieldValue
Publication numberUS-9256612-B1
Application numberUS-201313915072-A
CountryUS
Kind codeB1
Filing dateJun 11, 2013
Priority dateJun 11, 2013
Publication dateFeb 9, 2016
Grant dateFeb 9, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A computer-implemented method for managing references in deduplicating data systems may include (1) identifying a first instance of a data segment stored within a deduplicating data system that reduces redundant data storage by storing at least two data objects such that each of the two data objects references the first instance of the data segment, (2) identifying an additional data object to be stored by the deduplicating data system that includes the data segment, (3) determining whether a reference limit associated with the first instance of the data segment has been reached, and (4) storing, based at least in part on determining that the reference limit associated with the first instance of the data segment has been reached, a second instance of the data segment within the deduplicating data system. Various other methods, systems, and computer-readable media are also disclosed.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method for managing references in deduplicating data systems, at least a portion of the method being performed by a computing device comprising at least one processor, the method comprising: identifying a first instance of a data segment stored within a deduplicating data system that reduces redundant data storage by storing at least two data objects such that each of the two data objects references the first instance of the data segment; identifying an additional data object to be stored by the deduplicating data system, wherein the additional data object comprises the data segment; determining that an age limit of the first instance of the data segment has been reached, wherein the age limit of the first instance of the data segment: limits the amount of time during which the first instance of the data segment is available to be referenced; and indicates an age of the first instance of the data segment after which the first instance of the data segment cannot be referenced by any additional data objects that are later stored to the deduplicating data system; preventing storage-space fragmentation within the deduplicating data system by storing, based at least in part on determining that the age limit of the first instance of the data segment has been reached, a second instance of the data segment within the deduplicating data system that is referenced by the additional data object while also retaining the first instance of the data segment within the deduplicating data system. 2. The computer-implemented method of claim 1 , wherein: the first instance of the data segment is stored within a first container that is stored within the deduplicating data system; the first container also stores a plurality of additional data segments; the age limit of the first instance of the data segment comprises an age limit of the first container that indicates an age of the first container after which data segments stored within the first container cannot be referenced by any additional data objects that are later stored to the deduplicating data system; preventing storage-space fragmentation within the deduplicating data system by storing the second instance of the data segment within the deduplicating data system comprises storing the second instance of the data segment within a second container within the deduplicating data system that is separate and distinct from the first container. 3. The computer-implemented method of claim 1 , wherein: the first instance of the data segment is stored within a first container that is stored within the deduplicating data system; the first container also stores a plurality of additional data segments; the age limit of the first instance of the data segment comprises an age limit of the first container that indicates an age of the first container after which the first container cannot be referenced by any additional data objects that are later stored to the deduplicating data system; preventing storage-space fragmentation within the deduplicating data system by storing the second instance of the data segment within the deduplicating data system comprises storing the second instance of the data segment within a second container within the deduplicating data system that is separate and distinct from the first container. 4. The computer-implemented method of claim 1 , wherein preventing storage-space fragmentation within the deduplicating data system by storing the second instance of the data segment within the deduplicating data system comprises storing the second instance of the data segment within a container within the deduplicating data system that is separate and distinct from the container within the deduplicating data system within which the first instance of the data segment is stored. 5. The computer-implemented method of claim 1 , further comprising: determining whether any data objects within the deduplicating data system reference the first instance of the data segment; deleting the first instance of the data segment based at least in part on determining that no data objects within the deduplicating data system reference the first instance of the data segment. 6. The computer-implemented method of claim 1 , further comprising: detecting that the first instance of the data segment has become corrupt; using the second instance of the data segment to repair the first instance of the data segment. 7. The computer-implemented method of claim 1 , further comprising: detecting that the second instance of the data segment has become corrupt; using the first instance of the data segment to repair the second instance of the data segment. 8. A system for managing references in deduplicating data systems, the system comprising: an identifying module that identifies: a first instance of a data segment stored within a deduplicating data system that reduces redundant data storage by storing at least two data objects such that each of the two data objects references the first instance of the data segment; an additional data object to be stored by the deduplicating data system, wherein the additional data object comprises the data segment; a determining module that determines that an age limit of the first instance of the data segment has been reached, wherein the age limit of the first instance of the data segment: limits the amount of time during which the first instance of the data segment is available to be referenced; and indicates an age of the first instance of the data segment after which the first instance of the data segment cannot be referenced by any additional data objects that are later stored to the deduplicating data system; a storing module that prevents storage-space fragmentation within the deduplicating data system by storing, based at least in part on determining that the age limit of the first instance of the data segment has been reached, a second instance of the data segment within the deduplicating data system that is referenced by the additional data object while also retaining the first instance of the data segment within the deduplicating data system; at least one processor that executes the identifying module, the determining module, and the storing module. 9. The system of claim 8 , wherein: the first instance of the data segment is stored within a first container that is stored within the deduplicating data system; the first container also stores a plurality of additional data segments; the age limit of the first instance of the data segment comprises an age limit of the first container that indicates an age of the first container after which data segments stored within the first container cannot be referenced by any additional data objects that are later stored to the deduplicating data system; the storing module stores the second instance of the data segment within the deduplicating data system by storing the second instance of the data segment within a second container within the deduplicating data system that is separate and distinct from the first container. 10. The system of claim 8 , wherein: the first instance of the data segment is stored within a first container that is stored within the deduplicating data system; the first container also stores a plurality of additional data segments; the age limit of the first instance of the data segment comprises an age limit of the first container that indicates an age of the first container after which the first container cannot be referenced by any additional data objects that are later stored to the deduplicating data system; the storing module stores the second instance of the data segment within the deduplicating data system

Assignees

Inventors

Classifications

  • Physics · mapped topic

  • De-duplication implemented within the file system, e.g. based on file segments (de-duplication techniques in storage systems for the management of data blocks G06F3/0641) · CPC title

  • Details of free space management performed by the file system (saving storage space on storage systems G06F3/0608; management of blocks in storage devices G06F3/064) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9256612B1 cover?
A computer-implemented method for managing references in deduplicating data systems may include (1) identifying a first instance of a data segment stored within a deduplicating data system that reduces redundant data storage by storing at least two data objects such that each of the two data objects references the first instance of the data segment, (2) identifying an additional data object to …
Who is the assignee on this patent?
Symantec Corp
What technology area does this patent fall under?
Primary CPC classification G06F17/30156. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 09 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).