Archive systems and methods

US9690789B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9690789-B2
Application numberUS-201113316370-A
CountryUS
Kind codeB2
Filing dateDec 9, 2011
Priority dateDec 9, 2011
Publication dateJun 27, 2017
Grant dateJun 27, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Archive systems and methods are presented. In one embodiment, an archival information storage configuration method comprises: performing an information accessing process including determining if the information is associated with an archive process; and performing an archive storage boundary determination process including establishing archive storage boundaries based upon characteristics indicating potential sharing of the information and potential impacts on performance of archival storage operations. In one exemplary implementation, the archive storage boundary determination process comprises: performing an information mining process including identifying an indication the information is potentially shared; and performing an archival boundary selection process including selecting an archive storage boundary based in at least part upon results of the information mining process.

First claim

Opening claim text (preview).

What is claimed is: 1. A method comprising: identifying information that is associated with an archive process; evaluating the information on at least one of a content source level or an archived repository source level to determine that the information is shared by at least one group of users or applications; determining an archive storage boundary for archive duplication checking, the determining being based at least in part upon the determination that the information is shared by the at least one group of users or applications, wherein a limit is placed on a number of users or applications allowed in the at least one group such that the archive process is not excessively delayed for searching duplicative information shared by the at least one group of users or applications; and determining, after determining the archive storage boundary, that the information is duplicate information based on the archive duplication checking using the archive storage boundary. 2. The method of claim 1 , wherein determining the archive storage boundary is further based at least in part upon a comparison of a plurality of potential archive storage boundaries, wherein the archive storage boundary is selected from the plurality of potential archive storage boundaries. 3. The method of claim 1 , wherein determining the archive storage boundary comprises: selecting a preliminary first archive storage boundary; performing a sample duplication check on information within the preliminary first archive storage boundary; comparing a result of performing the sample duplication check on information within the preliminary first archive storage boundary to a result of a sample duplication check on information within a second archive storage boundary; and selecting the preliminary first archive storage boundary or the second archive storage boundary based upon a result of the comparing. 4. The method of claim 1 wherein, the determining the archive storage boundary comprises pattern recognition. 5. The method of claim 4 , wherein the pattern recognition comprises using data access patterns. 6. The method of claim 4 , wherein the pattern recognition is performed on a content source level. 7. The method of claim 4 , wherein the pattern recognition is performed on an archived repository source level. 8. A non-transitory tangible computer readable medium having stored thereon, computer executable instructions that when executed by a processor cause the processor to perform operations comprising: identifying, by the processor, information that is associated with an archive process; evaluating the information on at least one of a content source level or an archived repository source level to determine that the information is shared by at least one group of users or applications; determining an archive storage boundary for archive duplication checking, the determining being based at least in part upon the determination that the information is shared by the at least one group of users or applications, wherein a limit is placed on a number of users or applications allowed in the at least one group such that the archive process is not excessively delayed for searching duplicative information shared by the at least one group of users or applications; and determining, after determining the archive storage boundary, that the information is duplicate information based on the archive duplication checking using the archive storage boundary. 9. The non-transitory tangible computer readable medium of claim 8 , wherein determining the archive storage boundary is further based at least in part upon a comparison of a plurality of potential archive storage boundaries, wherein the archive storage boundary is selected from the plurality of potential archive storage boundaries. 10. The non-transitory tangible computer readable medium of claim 8 , wherein determining the archive storage boundary comprises: selecting a preliminary first archive storage boundary; performing a sample duplication check on information within the preliminary first archive storage boundary; comparing a result of performing the sample duplication check on information within the preliminary first archive storage boundary to result of a sample duplication check on information within a second archive storage boundary; and selecting the preliminary first archive storage boundary or the second archive storage boundary based upon a result of the comparing. 11. The non-transitory tangible computer readable medium of claim 8 , wherein determining the archive storage boundary comprises pattern recognition. 12. The non-transitory tangible computer readable medium of claim 11 , wherein the pattern recognition comprises using data access patterns. 13. The non-transitory tangible computer readable medium of claim 11 , wherein the pattern recognition is performed on a content source level. 14. The non-transitory tangible computer readable medium of claim 11 , wherein the pattern recognition is performed on an archived repository source level. 15. A computer system comprising: a processor coupled to a non-transitory computer readable storage media and executing computer readable code which causes the processor to perform operations comprising: identifying information that is associated with an archive process; evaluating the information on at least one of a content source level or an archived repository source level to determine that the information is shared by at least one group of users or applications; determining an archive storage boundary for archive duplication checking, the determining being based at least in part upon the determination that the information is shared by the at least one group of users or applications, wherein a limit is placed on a number of users or applications allowed in the at least one group such that the archive process is not excessively delayed for searching duplicative information shared by the at least one group of users or applications; and determining, after determining the archive storage boundary, that the information is duplicate information based on the archive duplication checking using the archive storage boundary. 16. The computer system of claim 15 , wherein determining the archive storage boundary is further based at least in part upon a comparison of a plurality of potential archive storage boundaries, wherein the archive storage boundary is selected from the plurality of potential archive storage boundaries. 17. The computer system of claim 15 , wherein determining the archive storage boundary comprises: selecting a preliminary first archive storage boundary; performing a sample duplication check on information within the preliminary first archive storage boundary; comparing a result of performing the sample duplication check on information within the preliminary first archive storage boundary to a result of a sample duplication check on information within a second archive storage boundary; and selecting the preliminary first archive storage boundary or the second archive boundary based upon a result of the comparing. 18. The computer system of claim 15 , wherein determining the archive storage boundary comprises pattern recognition. 19. The computer system of claim 18 , wherein the pattern recognition comprises using data access patterns. 20. The computer system of claim 18 , wherein the pattern recognition is performed on an archived repository source level.

Assignees

Inventors

Classifications

  • G06F16/113Primary

    Details of archiving (lifecycle management in storage systems G06F3/0649; point-in-time backing up or restoration of persistent data G06F11/1446) · CPC title

  • Physics · mapped topic

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9690789B2 cover?
Archive systems and methods are presented. In one embodiment, an archival information storage configuration method comprises: performing an information accessing process including determining if the information is associated with an archive process; and performing an archive storage boundary determination process including establishing archive storage boundaries based upon characteristics indic…
Who is the assignee on this patent?
Dwivedi Alok, Veritas Technologies Llc
What technology area does this patent fall under?
Primary CPC classification G06F16/113. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jun 27 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).