Systems and methods for using metadata to enhance data identification operations

US9606994B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9606994-B2
Application numberUS-201514814173-A
CountryUS
Kind codeB2
Filing dateJul 30, 2015
Priority dateNov 28, 2005
Publication dateMar 28, 2017
Grant dateMar 28, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods for managing electronic data are disclosed. Various data management operations can be performed based on a metabase formed from metadata. Such metadata can be identified from an index of data interactions generated by a journaling module, and obtained from their associated data objects stored in one or more storage devices. In various embodiments, such processing of the index and storing of the metadata can facilitate, for example, enhanced data management operations, enhanced data identification operations, enhanced storage operations, data classification for organizing and storing the metadata, cataloging of metadata for the stored metadata, and/or user interfaces for managing data. In various embodiments, the metabase can be configured in different ways. For example, the metabase can be stored separately from the data objects so as to allow obtaining of information about the data objects without accessing the data objects or a data structure used by a file system.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for identifying data interactions performed within a computer network, the method comprising: identifying data interactions made by at least one application to data objects stored on one or more primary storage devices; obtaining with one or more processors comprising computer hardware, first metadata about the data objects from the one or more primary storage devices; updating with one or more processors comprising computer hardware, a first metabase with the first metadata; receiving a query comprising selection criteria associated with moving data from the one or more primary storage devices to one or more secondary storage devices; identifying with the first metabase, one or more of the data objects which meet the selection criteria without accessing the data objects stored on the primary storage device; copying with one or more processors comprising computer hardware the one or more of the data objects identified by the first metabase to the one or more secondary storage devices; and storing second metadata in a second metabase, the second metadata comprising information about the one or more data objects copied to the one or more secondary storage devices. 2. The method of claim 1 wherein the first and second metabases are stored separately from the data objects. 3. The method of claim 1 further comprising deleting the first metadata from the first metabase associated with the one or more data objects copied to the secondary storage. 4. The method of claim 1 wherein updating the first metabase comprises: determining whether a selected entry in an index of data interactions has an existing record in the first metabase; if no record exists corresponding to the selected entry, creating a new record in the first metabase; and updating the existing record or the new record with at least a part of the information obtained from the selected entry. 5. The method of claim 1 wherein the selection criteria comprises a permission level. 6. The method of claim 1 wherein the first metadata further comprises at least one of: a last modified time, a last accessed time, a data object size and an application type. 7. The method of claim 1 wherein the first metadata is associated with email and information about the data interactions is generated by a journaling program. 8. The method of claim 7 wherein the first metadata comprises at least one of the group consisting of: to/from information for email, creation date, last accessed time, and application type. 9. The method of claim 1 wherein the one or more data objects copied to secondary storage are deleted from the first metabase based on the second metadata in the second metabase. 10. The method of claim 1 wherein the one or more storage devices associated with secondary memory are selected to store the data objects transferred to secondary storage based on the first metadata in the first metabase. 11. A system for identifying data interactions performed within a computer network, the system comprising: a monitor agent comprising at least computer hardware, the monitor agent receives information about data interactions made by at least one other application to data objects stored on one or more primary storage devices, wherein the monitor agent obtains first metadata about the data objects from the one or more primary storage devices; a first metabase, wherein the monitor agent updates the first metabase with the first metadata; and a storage manager comprising at least computer hardware, the storage manager receives selection criteria associated with moving data from the one or more primary storage devices to one or more secondary storage devices, wherein the storage manager identifies with the first metabase, one or more data objects which meet the selection criteria without accessing the data objects stored on the one or more primary storage devices; wherein the storage manager directs the copying of the one or more data objects identified by the first metabase to the one or more secondary storage devices; wherein the storage manager directs storage of second metadata in a second metabase, the second metadata comprising information about the one or more data objects copied to the one or more secondary storage devices. 12. The system of claim 11 wherein the first and second metabases are stored separately from the data objects. 13. The system of claim 11 wherein the storage manager directs deletion of the first metadata from the first metabase associated with the one or more data objects copied to the secondary storage. 14. The system of claim 11 wherein the monitor agents updates the first metadata by: determining whether a selected entry in an index of data interactions has an existing record in the first metabase; if no record exists corresponding to the selected entry, creating a new record in the first metabase; and updating the existing record or the new record with at least a part of the information obtained from the selected entry. 15. The system of claim 11 wherein the selection criteria comprises a permission level. 16. The system of claim 11 wherein the first metadata further comprises at least one of: a last modified time, a last accessed time, a data object size and an application type. 17. The system of claim 11 wherein the first metadata is associated with email and the information about the data interactions is generated by a journaling program. 18. The system of claim 17 wherein the first metadata comprises at least one of the group consisting of: to/from information for email, creation date, last accessed time, and application type. 19. The system of claim 11 wherein the one or more data objects copied to secondary storage are deleted from the first metabase based on the second metadata in the second metabase. 20. The system of claim 11 wherein the one or more storage devices associated with secondary memory are selected to store the data objects transferred to secondary storage based on the first metadata in the first metabase.

Assignees

Inventors

Classifications

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9606994B2 cover?
Systems and methods for managing electronic data are disclosed. Various data management operations can be performed based on a metabase formed from metadata. Such metadata can be identified from an index of data interactions generated by a journaling module, and obtained from their associated data objects stored in one or more storage devices. In various embodiments, such processing of the inde…
Who is the assignee on this patent?
Commvault Systems Inc
What technology area does this patent fall under?
Primary CPC classification G06F3/0605. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Mar 28 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).