Predicting scale of data migration between production and archive storage systems, such as for enterprise customers having large and/or numerous files

US2016342633A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2016342633-A1
Application numberUS-201514810546-A
CountryUS
Kind codeA1
Filing dateJul 28, 2015
Priority dateMay 20, 2015
Publication dateNov 24, 2016
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

This application discloses a system for presenting data to a user based on requests to move data from primary storage to secondary storage. The system receives a request to move data that satisfies at least one criterion, from primary to secondary data storage. The system then identifies an initial database query from the request, and further builds a nested database query from the initial database query according to a database schema. Next, the system estimates a size of a result of executing the nested database query. Finally, the system provides for display to the user the estimated size.

First claim

Opening claim text (preview).

What is claimed is: 1 . A computer-implemented method of predicting an amount of data storage needed when moving data from primary data storage to secondary or archive data storage, the method comprising: receiving, at a client computing device, a request to move data that satisfies at least one criterion, wherein the moving of data is from a primary data storage system to a secondary data storage system, and wherein the at least one criterion is related to the age or the size of data; providing an initial database query based on the request, wherein the initial database query is related to an initial table in the database; building a nested database query from the initial database query according to a database schema, wherein building the nested database query includes determining additional tables linked to the initial table; predicting a size of data that will be returned as a result of executing the nested database query; and providing the predicted size before moving the data from the primary data storage system to the secondary data storage system. 2 . The computer-implemented method of claim 1 , further comprising presenting a user interface to a user showing a list of predetermined requests, wherein the request received is from the user and is chosen from the list. 3 . The computer-implemented method of claim 1 , wherein the building includes traversing the database schemas according to foreign keys, and further comprising determining whether the secondary storage system has sufficient data storage space based on the predicted size. 4 . The computer-implemented method of claim 1 , wherein the building includes computing a count of a number of records that match the nested database query, and wherein the building includes computing an aggregate size from sizes of the records that match the nested database query. 5 . The computer-implemented method of claim 1 , wherein the building includes computing a count of records in a database table referenced in the nested database query and a size of a record in the database table. 6 . The computer-implemented method of claim 1 , further comprising determining whether to satisfy the request based on the predicted size, and wherein the determining is based on a computing load of the primary storage system, a computing load of the secondary storage system, an amount of free space on the primary storage system, an amount of free space on the secondary storage system, or a user need of data affected by the request. 7 . The computer-implemented method of claim 1 , further comprising: storing, by a database gate component, a mapping of data to a plurality of different data storage locations, including the primary storage system storing at least one production database and the secondary storage system storing at least one archive database; receiving a data request from an application component, wherein the application logic component manages or receives data from a user interface; at a database gate component, translating the data request to one or more database queries based on a mapping of data between production and archive databases associated with the primary and secondary storage; causing to execute at least two database queries against data in the primary and secondary storage systems; and returning to the application component a combined result of executing the database queries; receiving at a reporting server a copy of metadata from the primary storage system; receiving, at the reporting server, a request for a report of data on the primary storage system; and producing a data report at the reporting server in response to the request, wherein the reporting server generates the data report using the copy of metadata received from the primary storage system. 8 . At least one non-transitory, computer-readable medium carrying instructions, which when executed by at least one data processor, performs a method of presenting data to a user based on requests to move data from primary storage to secondary storage, comprising: receiving, from a client computing device, a request from a user to move data that satisfies a criterion, from a primary storage system to a secondary storage system; identifying an initial database query from the request; building a nested database query from the initial database query according to a database schema; estimating a size of a result of executing the nested database query; and providing the estimated size to be displayed to the user. 9 . The computer-readable medium of claim 8 , wherein the criterion is related to an age or size of the data. 10 . The computer-readable medium of claim 8 , further comprising providing data for a user interface to show to a user a list of predetermined requests, wherein the request received from the user is chosen from the list. 11 . The computer-readable medium of claim 8 , wherein the building includes traversing the database schemas according to foreign keys. 12 . The computer-readable medium of claim 8 , wherein the building includes computing a count of a number of records that match the nested database query, and wherein the building includes computing an aggregate of sizes of the records that match the nested database query. 13 . The computer-readable medium of claim 8 , wherein the building includes computing a count of records in a database table referenced in the nested database query and a size of a record in the database table. 14 . The computer-readable medium of claim 8 , wherein the method further comprises determining whether to satisfy the request based on the estimated size, and wherein the determining is based on a computing load of the primary storage system, a computing load of the secondary storage system, an amount of free space on the primary storage system, an amount of free space on the secondary storage system, or a user need of data affected by the request. 15 . The computer-readable medium of claim 8 , further comprising: storing, by a database gate component, a mapping of data in production and archive databases respectively on the primary and secondary storage systems; receiving a data request from an application component, wherein the application logic component receives data from a user interface; at a database gate component, translating the data request to one or more database queries based on a mapping of data between production and archive databases; causing to execute at least two database queries against data in the primary and secondary storage systems; and returning to the application component a combined result of executing the database queries; receiving at a reporting server a copy of metadata from the primary storage system; receiving, at the reporting server, a request for a report of data on the primary storage system; and producing a data report at the reporting server in response to the request, wherein the reporting server generates the data report using the copy of metadata received from the primary storage system. 16 . A system configured to present data to a user related to requests to move data from primary storage to secondary storage, the system comprising: at least one processor; at least one memory coupled to the processor and storing instructions for execution by the processor, wherein the instructions encode a method comprising: receiving a request from a user to move data that satisfies a criterion from a primary storage system to a secondary storage system; providing the request to a remote computing device to provide an estimated size of dat

Assignees

Inventors

Classifications

  • Database migration support · CPC title

  • Multi-dimensional databases or data warehouses, e.g. MOLAP or ROLAP · CPC title

  • G06F16/22Primary

    Indexing; Data structures therefor; Storage structures · CPC title

  • between a Database Management System and a front-end application · CPC title

  • Managing data history or versioning (querying versioned data G06F16/2474; querying temporal data G06F16/2477) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2016342633A1 cover?
This application discloses a system for presenting data to a user based on requests to move data from primary storage to secondary storage. The system receives a request to move data that satisfies at least one criterion, from primary to secondary data storage. The system then identifies an initial database query from the request, and further builds a nested database query from the initial data…
Who is the assignee on this patent?
Commvault Systems Inc
What technology area does this patent fall under?
Primary CPC classification G06F16/22. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Nov 24 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).