Methods and systems for data backup based on data classification

US11030054B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11030054-B2
Application numberUS-201916257498-A
CountryUS
Kind codeB2
Filing dateJan 25, 2019
Priority dateJan 25, 2019
Publication dateJun 8, 2021
Grant dateJun 8, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods for intelligent backup of data are disclosed. The methods include maintaining a plurality of data storage systems in communication with an external metadata management system, operating the metadata management system to store metadata corresponding to data residing on the plurality of data storage systems, identifying a candidate data set residing on at least one of the plurality of data storage systems on which at least one backup action should be performed based on information included in the metadata management system, and identifying the at least one backup action in response to identifying the candidate data set.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-executed method comprising: maintaining a plurality of data storage systems for storing electronic data containing one or more processors having circuitry and logic that perform calculations and logic operations in communication with an external metadata management system containing one or more processors having circuitry and logic that perform calculations and logic operations; operating the metadata management system to store metadata corresponding to the electronic data stored on the plurality of data storage systems; identifying, using information included in the metadata management system, a candidate electronic data set stored on at least one of the plurality of data storage systems on which at least one of a plurality of backup actions should be performed; in response to identifying the candidate electronic data set, identifying the at least one of the plurality of backup actions, wherein identifying the at least one of a plurality of backup actions comprises: identifying one or more custom tags for metadata corresponding to the candidate electronic data set; and using the one or more custom tags to identify the at least one of a plurality of backup actions; and executing the at least one of a plurality of backup actions on the candidate electronic data set stored on the plurality of data storage systems, wherein each of the plurality of backup actions comprises storing the candidate electronic data set in secondary storage different than the plurality of data storage systems and in a form different than the format stored on the plurality of data storage systems. 2. The method of claim 1 , wherein the at least one of a plurality of backup actions comprises at least one of the following: a full backup, a partial backup, or a snapshot operation. 3. The method of claim 1 , wherein identifying the at least one of a plurality of backup actions comprises: extracting one or more facets of the candidate electronic data set stored with the metadata in the metadata management system; and using the one or more facets to identify the at least one backup action. 4. The method of claim 3 , wherein the extracted one or more facets are identified by performing data analytics on the candidate electronic data set. 5. The method of claim 3 , wherein the extracted one or more facets are identified by performing data analytics on at least one component of metadata corresponding to the candidate electronic data set. 6. The method of claim 1 , wherein identifying the candidate electronic data set residing on at least one of the plurality of data storage systems on which at least one of a plurality of backup actions should be performed comprises receiving a query from a user that includes one or more rules for selecting the candidate electronic data set using metadata stored in the metadata management system. 7. The method of claim 1 , wherein identifying the candidate electronic data set residing on at least one of the plurality of data storage systems on which at least one of a plurality of backup actions should be performed comprises identifying the candidate electronic data set based on metadata received in response to a data operation event performed on the candidate electronic data set. 8. The method of claim 1 , further comprising: identifying a backup level associated with the candidate electronic data set based on at least one of the group consisting of: one or more facets extracted from the candidate electronic data set and stored with the metadata in the metadata management system, one or more facets extracted from metadata associated with the candidate electronic data set and stored with the metadata in the metadata management system, one or more custom tags corresponding to metadata associated with the candidate electronic data set, and combinations thereof; and using the backup level to identify the at least one of a plurality of backup actions. 9. A non-transitory computer readable medium comprising programming instructions executable on a processor that when executed cause the processor to: maintain a plurality of data storage systems for storing electronic data in communication with an external metadata management system; operate the metadata management system to store metadata corresponding to the electronic data stored on the plurality of data storage systems; identify, using information included in the metadata management system, a candidate electronic data set residing on at least one of the plurality of data storage systems on which at least one of a plurality of backup actions should be performed, wherein causing the processor to identify the at least one of a plurality of backup actions comprises causing the processor to: identify one or more custom tags for metadata corresponding to the candidate electronic data set; and use the one or more custom tags to identify the at least one of a plurality of backup actions; in response to identifying the candidate electronic data set, identify the at least one of a plurality of backup actions; and execute the at least one of a plurality of backup actions on the candidate electronic data set stored on the plurality of data storage systems, wherein each of the plurality of backup actions comprises storing the candidate electronic data set in secondary storage different than the plurality of data storage systems and in a form different than the format stored on the plurality of data storage systems. 10. The non-transitory computer readable medium of claim 9 , wherein the at least one backup action comprises at least one of the following: a full backup, a partial backup, a copy operation, or a snapshot operation. 11. The non-transitory computer readable medium of claim 9 , wherein causing the processor to identify the at least one of a plurality of backup actions comprises causing the processor to: extract one or more facets of the candidate electronic data set stored with the metadata in the metadata management system; and use the one or more facets to identify the at least one of a plurality of backup actions. 12. The non-transitory computer readable medium of claim 11 , wherein the extracted one or more facets are identified by performing data analytics on the candidate electronic data set. 13. The non-transitory computer readable medium of claim 11 , wherein the extracted one or more facets are identified by performing data analytics on at least one component of metadata corresponding to the candidate electronic data set. 14. The non-transitory computer readable medium of claim 11 , further comprising programming instructions that when executed cause the processor to: identify a backup level associated with the candidate electronic data set based on at least one of the group consisting of: one or more facets extracted from the candidate electronic data set and stored with the metadata in the metadata management system, one or more facets extracted from metadata associated with the candidate electronic data set and stored with the metadata in the metadata management system, one or more custom tags corresponding to metadata associated with the candidate electronic data set, and combinations thereof; and use the backup level to identify the at least one of a plurality of backup actions. 15. The non-transitory computer readable medium of claim 11 , further comprising programming instructions that when executed cause the processor to: receive a real-time alert comprising a threat to the at least one of a plurality of backup actions; and identify at least one remedial action for countering the threat.

Assignees

Inventors

Classifications

  • by selection of backup contents · CPC title

  • where the computing system component is a storage system, e.g. DASD based or network based (digital input from or digital output to record carriers G06F3/06; digital recording or reproducing G11B20/18; for distributed storage of data in networks, e.g. transport arrangements for network file system [NFS], storage area networks [SAN] or network attached storage [NAS], H04L67/1097) · CPC title

  • Monitoring arrangements determined by the means or processing involved in reporting the monitored data (error or fault reporting or logging G06F11/0766) · CPC title

  • Event-based monitoring · CPC title

  • Details of searching files based on file metadata · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11030054B2 cover?
Systems and methods for intelligent backup of data are disclosed. The methods include maintaining a plurality of data storage systems in communication with an external metadata management system, operating the metadata management system to store metadata corresponding to data residing on the plurality of data storage systems, identifying a candidate data set residing on at least one of the plur…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F11/1451. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jun 08 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).