System and method for automating formation and execution of a backup strategy

US12174706B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12174706-B2
Application numberUS-202117488489-A
CountryUS
Kind codeB2
Filing dateSep 29, 2021
Priority dateNov 22, 2017
Publication dateDec 24, 2024
Grant dateDec 24, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Disclosed herein are systems and method for forming and executing a backup strategy. In one aspect, an exemplary method comprises forming a respective backup strategy for each respective file of a plurality of files stored in a data source based on a frequency of occurrence, a desired recovery time, and a criticality of data loss for the respective file. The method further comprises executing the respective backup strategy for the respective file.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for forming and executing a backup strategy comprising: forming a respective backup strategy for each respective file of a plurality of files stored in a data source, by: determining a frequency of occurrence for the respective file based on (1) metadata regarding a location of the respective file, (2) whether an authorship of the respective file matches an owner of the data source, and (3) by comparing at least a portion of the respective file to a frequency database indicating commonality of various files; adding a secure backup strategy to the respective backup strategy for the respective file, when the frequency of occurrence is greater than a frequency of occurrence threshold; adding a distributed backup to the respective backup strategy, when a criticality of data loss for the respective file is higher than a predetermined criticality threshold; adding an instant copying to the respective backup strategy, when the respective file is determined as having a desired recovery time less than a predetermined recovery time threshold; copying the respective file to a distributed backup storage device, when the instant copying is added to the backup strategy; and executing each respective backup strategy for the plurality of files. 2. The method of claim 1 , further comprising: generating the frequency database by inspecting files in a second data source; hashing the files in the second data source; associating the hashes of the files with commonality and availability values in the second data source, wherein comparing at least the portion of the respective file to the frequency database comprises comparing a hash of the respective file to the hashes of the files. 3. The method of claim 2 , wherein the second data source is the Internet. 4. The method of claim 1 , further comprising: adding a local backup to the respective backup strategy, when the criticality of data loss is lower than the predetermined criticality threshold. 5. The method of claim 1 , further comprising: categorizing the respective file into a hierarchy of logical types according to properties of the respective file by: analyzing metadata associated with the respective file; analyzing contents of the respective file; and classifying the respective file based on the metadata associated with the respective file and the contents of the respective file; and wherein forming the respective backup strategy for the respective file is further according to the categorization of the respective file. 6. The method of claim 1 , further comprising: evaluating a confidentiality of the respective file prior to forming the respective backup strategy; determining a degree of confidentiality, when the respective file is confidential; selecting a strength of one or more encryption algorithms according to the determined degree of confidentiality; and encrypting the respective file with the selected encryption algorithm. 7. The method of claim 1 , the respective backup strategy further being formed according to one or more of: an importance, a recovery time, and a recovery point objective of the respective file. 8. The method of claim 1 , wherein the frequency of occurrence is determined based on a deep learning analysis of one of: a local data set and an external data set, wherein the local data set comprises an archive and the external data set comprises the Internet. 9. The method of claim 8 , wherein the determination of the frequency of occurrence based on the deep learning analysis comprises one or more of: considering the authorship of the respective file, considering the metadata regarding the location of the respective file in combination with the authorship of the respective file, and considering an availability of the respective file in the local and external data sets. 10. A system for forming and executing a backup strategy, comprising: a hardware processor configured to: form a respective backup strategy for each respective file of a plurality of files stored in a data source, by: determining a frequency of occurrence for the respective file based on (1) metadata regarding a location of the respective file, (2) whether an authorship of the respective file matches an owner of the data source, and (3) by comparing at least a portion of the respective file to a frequency database indicating commonality of various files; adding a secure backup strategy to the respective backup strategy for the respective file, when the frequency of occurrence is greater than a frequency of occurrence threshold; adding a distributed backup to the respective backup strategy, when a criticality of data loss for the respective file is higher than a predetermined criticality threshold; adding an instant copying to the respective backup strategy, when the respective file is determined as having a desired recovery time less than a predetermined recovery time threshold; copying the respective file to a distributed backup storage device, when the instant copying is added to the backup strategy; and execute each respective backup strategy for the plurality of files. 11. The system of claim 10 , wherein the hardware processor is further configured to: generate the frequency database by inspecting files in a second data source; hash the files in the second data source; associate the hashes of the files with commonality and availability values in the second data source, wherein the hardware processor is further configured to compare at least the portion of the respective file to the frequency database by comparing a hash of the respective file to the hashes of the files. 12. The system of claim 11 , wherein the second data source is the Internet. 13. The system of claim 10 , wherein the hardware processor is further configured to: add a local backup to the respective backup strategy, when the criticality of data loss is lower than the predetermined criticality threshold. 14. The system of claim 10 , wherein the hardware processor is further configured to: categorize the respective file into a hierarchy of logical types according to properties of the respective file by: analyzing metadata associated with the respective file; analyzing contents of the respective file; and classifying the respective file based on the metadata associated with the respective file and the contents of the respective file; and wherein forming the respective backup strategy for the respective file is further according to the categorization of the respective file. 15. The system of claim 10 , wherein the hardware processor is further configured to: evaluate a confidentiality of the respective file prior to forming the respective backup strategy; determine a degree of confidentiality, when the respective file is confidential; select a strength of one or more encryption algorithms according to the determined degree of confidentiality; and encrypt the respective file with the selected encryption algorithm. 16. The system of claim 10 , the respective backup strategy further being formed according to one or more of: an importance, a recovery time, and a recovery point objective of the respective file. 17. The system of claim 10 , wherein the frequency of occurrence is determined based on a deep learning analysis of one of: a local data set and an external data set, wherein the local data set comprises an archive and the external data set comprises the Internet. 18. The system of claim 17 , wherein the determination of the frequency of occurrence based on the deep

Assignees

Inventors

Classifications

  • Backup scheduling policy · CPC title

  • Improving or facilitating administration, e.g. storage management · CPC title

  • using management policies (point-in-time backing up or restoration of persistent data G06F11/1446; file migration policies for HSM systems G06F16/185) · CPC title

  • Replication mechanisms · CPC title

  • Management of files · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12174706B2 cover?
Disclosed herein are systems and method for forming and executing a backup strategy. In one aspect, an exemplary method comprises forming a respective backup strategy for each respective file of a plurality of files stored in a data source based on a frequency of occurrence, a desired recovery time, and a criticality of data loss for the respective file. The method further comprises executing t…
Who is the assignee on this patent?
Acronis Int Gmbh
What technology area does this patent fall under?
Primary CPC classification G06F11/1451. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 24 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).