Method for lineage sampling to efficiently detect corruptions

US2024403286A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2024403286-A1
Application numberUS-202418800404-A
CountryUS
Kind codeA1
Filing dateAug 12, 2024
Priority dateSep 30, 2022
Publication dateDec 5, 2024
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Corruption detection in backups is disclosed. Backups that are received into a backup environment are stored in corresponding lineages. A detection engine is configured to perform corruption detection operations on the most recent backups in each of the lineages based on a sample frequency. Corruption detection operations may also be performed randomly and based on unexpected or unusual changes in backup metadata.

First claim

Opening claim text (preview).

1 . A method for performing a corruption detection operation that includes read operations in a data protection system, comprising: receiving backups into the data protection system configured to store the backups, wherein the backups are associated with corresponding lineages, wherein each of the lineages is associated with a sample frequency that is independent of a backup cadence; selecting lineages subject to the corruption detection operation based on the sample frequencies of the lineages and based on a read capacity associated with the data protection system; and performing the corruption detection operation on a most recent backup in each of the selected lineages according to the associated sample frequency. 2 . The method of claim 1 , further comprising pruning at least one of the selected linages when the read capacity is above a threshold. 3 . The method of claim 2 , wherein pruning includes skipping a lineage with a comparatively lower sample frequency or skipping a lineage stored in a lower tier of storage. 4 . The method of claim 1 , wherein the backups comprise synthetic full backups, incremental backups, and/or always full backups and wherein the corruption detection operation is not performed on some of the backups in some of the lineages. 5 . The method of claim 1 , wherein the corruption detection operation includes generating statistics, inferences, and/or probabilities related to malware or other unwanted content. 6 . The method of claim 1 , wherein the sample frequency is included in metadata associated with the backups. 7 . The method of claim 1 , further comprising selecting at least one of the lineages randomly rather than the associated sample frequency. 8 . The method of claim 1 , further comprising triggering dynamic scans based on metadata changes. 9 . The method of claim 8 , wherein metadata changes that trigger dynamic scans include a file size change greater than a threshold amount or an unexpected input/output (IO) pattern. 10 . The method of claim 1 , wherein the backup environment comprises a physical or virtual appliance that is accessed via an air gap. 11 . The method of claim 1 , further comprising recovering from a copy that is clean. 12 . A non-transitory storage medium having stored therein instructions that are executable by one or more hardware processors to perform operations for performing a corruption detection operation that includes read operations in a data protection system, the operations comprising: receiving backups into the data protection system configured to store the backups, wherein the backups are associated with corresponding lineages, wherein each of the lineages is associated with a sample frequency that is independent of a backup cadence; selecting lineages subject to the corruption detection operation based on the sample frequencies of the lineages and based on a read capacity associated with the data protection system; and performing the corruption detection operation on a most recent backup in each of the selected lineages according to the associated sample frequency. 13 . The method of claim 1 , further comprising pruning at least one of the selected linages when the read capacity is above a threshold. 14 . The method of claim 2 , wherein pruning includes skipping a lineage with a comparatively lower sample frequency or skipping a lineage stored in a lower tier of storage. 15 . The method of claim 1 , wherein the backups comprise synthetic full backups, incremental backups, and/or always full backups and wherein the corruption detection operation is not performed on some of the backups in some of the lineages. 16 . The method of claim 1 , wherein the corruption detection operation includes generating statistics, inferences, and/or probabilities related to malware or other unwanted content. 17 . The method of claim 1 , wherein the sample frequency is included in metadata associated with the backups. 18 . The method of claim 1 , further comprising selecting at least one of the lineages randomly rather than the associated sample frequency. 19 . The method of claim 1 , further comprising triggering dynamic scans based on metadata changes, wherein metadata changes that trigger dynamic scans include a file size change greater than a threshold amount or an unexpected input/output (IO) pattern. 20 . The method of claim 1 , wherein the backup environment comprises a physical or virtual appliance that is accessed via an air gap, further comprising recovering from a copy that is clean.

Assignees

Inventors

Classifications

  • by selection of backup contents · CPC title

  • Using snapshots, i.e. a logical point-in-time copy of the data · CPC title

  • Ensuring data consistency and integrity · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2024403286A1 cover?
Corruption detection in backups is disclosed. Backups that are received into a backup environment are stored in corresponding lineages. A detection engine is configured to perform corruption detection operations on the most recent backups in each of the lineages based on a sample frequency. Corruption detection operations may also be performed randomly and based on unexpected or unusual changes…
Who is the assignee on this patent?
Dell Products Lp
What technology area does this patent fall under?
Primary CPC classification G06F11/1451. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Dec 05 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).