Shared scan output in incremental data analysis systems

US2016259809A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2016259809-A1
Application numberUS-201514636221-A
CountryUS
Kind codeA1
Filing dateMar 3, 2015
Priority dateMar 3, 2015
Publication dateSep 8, 2016
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Solutions are provided that use shared scan phases and scan output for various file-level incremental data analysis systems. In one embodiment, a shared scan phase is initiated for a plurality of files in a file system. During the shared scan phase, one or more rules are applied to the files in the file system to identify files on which to perform one or more operations. Shared scan output is created that includes information describing the identified files and operations to be performed on the identified files. Embodiments of the present invention can reduce the amount of time and computing resources that would otherwise be consumed by performing separate walkthroughs of a file system during separate scan phases.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method for creating shared scan output for file-level incremental data analysis processes, the method comprising: initiating, by one or more computer processors, a shared scan phase for a plurality of files in a file system; during the shared scan phase, applying, by one or more computer processors, one or more rules to each of the plurality of files in the file system to identify files on which to perform one or more operations; and creating, by one or more computer processors, shared scan output that includes information describing the identified files and the one or more operations to be performed on the identified files. 2 . The method of claim 1 , further comprising: creating, by one or more computer processors, one or more fault lists including failed files on which operations have not been successfully performed by one or more file-level incremental data analysis processes; and adding, by one or more computer processors, the failed files from the one or more fault lists to the shared scan output. 3 . The method of claim 2 , wherein the failed files from the one or more fault lists are added to the shared scan output, such that one or more file-level incremental data analysis processes will perform one or more operations on the failed files prior to performing one or more operations on remaining files included in the shared scan output. 4 . The method of claim 1 , wherein the shared scan output comprises file location information for each of the identified files, along with flags indicating operations that should be performed on each of the identified files by one or more file-level incremental data analysis processes. 5 . The method of claim 1 , wherein the one or more operations comprise one or more of: a backup operation, a replication operation, a delete operation, and a modification operation. 6 . The method of claim 5 , further comprising: copying, by one or more computer processors, one or more of the identified files to a backup storage pool; and copying, by one or more computer processors, one or more of the identified files to a replication storage pool. 7 . The method of claim 5 , further comprising: deleting, by one or more computer processors, one or more of the identified files from the file system. 8 . A computer program product for creating shared scan output for file-level incremental data analysis processes, the computer program product comprising: one or more computer readable storage media and program instructions stored on the one or more computer readable storage media, the program instructions comprising: program instructions to initiate a shared scan phase for a plurality of files in a file system; program instructions to, during the shared scan phase, apply one or more rules to each of the plurality of files in the file system to identify files on which to perform one or more operations; and program instructions to create shared scan output that includes information describing the identified files and the one or more operations to be performed on the identified files. 9 . The computer program product of claim 8 , wherein the program instructions stored on the one or more computer readable storage media further comprise: program instructions to create one or more fault lists including failed files on which operations have not been successfully performed by one or more file-level incremental data analysis processes; and program instructions to add the failed files from the one or more fault lists to the shared scan output. 10 . The computer program product of claim 9 , wherein the failed files from the one or more fault lists are added to the shared scan output, such that one or more file-level incremental data analysis processes will perform one or more operations on the failed files prior to performing one or more operations on remaining files included in the shared scan output. 11 . The computer program product of claim 8 , wherein the shared scan output comprises file location information for each of the identified files, along with flags indicating operations that should be performed on each of the identified files by one or more file-level incremental data analysis processes. 12 . The computer program product of claim 8 , wherein the one or more operations comprise one or more of: a backup operation, a replication operation, a delete operation, and a modification operation. 13 . The computer program product of claim 12 , wherein the program instructions stored on the one or more computer readable storage media further comprise: program instructions to copy one or more of the identified files to a backup storage pool; and program instructions to copy one or more of the identified files to a replication storage pool. 14 . The computer program product of claim 12 , wherein the program instructions stored on the one or more computer readable storage media further comprise: program instructions to delete one or more of the identified files from the file system. 15 . A computer system for creating shared scan output for file-level incremental data analysis processes, the computer system comprising: one or more computer processors; one or more computer readable storage media; and program instructions stored on the one or more computer readable storage media for execution by at least one of the one or more processors, the program instructions comprising: program instructions to initiate a shared scan phase for a plurality of files in a file system; program instructions to, during the shared scan phase, apply one or more rules to each of the plurality of files in the file system to identify files on which to perform one or more operations; and program instructions to create shared scan output that includes information describing the identified files and the one or more operations to be performed on the identified files. 16 . The computer system of claim 15 , wherein the program instructions stored on the one or more computer readable storage media further comprise: program instructions to create one or more fault lists including failed files on which operations have not been successfully performed by one or more file-level incremental data analysis processes; and program instructions to add the failed files from the one or more fault lists to the shared scan output. 17 . The computer system of claim 16 , wherein the failed files from the one or more fault lists are added to the shared scan output, such that one or more file-level incremental data analysis processes will perform one or more operations on the failed files prior to performing one or more operations on remaining files included in the shared scan output. 18 . The computer system of claim 15 , wherein the shared scan output comprises file location information for each of the identified files, along with flags indicating operations that should be performed on each of the identified files by one or more file-level incremental data analysis processes. 19 . The computer system of claim 15 , wherein the one or more operations comprise one or more of: a backup operation, a replication operation, a delete operation, and a modification operation. 20 . The computer system of claim 19 , wherein the program instructions stored on the one or more computer readable storage media further comprise: program instructions to copy one or more of the identified files to a backup storage pool; and program instructions to copy one or more of the identi

Assignees

Inventors

Classifications

  • by selection of backup contents · CPC title

  • G06F16/178Primary

    Techniques for file synchronisation in file systems · CPC title

  • Database-specific techniques · CPC title

  • Physics · mapped topic

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2016259809A1 cover?
Solutions are provided that use shared scan phases and scan output for various file-level incremental data analysis systems. In one embodiment, a shared scan phase is initiated for a plurality of files in a file system. During the shared scan phase, one or more rules are applied to the files in the file system to identify files on which to perform one or more operations. Shared scan output is c…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F16/178. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Sep 08 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).