File list generation method, system, and program, and file list generation device

US8965941B2 · US · B2

Patent metadata
Field               Value
Publication number  US-8965941-B2
Application number  US-201313753208-A
Country             US
Kind code           B2
Filing date         Jan 29, 2013
Priority date       Feb 15, 2012
Publication date    Feb 24, 2015
Grant date          Feb 24, 2015

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A file list generation device creates, at high speed, a list of added, changed, or deleted file data by acquiring the directory URLs of search target files from a first storage, counting the number of directories in each layer through analysis of the directory URLs, and, based on the result of the counting, determines a dividing policy to divide a scan target directory scanning operation, which is to be performed with a distributed server cluster. Directories of the search target files are divided in accordance with the determined dividing policy, and a snapshot of a current directory tree is acquired.
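
The counting step described in the abstract can be illustrated with a small sketch. It tallies distinct directories per layer from a list of directory URLs and then picks a layer at which to split the scan. The function names, the `min_units` threshold, the "shallowest layer with enough directories" rule, and the sample URLs are all assumptions made for illustration; the patent text on this page does not specify them.

```python
from collections import defaultdict
from urllib.parse import urlparse


def count_directories_per_layer(directory_urls):
    """Return {layer: number of distinct directories at that layer}.

    Layer 1 is the highest directory under the URL root.
    """
    seen = defaultdict(set)
    for url in directory_urls:
        parts = [p for p in urlparse(url).path.split("/") if p]
        # Every prefix of the path is itself a directory at its own layer.
        for depth in range(1, len(parts) + 1):
            seen[depth].add(tuple(parts[:depth]))
    return {depth: len(dirs) for depth, dirs in sorted(seen.items())}


def choose_split_layer(layer_counts, min_units):
    """Hypothetical policy: pick the shallowest layer that has at least
    min_units directories; return None to scan everything as one unit."""
    for depth, count in layer_counts.items():
        if count >= min_units:
            return depth
    return None


# Sample directory URLs (invented for this example).
urls = [
    "file://fs01/share/docs/2012/q1",
    "file://fs01/share/docs/2012/q2",
    "file://fs01/share/docs/2013/q1",
    "file://fs01/share/src/lib",
    "file://fs01/share/src/app",
]
counts = count_directories_per_layer(urls)
split_layer = choose_split_layer(counts, min_units=4)
print(counts)       # directories per layer
print(split_layer)  # layer chosen for dividing the scan
```

With these sample URLs, layer 3 is the first layer holding four directories, so a scan could be divided into four units there. A real policy would likely also weigh the number of servers in the cluster, which this sketch ignores.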

First claim

Opening claim text (preview).

What is claimed is:

1. A file list generation method, comprising the steps of: storing index information created about search target files into a first storage area; determining a dividing policy to divide a scan target directory scanning operation to be performed with a distributed server cluster, by acquiring directory URLs of the search target files from the index information stored in the first storage area, and counting the number of directories in each layer through analysis of the directory URLs, the scan target directory scanning operation being divided based on the result of the counting; dividing directories of the search target files in accordance with the determined dividing policy; acquiring a snapshot of a current directory tree of scan target files, and storing the snapshot into a second storage area; scanning, in a distributed manner, the search target files in respective directories in the snapshot of the current directory tree and a snapshot of a directory tree of search target files acquired prior to the determination of the divided policy and stored in a third storage area, the scanning being performed with the distributed server cluster on the basis of division units determined in the determining the dividing policy; and comparing the results of the distributed scans, and creating and outputting a list of files that are added, deleted, or changed after the creation of the index information.

2. A file list generation method, comprising the steps of: storing index information created about search target files into a first storage area; copying a current directory tree of the search target files, and storing the copied current directory tree as a backup directory into a second storage area; determining a dividing policy to divide a scan target directory scanning operation to be performed with a distributed server cluster, by acquiring directory URLs of the search target files from the index information stored in the first storage area, and counting the number of directories in each layer through analysis of the directory URLs, the scan target directory scanning operation being divided based on the result of the counting; dividing the backup directory of the search target files in accordance with the determined dividing policy; scanning, in a distributed manner, the search target files in respective directories in the backup directory of the current directory tree and a backup directory tree of search target files acquired prior to the determination of the divided policy and stored in a third storage area, the scanning being performed with the distributed server cluster on the basis of division units determined in the determining the dividing policy; and comparing the results of the distributed scans, and creating and outputting a list of files that are added, deleted, or changed after the creation of the index information.

3. The file list generation method according to claim 1, wherein the step of determining the dividing policy includes the steps of determining a dividing policy of setting the number of division units as one to perform a search target directory scanning operation when the number of directories counted from the highest layer of directories of search target files is smaller than a predetermined number, performing a first scan, with each one division unit being layers from a directory in the highest layer to a predetermined layer when the number of directories counted from the highest layer exceeds the predetermined number, and determining a dividing policy to scan lower layers of scan target directories that are found through the first scan, with each one division unit being a depth of one layer.

4. The file list generation method according to claim 2, wherein the step of determining the dividing policy includes the steps of determining a dividing policy of setting the number of division units as one to perform a search target directory scanning operation when the number of directories counted from the highest layer of directories of search target files is smaller than a predetermined number, performing a first scan, with each one division unit being layers from a directory in the highest layer to a predetermined layer when the number of directories counted from the highest layer exceeds the predetermined number, and determining a dividing policy to scan lower layers of scan target directories that are found through the first scan, with each one division unit being a depth of one layer.

5. A file list generation system comprising: a processor; and a first storage area, connected to said processor, that stores index information created about search target files and instructions that, when executed by the processor, instruct the processor to perform acts including: acquiring directory URLs of the search target files from the index information stored in the first storage area, counting the number of directories in each layer through analysis of the directory URLs, and, based on the counting, determining a dividing policy to divide a scan target directory scanning operation to be performed with a distributed server cluster; dividing directories of the search target files in accordance with the dividing policy, acquires a snapshot of a current directory tree of scan target files, and storing the snapshot into a second storage area; scanning, in a distributed manner, the search target files in respective directories in the snapshot of the current directory tree and a snapshot of a directory tree of search target files acquired prior to the determination of the divided policy and stored in a third storage area, the scanning being performed with the distributed server cluster on the basis of division units determined by the dividing policy; and comparing the results of the distributed scans performed by the scanning, and creating and outputting a list of files that are added, deleted, or changed after the creation of the index information.

6. A file list generation system comprising: a processor; a first storage area, connected to said processor, that stores index information created about search target files; and a second storage area, connected to said processor, that copies a current directory tree of the search target files, and stores the copied current directory tree as a backup directory, wherein the first storage area stores instructions that, when executed by the processor, instruct the processor to perform acts including: acquiring directory URLs of the search target files from the index information stored in the first storage area, counting the number of directories in each layer through analysis of the directory URLs, and, based on the counting, determining a dividing policy to divide a scan target directory scanning operation to be performed with a distributed server cluster; dividing the backup directory of the search target files in accordance with the determined dividing policy; scanning, in a distributed manner, the search target files in respective directories in the backup directory of the current directory tree stored in the second storage area and a backup directory tree of search target files acquired prior to the determination of the divided policy and stored in a third storage area, the scanning being performed with the distributed server cluster on the basis of division units determined in the dividing policy; and comparing the results of the distributed scans, and creates and outputs a list of files that are added, deleted, or changed after the creation of the index information.

7. The file list generation system according to claim 5, wherein the determining of the dividing policy includes: determining a dividing policy of setting the number of division units as one to perform a search tar
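
The scan-and-compare step recited in the independent claims can be sketched as follows. Each division unit is diffed separately and the per-unit results are merged into one list of added, deleted, and changed files. This is a minimal sketch under stated assumptions: a snapshot is modeled as a dict from file path to a `(size, mtime)` tuple, division units are directory prefixes, and a local thread pool stands in for the distributed server cluster. None of these representations, nor the function names, come from the patent text.

```python
from concurrent.futures import ThreadPoolExecutor


def diff_unit(unit, previous, current):
    """Diff the files of one division unit (assumed: a directory prefix)."""
    prev = {p: m for p, m in previous.items() if p.startswith(unit)}
    curr = {p: m for p, m in current.items() if p.startswith(unit)}
    added = sorted(curr.keys() - prev.keys())
    deleted = sorted(prev.keys() - curr.keys())
    # A file counts as changed when its (size, mtime) metadata differs.
    changed = sorted(p for p in prev.keys() & curr.keys() if prev[p] != curr[p])
    return added, deleted, changed


def diff_snapshots(units, previous, current):
    """Diff every division unit concurrently and merge the results."""
    # Threads stand in for the distributed server cluster in this sketch.
    with ThreadPoolExecutor() as pool:
        results = list(pool.map(lambda u: diff_unit(u, previous, current), units))
    added, deleted, changed = [], [], []
    for a, d, c in results:
        added += a
        deleted += d
        changed += c
    return sorted(added), sorted(deleted), sorted(changed)


# Invented sample snapshots: path -> (size, mtime).
previous = {
    "/share/docs/a.txt": (100, 1),
    "/share/docs/b.txt": (200, 1),
    "/share/src/main.c": (300, 1),
}
current = {
    "/share/docs/a.txt": (100, 1),
    "/share/docs/b.txt": (250, 2),
    "/share/src/util.c": (50, 2),
}
added, deleted, changed = diff_snapshots(
    ["/share/docs/", "/share/src/"], previous, current
)
print(added, deleted, changed)
```

Because the units here are disjoint prefixes, each file is compared exactly once and the merged lists need no deduplication. Overlapping units would require an extra merge step.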

Assignees

Hitachi Solutions Ltd

Inventors

Classifications

  • Physics · mapped topic

  • G06F16/13 · Primary

    File access structures, e.g. distributed indices (arrangements of input from, or output to, record carriers G06F3/06) · CPC title

  • G06F16/11 · Primary

    File system administration, e.g. details of archiving or snapshots (error detection or correction of the data by redundancy in operations G06F11/14) · CPC title

Patent family

Related publications grouped by family.

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US8965941B2 cover?
A file list generation device creates, at high speed, a list of added, changed, or deleted file data by acquiring the directory URLs of search target files from a first storage, counting the number of directories in each layer through analysis of the directory URLs, and, based on the result of the counting, determines a dividing policy to divide a scan target directory scanning operation, which…
Who is the assignee on this patent?
Hitachi Solutions Ltd
What technology area does this patent fall under?
Primary CPC classification G06F17/30091. Mapped technology areas include Physics.
When was this patent published?
Publication date Feb 24, 2015 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).