Historical network data-based scanning thread generation

US2018121297A1 · US · A1

Patent metadata
Field               Value
Publication number  US-2018121297-A1
Application number  US-201615341878-A
Country             US
Kind code           A1
Filing date         Nov 2, 2016
Priority date       Nov 2, 2016
Publication date    May 3, 2018
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Embodiments disclosed herein address the need to more efficiently backup a network-based storage environment that may be remote from a primary storage environment. For example, embodiments herein can provide a more efficient backup of a storage managed by a third-party entity. To improve the backup process, embodiments herein may optimize the number of scanning threads that are used to identify files that are to be backed up by, for example, analyzing the characteristics of the network and/or the network storage system to determine a number of scanning threads that will enable faster scanning of the network storage system while at the same time not overburden or be slowed down by a network between the network-based storage environment and the primary storage environment.
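To make the idea of "scanning threads" concrete, here is a minimal sketch, not the patented implementation. It assumes a local directory tree stands in for the network storage repository, that a file needs backup if it was modified after a cutoff timestamp, and that work is split by top-level directory. The names `scan_subtree`, `scan_repository`, and `modified_after` are hypothetical and do not come from the patent.

```python
import os
from concurrent.futures import ThreadPoolExecutor


def scan_subtree(root, modified_after):
    """Walk one subtree and return paths modified after the cutoff timestamp."""
    found = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            # Assumption: "needs backup" means "modified after the cutoff".
            if os.path.getmtime(path) > modified_after:
                found.append(path)
    return found


def scan_repository(repo_root, num_threads, modified_after):
    """Split the repository by top-level entry and scan the pieces in parallel."""
    subtrees, candidates = [], []
    for entry in os.scandir(repo_root):
        if entry.is_dir(follow_symlinks=False):
            subtrees.append(entry.path)
        elif entry.stat().st_mtime > modified_after:
            # Files directly under the root are checked without a worker thread.
            candidates.append(entry.path)
    # num_threads is the value the patent proposes to tune from network data.
    with ThreadPoolExecutor(max_workers=num_threads) as pool:
        for result in pool.map(lambda p: scan_subtree(p, modified_after), subtrees):
            candidates.extend(result)
    return sorted(candidates)
```

In this sketch `num_threads` is simply passed in; the abstract's point is that this number should be chosen from the characteristics of the network and the storage system rather than fixed in advance.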

First claim

Opening claim text (preview).

What is claimed is:

1. A computer-implemented method of performing multi-threaded scanning of a network storage system, the computer-implemented method comprising: as implemented by a data agent within a remote storage access system comprising one or more hardware processors and configured with specific computer-executable instructions, determining whether historical network characteristics of a network used to communicate with a network storage system exist; and in response to determining that the historical network characteristics exist: accessing the historical network characteristics; selecting an amount of scanning threads to initiate based at least in part on the historical network characteristics, wherein the scanning threads are configured to scan a network storage repository of the network storage system to identify files to backup; triggering initiation of the amount of the scanning threads at the network storage system; and initiating scanning of the network storage repository using the scanning threads to identify the files to backup.

2. The computer-implemented method of claim 1, wherein scanning the network storage repository comprises traversing a file structure at the network storage repository.

3. The computer-implemented method of claim 1, further comprising: receiving a scan result from the network storage system corresponding to a result of scanning the network storage repository; and initiating backup of the network storage repository based at least in part on the scan result.

4. The computer-implemented method of claim 3, wherein the scan result identifies files to be backed up and a location of the files at the network storage repository.

5. The computer-implemented method of claim 3, further comprising: receiving files to be backed up responsive to the initiating backup of the network storage repository; and providing the files to a media agent for backup at a secondary storage system.

6. The computer-implemented method of claim 1, wherein the historical network characteristics are determined based at least in part on communication with the network storage system during performance of a particular number of backup processes occurring during a particular time period prior to the accessing of the historical network characteristics.

7. The computer-implemented method of claim 1, wherein the historical network characteristics are determined based at least in part on the transmission of a number of test packets during a time period occurring prior to the accessing of the historical network characteristics.

8. The computer-implemented method of claim 1, further comprising determining current network characteristics of the network, wherein the selecting the amount of the scanning threads is based at least in part on an aggregation of the historical network characteristics and the current network characteristics.

9. The computer-implemented method of claim 8, wherein the historical network characteristics and the current network characteristics are weighted differently in the aggregation of the historical network characteristics and the current network characteristics.

10. The computer-implemented method of claim 8, further comprising updating the historical network characteristics based on the current network characteristics for subsequent scanning of the network storage repository.

11. The computer-implemented method of claim 1, wherein, in response to determining that the historical network characteristics do not exist, the computer-implemented method further comprises: determining current network characteristics of the network; and selecting the amount of the scanning threads based at least in part on the current network characteristics.

12. A system for performing multi-threaded scanning of a network storage system, the system comprising: a remote storage access system implemented in computer hardware, the remote storage access system configured to: determine whether historical network characteristics of a network used to communicate with a network storage system exist; and in response to determining that the historical network characteristics exist: access the historical network characteristics; select an amount of scanning threads to initiate based at least in part on the historical network characteristics, wherein the scanning threads are configured to scan a network storage repository of the network storage system to identify files to backup; trigger initiation of the amount of the scanning threads at the network storage system; and initiate scanning of the network storage repository using the scanning threads to identify the files to backup.

13. The system of claim 12, wherein the remote storage access system is further configured to: receive a scan result from the network storage system corresponding to a result of scanning the network storage repository; and initiate backup of the network storage repository based at least in part on the scan result.

14. The system of claim 13, wherein the remote storage access system is further configured to: receive files to be backed up responsive to the initiating backup of the network storage repository; and provide the files to a media agent for backup at a secondary storage system.

15. The system of claim 12, wherein the historical network characteristics are determined based at least in part on communication with the network storage system during performance of a particular number of backup processes occurring during a particular time period prior to the accessing of the historical network characteristics.

16. The system of claim 12, wherein the historical network characteristics are determined based at least in part on the transmission of a number of test packets during a time period occurring prior to the accessing of the historical network characteristics.

17. The system of claim 12, wherein the remote storage access system is further configured to determine current network characteristics of the network, wherein the selecting the amount of the scanning threads is based at least in part on an aggregation of the historical network characteristics and the current network characteristics.

18. The system of claim 17, wherein the historical network characteristics and the current network characteristics are weighted differently in the aggregation of the historical network characteristics and the current network characteristics.

19. The system of claim 17, wherein the remote storage access system is further configured to update the historical network characteristics based on the current network characteristics for subsequent scanning of the network storage repository.

20. The system of claim 12, wherein, in response to determining that the historical network characteristics do not exist, the remote storage access system is further configured to: determine current network characteristics of the network; and select the amount of the scanning threads based at least in part on the current network characteristics.
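The decision flow in claims 1 and 8 to 11 can be sketched as follows. This is an illustration under stated assumptions, not the claimed implementation: the claims do not say which network characteristics are measured, how they are aggregated, or how they map to a thread count. Here bandwidth is assumed to be the only characteristic, the aggregation is assumed to be a weighted average, and the thread count is assumed to be one thread per 10 Mbps, clamped to a range. All names and constants (`NetworkCharacteristics`, `plan_scan`, `mbps_per_thread`, `max_threads`) are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class NetworkCharacteristics:
    # Assumption: bandwidth is the only tracked characteristic in this sketch.
    bandwidth_mbps: float


def aggregate(historical, current, historical_weight):
    """Weighted average of historical and current values (claims 8 and 9)."""
    w = historical_weight
    return NetworkCharacteristics(
        bandwidth_mbps=w * historical.bandwidth_mbps
        + (1 - w) * current.bandwidth_mbps
    )


def select_thread_count(net, mbps_per_thread=10.0, max_threads=64):
    """Hypothetical heuristic: one thread per mbps_per_thread, clamped to [1, max]."""
    return max(1, min(max_threads, int(net.bandwidth_mbps // mbps_per_thread)))


def plan_scan(
    historical: Optional[NetworkCharacteristics],
    current: NetworkCharacteristics,
    historical_weight: float = 0.7,
) -> Tuple[int, NetworkCharacteristics]:
    """Return the thread count and the characteristics to store for next time."""
    if historical is None:
        # Claim 11: no history exists, so rely on current measurements alone.
        basis = current
    else:
        # Claims 8 and 9: combine history and current data with unequal weights.
        basis = aggregate(historical, current, historical_weight)
    # Claim 10 (one possible reading): the combined value becomes the new history.
    return select_thread_count(basis), basis
```

For example, with a historical bandwidth of 100 Mbps, a current measurement of 200 Mbps, and equal weights, the aggregated value is 150 Mbps and the sketch selects 15 threads. With no history, the current 200 Mbps alone yields 20 threads.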

Assignees

Commvault Systems Inc

Inventors

Classifications

  • for networked environments

  • Virtual

  • for distributed storage of data in networks, e.g. transport arrangements for network file system [NFS], storage area networks [SAN] or network attached storage [NAS]

  • maintaining the standby controller/processing unit updated (initialisation or re-synchronisation thereof G06F11/1658 and subgroups)

  • where the reporting involves data filtering, e.g. pattern matching, time or event triggered, adaptive or policy-based reporting

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2018121297A1 cover?
Embodiments disclosed herein address the need to more efficiently backup a network-based storage environment that may be remote from a primary storage environment. For example, embodiments herein can provide a more efficient backup of a storage managed by a third-party entity. To improve the backup process, embodiments herein may optimize the number of scanning threads that are used to identify…
Who is the assignee on this patent?
Commvault Systems Inc
What technology area does this patent fall under?
Primary CPC classification G06F11/1464. Mapped technology areas include Physics.
When was this patent published?
Publication date: May 3, 2018 (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).