Redundant and robust distributed deduplication data storage system

US2017193003A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2017193003-A1
Application numberUS-201615299299-A
CountryUS
Kind codeA1
Filing dateOct 20, 2016
Priority dateDec 30, 2015
Publication dateJul 6, 2017
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods are provided herein for automatically configuring newly installed secondary storage computing devices and managing secondary storage computing devices when one or more become unavailable. For example, a storage manager can then detect the computing resources available to the newly installed secondary storage computing device, assign a role to the newly installed secondary storage computing device based on the detected computing resources, configure the newly installed secondary storage computing device with deduplication and storage policies used by the other secondary storage computing devices, re-partition secondary storage devices to allocate memory for the newly installed secondary storage computing device, and instruct other secondary storage computing devices to replicate their managed data such that the newly installed secondary storage computing device has access to the replicated data.

First claim

Opening claim text (preview).

What is claimed is: 1 . A networked information management system configured to automatically configure installed data storage computers, the networked information management system comprising: a first data storage computer comprising computer hardware, wherein the first data storage computer is a first control node configured to manage first deduplication information stored in a first deduplication database and direct read and write requests to secondary nodes, and wherein the first data storage computer manages first data in a secondary storage device; a second data storage computer comprising computer hardware, wherein the second data storage computer is installed in the networked information management system after the first data storage computer; and a storage manager comprising computer hardware configured to: detect that the second data storage computer is installed in the networked information management system, determine whether the second data storage computer is a second control node or a first secondary node based on computing resources available to the second data storage computer, partition the secondary storage device such that a first portion of the secondary storage device is assigned to the first data storage computer and a second portion of the secondary storage device is assigned to the second data storage computer, and instruct the first data storage computer to replicate the first data and transmit the replicated first data to the second data storage computer for storage in the second portion of the secondary storage device. 2 . The networked information management system of claim 1 , wherein the first data storage computer is configured with a deduplication policy and a storage policy, and wherein the storage manager is further configured to configure the second data storage computer with the deduplication policy and the storage policy. 3 . The networked information management system of claim 1 , wherein the second data storage computer is the second control node, and wherein the storage manager is further configured to instruct the first data storage computer to replicate the first deduplication information and transmit the replicated first deduplication information to the second data storage computer for storage in a second deduplication database. 4 . The networked information management system of claim 3 , wherein the second deduplication database comprises a third portion allocated to the first data storage computer and a fourth portion allocated to the second data storage computer, and wherein the replicated first deduplication information is stored in the third portion of the second deduplication database. 5 . The networked information management system of claim 3 , wherein the replicated first deduplication information comprises at least one of a data block signature, a storage location of a data block, or a count of a number of times the data block is used. 6 . The networked information management system of claim 1 , further comprising a third data storage computer comprising computer hardware, wherein the third data storage computer is a second secondary node, wherein a third portion of the secondary storage device is assigned to the third data storage computer, and wherein the third portion comprises the replicated first data and replicated second data corresponding to the second data storage computer. 7 . The networked information management system of claim 6 , wherein the second data storage computer is the first secondary node, wherein the second data storage computer is unavailable, and wherein the storage manager is further configured to: receive a read request intended for the second data storage computer; and transmit the read request to the first data storage computer, wherein the first data storage computer routes the read request to the third data storage computer instead of the second data storage computer such that the third data storage computer can retrieve a portion of the replicated second data that corresponds with the read request. 8 . The networked information management system of claim 1 , wherein the second portion of the secondary storage device comprises a third portion allocated to the first data storage computer and a fourth portion allocated to the second data storage computer, and wherein the replicated first data is stored in the third portion. 9 . A computer-implemented method for automatically configuring installed data storage computers, the computer-implemented method comprising: detecting that a first data storage computer is installed in a networked information management system, wherein the first data storage computer is a first control node configured to manage first deduplication information stored in a first deduplication database and direct read and write requests to secondary nodes, and wherein the first data storage computer manages first data in a secondary storage device; detecting that a second data storage computer is installed in the networked information management system, wherein the second data storage computer is installed in the networked information management system after the first data storage computer; determining whether the second data storage computer is a second control node or a first secondary node based on computing resources available to the second data storage computer; partitioning the secondary storage device such that a first portion of the secondary storage device is assigned to the first data storage computer and a second portion of the secondary storage device is assigned to the second data storage computer; and instructing the first data storage computer to replicate the first data and transmit the replicated first data to the second data storage computer for storage in the second portion of the secondary storage device. 10 . The computer-implemented method of claim 9 , wherein the first data storage computer is configured with a deduplication policy and a storage policy, and wherein the computer-implemented method further comprises configuring the second data storage computer with the deduplication policy and the storage policy. 11 . The computer-implemented method of claim 9 , wherein the second data storage computer is the second control node, and wherein the computer-implemented method further comprises instructing the first data storage computer to replicate the first deduplication information and transmit the replicated first deduplication information to the second data storage computer for storage in a second deduplication database. 12 . The computer-implemented method of claim 11 , wherein the second deduplication database comprises a third portion allocated to the first data storage computer and a fourth portion allocated to the second data storage computer, and wherein the replicated first deduplication information is stored in the third portion of the second deduplication database. 13 . The computer-implemented method of claim 11 , wherein the replicated first deduplication information comprises at least one of a data block signature, a storage location of a data block, or a count of a number of times the data block is used. 14 . The computer-implemented method of claim 9 , wherein a third data storage computer is a second secondary node, wherein a third portion of the secondary storage device is assigned to the third data storage computer, and wherein the third portion comprises the replicated first data and replicated second data corresponding to the second data storage computer. 15 . The computer-implemented method of claim 14 , wherein the second data storage compute

Assignees

Inventors

Classifications

  • Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor · CPC title

  • Virtual · CPC title

  • Redundant storage or storage space (G06F11/2056 takes precedence) · CPC title

  • Solving problems relating to consistency · CPC title

  • for distributed storage of data in networks, e.g. transport arrangements for network file system [NFS], storage area networks [SAN] or network attached storage [NAS] · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2017193003A1 cover?
Systems and methods are provided herein for automatically configuring newly installed secondary storage computing devices and managing secondary storage computing devices when one or more become unavailable. For example, a storage manager can then detect the computing resources available to the newly installed secondary storage computing device, assign a role to the newly installed secondary st…
Who is the assignee on this patent?
Commvault Systems Inc
What technology area does this patent fall under?
Primary CPC classification H04L67/1097. Mapped technology areas include Electricity.
When was this patent published?
Publication date Thu Jul 06 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).