Reliability enhancement in a distributed storage system

US9336091B2 · US · B2

Patent metadata
Publication number: US-9336091-B2
Application number: US-201414198592-A
Country: US
Kind code: B2
Filing date: Mar 6, 2014
Priority date: Mar 6, 2014
Publication date: May 10, 2016
Grant date: May 10, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Machines, systems and methods for enhancing data recovery in a data storage system, the method comprising determining whether one or more data storage mediums in a data storage system are unavailable; determining data that are at a risk of loss, due to said one or more data storage mediums being unavailable; from among the data that is determined to be at the risk of loss, identifying data that is highly vulnerable to loss; and creating one or more temporary replicas of the data that is highly vulnerable to loss.
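
The four steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the `DataObject` class, the function names, the `ssd-cache` medium, and the "fewer than two surviving replicas" rule are all assumptions chosen for illustration (the abstract does not say how "highly vulnerable" is decided).

```python
from dataclasses import dataclass, field


@dataclass
class DataObject:
    name: str
    replica_media: set                      # media holding permanent replicas
    temp_replica_media: set = field(default_factory=set)


def at_risk(objects, unavailable):
    """Step 2: objects with at least one replica on an unavailable medium."""
    return [o for o in objects if o.replica_media & unavailable]


def highly_vulnerable(objects, unavailable, min_live_replicas=2):
    """Step 3: at-risk objects with too few surviving replicas (assumed rule)."""
    return [
        o for o in at_risk(objects, unavailable)
        if len(o.replica_media - unavailable) < min_live_replicas
    ]


def protect(objects, unavailable, temp_medium="ssd-cache", min_live_replicas=2):
    """Step 4: record a temporary replica for each highly vulnerable object."""
    vulnerable = highly_vulnerable(objects, unavailable, min_live_replicas)
    for o in vulnerable:
        o.temp_replica_media.add(temp_medium)
    return [o.name for o in vulnerable]


objects = [
    DataObject("a", {"disk1", "disk2", "disk3"}),
    DataObject("b", {"disk1", "disk2"}),
    DataObject("c", {"disk3", "disk4"}),
]
# Step 1 is assumed to have already found that disk1 is unavailable.
protected = protect(objects, unavailable={"disk1"})
print(protected)  # ['b']
```

Object `a` is at risk but still has two live replicas, and `c` has no replica on the failed disk, so only `b` receives a temporary replica in this sketch.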

First claim

Opening claim text (preview).

What is claimed is:

1. A method for enhancing data recovery in a data storage system, the method comprising: determining whether one or more data storage mediums in a data storage system are unavailable; determining data associated with the one or more data storage mediums in the data storage system that is at a risk of loss, due to the one or more data storage mediums unavailable; identifying a subset of data that is highly vulnerable to loss from the data that is determined to be at the risk of loss, wherein the data that is highly vulnerable to loss is determined based on calculating a probability that the data will be lost during a future failure event; and creating one or more temporary replicas of the data that is highly vulnerable to loss.

2. The method of claim 1 wherein the data that is at the risk of loss comprises data stored on the one or more unavailable data storage mediums.

3. The method of claim 1 wherein at least one temporary replica is created on an external storage device or a solid state storage device.

4. The method of claim 1 wherein the data that is highly vulnerable to loss comprises data with a number of replicas that is less than a predetermined threshold.

5. The method of claim 1 wherein the data that is highly vulnerable to loss is determine based on considering one or more factors that are correlated to possibility of a future failure event.

6. The method of claim 5 wherein the one or more factors comprise at least one of age of a disk on which the data is stored, age of system components in the data storage system, number of viable replicas for the data, error rate associated with reading the data.

7. The method of claim 5 wherein the one or more factors comprise one or more events that influence the probability of data loss or the potential for recovery of the data from one or more replicas of the data.

8. The method of claim 1 wherein number of the temporary replicas created is reduced, in response to determining that the number of replicas for the data is restored to a predetermine threshold.

9. The method of claim 1 wherein a temporary replica is used to recover from data loss.

10. A system for enhancing data recovery in a data storage system, the system comprising: one or more computer processors; one or more computer readable storage media; and program instructions stored on the one or more computer readable storage media for execution by at least one of the one or more computer processors, the program instructions comprising: program instructions to determine whether one or more data storage mediums in a data storage system are unavailable; program instructions to determine data associated with the one or more data storage mediums in the data storage system that is at a risk of loss, due to the one or more data storage mediums being unavailable; program instructions to identify a subset of data that is highly vulnerable to loss from the data that is determined to be at the risk of loss, wherein the data that is highly vulnerable to loss is determined based on calculating a probability that the data will be lost during a future failure event; and program instructions to create one or more temporary replicas of the data that is highly vulnerable to loss.

11. The system of claim 10 wherein the data that is at the risk of loss comprises data stored on the one or more unavailable data storage mediums.

12. The system of claim 10 wherein at least one temporary replica is created on an external storage device or a solid state storage device.

13. The system of claim 10 wherein the data that is highly vulnerable to loss comprises data with a number of replicas that is less than a predetermined threshold.

14. A computer program product comprising a computer readable storage medium having a computer readable program, wherein the computer readable program when executed on a computer causes the computer to: determine whether one or more data storage mediums in a data storage system are unavailable; determine data associated with the one or more data storage mediums in the data storage system that is at a risk of loss, due to the one or more data storage mediums being unavailable; identify a subset of data that is highly vulnerable to loss from the data that is determined to be at the risk of loss, wherein the data that is highly vulnerable to loss is determined based on calculating a probability that the data will be lost during a future failure event; and create one or more temporary replicas of the data that is highly vulnerable to loss.

15. The computer program product of claim 14 wherein the data that is at the risk of loss comprises data stored on the one or more unavailable data storage mediums.

16. The computer program product of claim 14 wherein at least one temporary replica is created on an external storage device or a solid state storage device.

17. The computer program product of claim 14 wherein the data that is highly vulnerable to loss comprises data with a number of replicas that is less than a predetermined threshold.
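
Claims 1, 5, 6 and 8 describe a probability calculation driven by factors such as disk age, read error rate and the number of viable replicas, with temporary replicas reduced once the replica count is restored. The claims give no formula, so the Python sketch below is only one possible reading: the linear per-replica model, its coefficients, the independence assumption, and every function and parameter name are hypothetical.

```python
import math


def replica_failure_probability(disk_age_years, read_error_rate,
                                base_rate=0.02, age_factor=0.01):
    """Hypothetical per-replica failure probability (illustrative coefficients)."""
    p = base_rate + age_factor * disk_age_years + read_error_rate
    return min(p, 1.0)


def loss_probability(surviving_replicas):
    """Probability that every surviving replica fails, assuming independence.

    Each replica is a (disk_age_years, read_error_rate) pair. With no
    surviving replicas the empty product is 1, i.e. the data is already lost.
    """
    return math.prod(
        replica_failure_probability(age, err) for age, err in surviving_replicas
    )


def temp_replicas_needed(surviving_replicas, loss_threshold=0.01,
                         target_replicas=3, max_temp=2):
    """Number of temporary replicas to keep for one data object."""
    # Claim 8: once the replica count is restored, temporary copies are dropped.
    if len(surviving_replicas) >= target_replicas:
        return 0
    # Claim 1: only data whose loss probability is high counts as vulnerable.
    if loss_probability(surviving_replicas) <= loss_threshold:
        return 0
    return min(max_temp, target_replicas - len(surviving_replicas))


one_old_disk = [(5, 0.03)]               # single replica on an aging, error-prone disk
two_young_disks = [(1, 0.0), (1, 0.0)]   # two replicas on healthy disks
print(temp_replicas_needed(one_old_disk))      # 2
print(temp_replicas_needed(two_young_disks))   # 0
```

Under these assumed numbers the single aging replica has a loss probability of about 0.10 and triggers two temporary replicas, while the two healthy replicas (about 0.0009 combined) trigger none.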

Assignees

IBM

Inventors

Classifications

  • Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor

  • by selection of backup contents

  • Redundant storage or storage space (G06F11/2056 takes precedence)

  • Management of the backup or restore process

  • Management of the data involved in backup or backup restore

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9336091B2 cover?
Machines, systems and methods for enhancing data recovery in a data storage system, the method comprising determining whether one or more data storage mediums in a data storage system are unavailable; determining data that are at a risk of loss, due to said one or more data storage mediums being unavailable; from among the data that is determined to be at the risk of loss, identifying data that…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F11/1451. Mapped technology areas include Physics.
When was this patent published?
Publication date May 10, 2016 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).