Restoring Lost Data

US2023273865A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2023273865-A1
Application numberUS-202217587514-A
CountryUS
Kind codeA1
Filing dateJan 28, 2022
Priority dateJan 28, 2022
Publication dateAug 31, 2023
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Restoring lost data including detecting that a portion of the dataset stored in a first storage system has become unavailable, obtaining an identifier for the portion of the dataset, locating, using the identifier, a replacement portion of the dataset that is stored at one or more other storage systems, and writing, to the dataset that is stored in the first storage system, the replacement portion of the dataset as a replacement of the portion of the dataset that has become unavailable, where the writing occurs automatically, without user intervention.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method of restoring lost data, the method comprising: detecting that a portion of the dataset stored in a first storage system has become unavailable; obtaining an identifier for the portion of the dataset; locating, using the identifier, a replacement portion of the dataset that is stored at one or more other storage systems; and writing, to the dataset that is stored in the first storage system, the replacement portion of the dataset as a replacement of the portion of the dataset that has become unavailable, where the writing occurs automatically, without user intervention. 2 . The method of claim 1 , further comprising: obtaining replication metadata that was generated when replicating the dataset between multiple storage systems, wherein the identifier for the portion of the dataset is obtained based on the replication metadata. 3 . The method of claim 1 , wherein the first storage system and the one or more other storage systems are included in a fleet of storage systems, and wherein the identifier is unique within each storage system within the fleet of storage systems. 4 . The method of claim 1 , wherein the identifier is a deduplication hash value. 5 . The method of claim 1 , wherein the identifier is a file system identifier. 6 . The method of claim 1 , wherein the identifier is a private identifier that is not visible to applications that utilize the dataset. 7 . The method of claim 1 , wherein the identifier is a cblock identifier. 8 . The method of claim 1 , further comprising: obtaining the identifier from an application that was utilizing the dataset prior to the portion of the dataset becoming unavailable. 9 . The method of claim 1 , further comprising: determining a logical location at the first storage system that was associated with the portion of the dataset; and wherein the replacement dataset portion is written to the logical location. 10 . The method of claim 1 , further comprising: querying, using one or more of a segment ID and a segment offset value, the one or more other storage systems for portions of the dataset; and determining that the replacement portion of the dataset is available at a corresponding segment ID and segment offset value at the one or more other storage systems. 11 . The method of claim 1 , wherein a similarity of the replacement portion of the dataset to the portion of the dataset satisfies a threshold similarity. 12 . The method of claim 1 , further comprising: determining that a first timestamp associated with storage of the replacement portion of the dataset is not earlier than a second timestamp associated with storage of the portion of the dataset. 13 . The method of claim 1 , further comprising: determining that a first checksum value associated with the replacement portion of the dataset equals a second checksum value associated with the portion of the dataset. 14 . The method of claim 1 , further comprising: determining that a first error correction code value associated with the replacement portion of the dataset equals a second error correction code value associated with the portion of the dataset. 15 . An apparatus for restoring lost data, the apparatus comprising a computer processor, a computer memory operatively coupled to the computer processor, the computer memory having disposed within it computer program instructions that, when executed by the computer processor, cause the apparatus to carry out the steps of: detecting that a portion of the dataset stored in a first storage system has become unavailable; obtaining an identifier for the portion of the dataset; locating, using the identifier, a replacement portion of the dataset that is stored at one or more other storage systems; and writing, to the dataset that is stored in the first storage system, the replacement portion of the dataset as a replacement of the portion of the dataset that has become unavailable, where the writing occurs automatically, without user intervention. 16 . The apparatus of claim 15 further comprising computer program instructions that, when executed by the computer processor, cause the apparatus to carry out the steps of: obtaining replication metadata that was generated when replicating the dataset between multiple storage systems, wherein the identifier for the portion of the dataset is obtained based on the replication metadata. 17 . The apparatus of claim 15 further comprising computer program instructions that, when executed by the computer processor, cause the apparatus to carry out the steps of: obtaining the identifier from an application that was utilizing the dataset prior to the portion of the dataset becoming unavailable. 18 . The apparatus of claim 15 further comprising computer program instructions that, when executed by the computer processor, cause the apparatus to carry out the steps of: determining a location at the first storage system that was associated with the portion of the dataset; and wherein the replacement dataset portion is written to the location. 19 . The apparatus of claim 15 further comprising computer program instructions that, when executed by the computer processor, cause the apparatus to carry out the steps of: querying, using one or more of a segment ID and a segment offset value, the one or more other storage systems for portions of the dataset; and determining that the replacement portion of the dataset is available at a corresponding segment ID and segment offset value at the one or more other storage systems. 20 . The apparatus of claim 15 further comprising computer program instructions that, when executed by the computer processor, cause the apparatus to carry out the steps of: determining that a first timestamp associated with storage of the replacement portion of the dataset is not earlier than a second timestamp associated with storage of the portion of the dataset.

Assignees

Inventors

Classifications

  • Redundant storage control functionality · CPC title

  • Redundant storage or storage space (G06F11/2056 takes precedence) · CPC title

  • Backup restoration techniques · CPC title

  • using de-duplication of the data · CPC title

  • Using snapshots, i.e. a logical point-in-time copy of the data · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2023273865A1 cover?
Restoring lost data including detecting that a portion of the dataset stored in a first storage system has become unavailable, obtaining an identifier for the portion of the dataset, locating, using the identifier, a replacement portion of the dataset that is stored at one or more other storage systems, and writing, to the dataset that is stored in the first storage system, the replacement port…
Who is the assignee on this patent?
Pure Storage Inc
What technology area does this patent fall under?
Primary CPC classification G06F11/1469. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Aug 31 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).