Resilience of data storage systems by managing partial failures of solid state drives

US10719398B1 · US · B1

Patent metadata
FieldValue
Publication numberUS-10719398-B1
Application numberUS-201715653341-A
CountryUS
Kind codeB1
Filing dateJul 18, 2017
Priority dateJul 18, 2017
Publication dateJul 21, 2020
Grant dateJul 21, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Each SSD in an array is divided into splits that are contiguous LBA ranges of equal capacity. The splits are managed as if they were sub-drives or logical drives and used in RAID groups or other forms of redundancy schemes. A processor external to the SSD collects pending defect data log from the SSDs and determines which splits are associated with reported defects. Failed splits are rebuilt in spare splits using RAID or another redundancy scheme. Repaired splits are returned to service as spare splits. Irreparable splits are marked as lost. In the case of SSD failure the data splits of the SSD are rebuilt to a spare SSD. If enough failed splits of the failed SSD can be repaired then the repaired SSD may be returned to service.

First claim

Opening claim text (preview).

What is claimed is: 1. An apparatus comprising: a storage array comprising: at least one computing node comprising at least one processor and non-transitory memory; a drive array comprising a plurality of SSDs (solid state drives), the drive array in communication with the computing node; and at least one processor, external with respect to the SSDs, that implements instructions comprising: organizing instructions that organize the SSDs as a plurality of splits, each split comprising a contiguous range of logical addresses presented by one of the SSDs and having a capacity equal to other splits of the plurality of splits; grouping instructions that group a plurality of spare splits of the plurality of splits into a redundancy group; monitoring instructions that detect a failed split of the redundancy group on a non-failed one of the SSDs; and rebuilding instructions that rebuild the failed split on a spare split. 2. The apparatus of claim 1 wherein the organizing instructions organize the splits into logical slices of equal capacity corresponding to a track, a cylinder, a fraction of SSD capacity, or a percentage of SSD capacity. 3. The apparatus of claim 1 wherein the logical addresses are LBAs (logical block addresses). 4. The apparatus of claim 1 wherein non-contiguous physical addresses map to the contiguous range of logical addresses of each split. 5. The apparatus of claim 4 wherein the non-contiguous physical addresses of each split map to a plurality of dies. 6. The apparatus of claim 1 wherein the monitoring instructions process pending defect data logs from the SSDs to identify splits with which reported defects are associated. 7. The apparatus of claim 6 wherein the rebuilding instructions select the spare split from an SSD having a greatest number of spare splits. 8. The apparatus of claim 1 comprising remediation instructions that attempt to repair the failed split. 9. The apparatus of claim 8 wherein the remediation instructions return repaired splits to service as spare splits and mark unrepairable splits as lost splits. 10. The apparatus of claim 1 wherein the rebuilding instructions rebuild all failed data splits of a failed SSD on a spare SSD, and the remediation instructions return repaired SSDs to service and mark unrepairable SSDs as lost. 11. A method comprising: in a storage array comprising at least one computing node comprising at least one processor and non-transitory memory, and a drive array comprising a plurality of SSDs (solid state drives), the drive array in communication with the computing node: externally with respect to the SSDs, organizing the SSDs as a plurality of splits, each split comprising a contiguous range of logical addresses presented by one of the SSDs and having a capacity equal to other splits of the plurality of splits; grouping a plurality of spare splits of the plurality of splits into a redundancy group; detecting a failed split of the redundancy group on a non-failed one of the SSDs; and rebuilding the failed split on a spare split. 12. The method of claim 11 comprising organizing the splits into logical slices of equal capacity corresponding to a track, a cylinder, a fraction of SSD capacity, or a percentage of SSD capacity. 13. The method of claim 11 wherein the logical addresses are LBAs (logical block addresses), and non-contiguous physical addresses map to the contiguous range of logical addresses of each split, and the non-contiguous physical addresses of each split map to a plurality of dies, and comprising organizing each split as a contiguous range of LBAs. 14. The method of claim 11 comprising processing pending defect data logs from the SSDs to identify splits with which reported defects are associated. 15. The method of claim 14 comprising selecting the spare split from an SSD having a greatest number of spare splits. 16. The method of claim 11 comprising attempting to repair the failed split. 17. The method of claim 16 comprising returning repaired splits to service as spare splits. 18. The method of claim 17 comprising marking unrepairable splits as lost splits. 19. The method of claim 11 comprising rebuilding all failed data splits of a failed SSD on a spare SSD. 20. The method of claim 19 comprising returning repaired SSDs to service and marking unrepairable SSDs as lost.

Assignees

Inventors

Classifications

  • Reconstruction on already foreseen single or plurality of spare disks · CPC title

  • Management of space entities, e.g. partitions, extents, pools · CPC title

  • in relation to data integrity, e.g. data losses, bit errors · CPC title

  • in relation to life time, e.g. increasing Mean Time Between Failures [MTBF] · CPC title

  • Non-volatile semiconductor memory arrays · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10719398B1 cover?
Each SSD in an array is divided into splits that are contiguous LBA ranges of equal capacity. The splits are managed as if they were sub-drives or logical drives and used in RAID groups or other forms of redundancy schemes. A processor external to the SSD collects pending defect data log from the SSDs and determines which splits are associated with reported defects. Failed splits are rebuilt in…
Who is the assignee on this patent?
Emc Ip Holding Co Llc
What technology area does this patent fall under?
Primary CPC classification G11C29/52. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 21 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).