Systems and methods for selecting a set of storage nodes for use in reconstructing data on a faulted node in an erasure-coded system
US-10275312-B1 · Apr 30, 2019 · US
US11099954B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11099954-B2 |
| Application number | US-201916655382-A |
| Country | US |
| Kind code | B2 |
| Filing date | Oct 17, 2019 |
| Priority date | Oct 31, 2018 |
| Publication date | Aug 24, 2021 |
| Grant date | Aug 24, 2021 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Techniques provide for rebuilding data. Such techniques involve: obtaining health status information related to a first disk of a storage system, the first disk being divided into a plurality of disk units, and the health status information indicating a failure of a first disk unit of the plurality of disk units; determining a data block stored in the first disk unit based on a mapping between data blocks for the storage system and storage locations; and rebuilding the data block into a second disk of the storage system when maintaining accessibility of other data blocks in other disk units of the first disk than the first disk unit. Accordingly, it is possible to improve the data rebuilding efficiency when a disk fails partly and to continue utilizing the storage space portion in the disk that is not failed, without making the disk be offline temporarily.
Opening claim text (preview).
We claim: 1. A method of rebuilding data, comprising: obtaining health status information related to a first disk of a storage system, wherein the first disk comprises a solid state disk having multiple channels, the first disk being divided into a plurality of disk units, wherein each one of the plurality of disk units corresponds to a different one of the multiple channels of the solid state disk, and the health status information indicating a failure of a first disk unit of the plurality of disk units; determining a data block stored in the first disk unit based on a mapping between data blocks for the storage system and storage locations; and rebuilding the data block into a second disk of the storage system, while maintaining accessibility of other data blocks in other disk units of the first disk than the first disk unit. 2. The method of claim 1 , further comprising: marking the first disk unit as inaccessible so as to prevent the first disk unit from being reallocated for storing data. 3. The method of claim 1 , wherein the storage system comprises a distribution storage system. 4. The method of claim 3 , wherein the first and second disks are located at different physical nodes of the distribution storage system. 5. The method of claim 1 , wherein a copy of the data block is stored in a third disk of the plurality of disks, the first and third disks being distributed at different physical nodes, and wherein rebuilding the data block into the second disk comprises: rebuilding the data block into the second disk with the copy. 6. The method of claim 1 , wherein the data block and a plurality of further data blocks are comprised in a data chunk, the plurality of further data blocks being stored in further disks different from the first disk in the storage system, and wherein rebuilding the data block into the second disk comprising: rebuilding the data block into the second disk with the plurality of further data blocks. 7. The method of claim 1 , wherein obtaining the health status information comprises: receiving the health status information from disk firmware associated with the first disk. 8. The method of claim 1 , wherein the availability of the other data blocks of other disk units of the first disk is maintained simultaneously with the rebuilding of the data block into the second disk of the storage system without making the first disk offline. 9. The method of claim 1 , wherein the health status information further indicates that the disk units of the first disk other than the first disk unit are healthy. 10. An electronic device, comprising: a processor; and a memory coupled to the processor and having instructions stored therein, the instructions, when executed by the processor, causing the device to perform acts including: obtaining health status information related to a first disk of a storage system, wherein the first disk comprises a solid state disk having multiple channels, the first disk being divided into a plurality of disk units, wherein each one of the plurality of disk units corresponds to a different one of the multiple channels of the solid state disk, and the health status information indicating a failure of a first disk unit of the plurality of disk units; determining a data block stored in the first disk unit based on a mapping between data blocks for the storage system and storage locations; and rebuilding the data block into a second disk of the storage system, while maintaining accessibility of other data blocks in other disk units of the first disk than the first disk unit. 11. The device of claim 10 , wherein the acts further comprise: marking the first disk unit as inaccessible so as to prevent the first disk unit from being reallocated for storing data. 12. The device of claim 10 , wherein the storage system comprises a distribution storage system. 13. The device of claim 12 , wherein the first and second disks are located at different physical nodes of the distribution storage system. 14. The device of claim 10 , wherein a copy of the data block is stored in a third disk of the plurality of disks, the first and third disks being distributed at different physical nodes, and wherein rebuilding the data block into the second disk includes: rebuilding the data block into the second disk with the copy. 15. The device of claim 10 , wherein the data block and a plurality of further data blocks are comprised in a data chunk, the plurality of further data blocks being stored in further disks different from the first disk in the storage system, and wherein rebuilding the data block into the second disk includes: rebuilding the data block into the second disk using the plurality of further data blocks. 16. The device of claim 7 , wherein obtaining the health status information comprises: receiving the health status information from disk firmware associated with the first disk. 17. A computer program product having a non-transitory computer readable medium which stores a set of instructions that, when carried out by a computerized device, causes the computerized device to: obtain health status information related to a first disk of a storage system, wherein the first disk comprises a solid state disk having multiple channels, the first disk being divided into a plurality of disk units, wherein each one of the plurality of disk units corresponds to a different one of the multiple channels of the solid state disk, and the health status information indicating a failure of a first disk unit of the plurality of disk units; determine a data block stored in the first disk unit based on a mapping between data blocks for the storage system and storage locations; and rebuild the data block into a second disk of the storage system, while maintaining accessibility of other data blocks in other disk units of the first disk than the first disk unit. 18. The computer program product of claim 17 , wherein the computer executable instructions, when executed, further cause the device to: mark the first disk unit as inaccessible so as to prevent the first disk unit from being reallocated for storing data. 19. The computer program product of claim 17 , wherein the storage system includes a distribution storage system. 20. The computer program product of claim 19 , wherein the first and second disks are located at different physical nodes of the distribution storage system. 21. The computer program product of claim 17 , wherein a copy of the data block is stored in a third disk of the plurality of disks, the first and third disks being distributed at different physical nodes, and wherein the computer executable instructions, when executed, further cause the device to: rebuild the data block into the second disk with the copy. 22. The computer program product of claim 17 , wherein the data block and a plurality of further data blocks are comprised in a data chunk, the plurality of further data blocks being stored in further disks different from the first disk in the storage system, and wherein the computer executable instructions, when executed, further cause the device to: rebuild the data block into the second disk with the plurality of further data blocks. 23. The computer program product of claim 17 , wherein the computer executable instructions, when executed, further cause the device to: receive the health status information from disk firmware associated with the first disk.
Redundant storage or storage space (G06F11/2056 takes precedence) · CPC title
Root cause analysis, i.e. error or fault diagnosis (in a hardware test environment G06F11/22; in a software test environment G06F11/36) · CPC title
Securing storage systems · CPC title
Magnetic disk device · CPC title
Management of blocks · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.