Systems and methods for persistent cache logging
US-9842053-B2 · Dec 12, 2017 · US
US10372560B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-10372560-B2 |
| Application number | US-201715447880-A |
| Country | US |
| Kind code | B2 |
| Filing date | Mar 2, 2017 |
| Priority date | Jun 22, 2016 |
| Publication date | Aug 6, 2019 |
| Grant date | Aug 6, 2019 |
Official abstract text for this publication.
A method, computer program product, and computer system for monitoring health of at least one storage device of a cache in a clustered system. A recovery journal may be maintained, wherein the recovery journal may identify whether one or more chunks of data stored in the cache have been dumped from the at least one storage device to persistent storage in the clustered system. A state of the at least one storage device may be determined based upon, at least in part, the health of the at least one storage device. A recovery action may be performed on the one or more chunks of data stored in the at least one storage device based upon, at least in part, the state of the at least one storage device.
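As an illustration of the recovery journal described in the abstract, the following is a minimal sketch, not the patented implementation. It assumes the journal can be modeled as an in-memory set of chunk identifiers that are cached but not yet dumped to persistent storage. The class name `RecoveryJournal` and all of its method names are hypothetical and do not appear in the patent.

```python
class RecoveryJournal:
    """Hypothetical journal: tracks cached chunks not yet dumped to persistent storage."""

    def __init__(self):
        # Chunk ids that are in the cache but have not been dumped yet.
        self._pending = set()

    def record_cached(self, chunk_id):
        """Note that a chunk was written to the cache and still awaits a dump."""
        self._pending.add(chunk_id)

    def record_dumped(self, chunk_id):
        """Note that a chunk reached persistent storage; drop it from the journal."""
        self._pending.discard(chunk_id)

    def is_dumped(self, chunk_id):
        """True if the chunk is not awaiting a dump.

        In this sketch, a chunk that was never cached also returns True.
        """
        return chunk_id not in self._pending

    def pending(self):
        """Chunk ids that would need recovery if the cache device failed now."""
        return sorted(self._pending)


journal = RecoveryJournal()
journal.record_cached("chunk-a")
journal.record_cached("chunk-b")
journal.record_dumped("chunk-a")
print(journal.pending())  # ['chunk-b']
```

Keeping only the not-yet-dumped chunks in the journal means that whatever remains after a failure is exactly the set of chunks a recovery action has to consider.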
Opening claim text (preview).
What is claimed is:
1. A computer-implemented method comprising: monitoring health of at least one storage device of a cache in a clustered system; maintaining a recovery journal, wherein the recovery journal identifies whether one or more chunks of data stored in the cache have been dumped from the at least one storage device to persistent storage in the clustered system; determining a state of the at least one storage device based upon, at least in part, the health of the at least one storage device, wherein the state of the at least one storage device is determined to be a second state if the at least one storage device is incapable of completing I/O requests; and performing a recovery action on the one or more chunks of data stored in the at least one storage device based upon, at least in part, the state of the at least one storage device, wherein the recovery action includes: invalidating at least a portion of the one or more chunks of data stored in the at least one storage device if the state of the at least one storage device is the second state; and recovering at least a portion of the one or more chunks of data that remain in the recovery journal.
2. The computer-implemented method of claim 1 wherein the state of the at least one storage device is determined to be a first state if at least one I/O request directed to the at least one storage device fails.
3. The computer-implemented method of claim 2 wherein the recovery action includes attempting to dump each of the one or more chunks of data stored in the at least one storage device to the persistent storage if the state of the at least one storage device is the first state, wherein each chunk of data successfully dumped is removed from the recovery journal.
4. The computer-implemented method of claim 3 wherein the recovery action further includes recovering at least a portion of the one or more chunks of data that remain in the recovery journal.
5. The computer-implemented method of claim 1 wherein the recovery action includes restoring the recovery journal from a file system on the at least one storage device if the recovery journal is one of corrupted and lost.
6. A computer program product residing on a non-transitory computer readable storage medium having a plurality of instructions stored thereon which, when executed across one or more processors, cause at least a portion of the one or more processor to perform operations comprising: monitoring health of at least one storage device of a cache in a clustered system; maintaining a recovery journal, wherein the recovery journal identifies whether one or more chunks of data stored in the cache have been dumped from the at least one storage device to persistent storage in the clustered system; determining a state of the at least one storage device based upon, at least in part, the health of the at least one storage device, wherein the state of the at least one storage device is determined to be a first state if at least one I/O request directed to the at least one storage device fails; and performing a recovery action on the one or more chunks of data stored in the at least one storage device based upon, at least in part, the state of the at least one storage device, wherein the recovery action includes attempting to dump each of the one or more chunks of data stored in the at least one storage device to the persistent storage if the state of the at least one storage device is the first state, wherein each chunk of data successfully dumped is removed from the recovery journal.
7. The computer program product of claim 6 wherein the recovery action further includes recovering at least a portion of the one or more chunks of data that remain in the recovery journal.
8. The computer program product of claim 6 wherein the state of the at least one storage device is determined to be a second state if the at least one storage device is incapable of completing I/O requests.
9. The computer program product of claim 8 wherein the recovery action further includes: invalidating at least a portion of the one or more chunks of data stored in the at least one storage device if the state of the at least one storage device is the second state; and recovering at least a portion of the one or more chunks of data that remain in the recovery journal.
10. The computer program product of claim 6 wherein the recovery action includes restoring the recovery journal from a file system on the at least one storage device if the recovery journal is one of corrupted and lost.
11. A computing system including one or more processors and one or more memories configured to perform operations comprising: monitoring health of at least one storage device of a cache in a clustered system; maintaining a recovery journal, wherein the recovery journal identifies whether one or more chunks of data stored in the cache have been dumped from the at least one storage device to persistent storage in the clustered system; determining a state of the at least one storage device based upon, at least in part, the health of the at least one storage device, wherein the state of the at least one storage device is determined to be a second state if the at least one storage device is incapable of completing I/O requests; and performing a recovery action on the one or more chunks of data stored in the at least one storage device based upon, at least in part, the state of the at least one storage device, wherein the recovery action includes: invalidating at least a portion of the one or more chunks of data stored in the at least one storage device if the state of the at least one storage device is the second state; and recovering at least a portion of the one or more chunks of data that remain in the recovery journal.
12. The computing system of claim 11 wherein the state of the at least one storage device is determined to be a first state if at least one I/O request directed to the at least one storage device fails.
13. The computing system of claim 12 wherein the recovery action includes attempting to dump each of the one or more chunks of data stored in the at least one storage device to the persistent storage if the state of the at least one storage device is the first state, wherein each chunk of data successfully dumped is removed from the recovery journal.
14. The computing system of claim 13 wherein the recovery action further includes recovering at least a portion of the one or more chunks of data that remain in the recovery journal.
15. The computing system of claim 11 wherein the recovery action includes restoring the recovery journal from a file system on the at least one storage device if the recovery journal is one of corrupted and lost.
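The state determination and recovery actions recited in the claims can be sketched as follows. This is a hedged illustration, not the patented implementation. It models the recovery journal as a set of chunk ids not yet dumped. The names `DeviceState`, `CacheDevice`, `determine_state`, and `recover` are hypothetical. The claims do not say how chunks remaining in the journal are recovered, so the `replica` mapping used as an alternate data source is an assumption made only for this example.

```python
from dataclasses import dataclass, field
from enum import Enum


class DeviceState(Enum):
    HEALTHY = "healthy"
    FIRST = "first"    # at least one I/O request to the device failed
    SECOND = "second"  # device is incapable of completing I/O requests


@dataclass
class CacheDevice:
    """Hypothetical stand-in for one storage device of the cache."""
    chunks: dict = field(default_factory=dict)    # chunk_id -> bytes
    failed_ios: int = 0                           # health counter (assumed)
    responsive: bool = True                       # False: no I/O completes
    unreadable: set = field(default_factory=set)  # chunk ids whose reads fail

    def read(self, chunk_id):
        if not self.responsive or chunk_id in self.unreadable:
            raise OSError(f"cannot read chunk {chunk_id}")
        return self.chunks[chunk_id]


def determine_state(device):
    """Map monitored health to a state, following the first/second state wording."""
    if not device.responsive:
        return DeviceState.SECOND
    if device.failed_ios > 0:
        return DeviceState.FIRST
    return DeviceState.HEALTHY


def recover(device, journal, persistent, replica):
    """Perform a recovery action chosen by the device state.

    journal: set of chunk ids not yet dumped to persistent storage.
    replica: assumed alternate source of chunk data (not from the patent).
    """
    state = determine_state(device)
    if state is DeviceState.FIRST:
        # Attempt to dump every journaled chunk; remove each success from the journal.
        for chunk_id in sorted(journal):
            try:
                persistent[chunk_id] = device.read(chunk_id)
            except OSError:
                continue
            journal.discard(chunk_id)
    elif state is DeviceState.SECOND:
        # The device cannot serve I/O, so its cached chunks are invalidated.
        device.chunks.clear()
    if state is not DeviceState.HEALTHY:
        # Recover whatever still remains in the journal from the assumed replica.
        for chunk_id in sorted(journal):
            if chunk_id in replica:
                persistent[chunk_id] = replica[chunk_id]
                journal.discard(chunk_id)
    return state
```

In this sketch a chunk leaves the journal only once its data is in `persistent`, so chunks that can neither be dumped nor recovered stay listed for a later attempt.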
Solving problems relating to consistency · CPC title
involving logging of persistent data for recovery · CPC title
where the computing system component is a storage system, e.g. DASD based or network based (digital input from or digital output to record carriers G06F3/06; digital recording or reproducing G11B20/18; for distributed storage of data in networks, e.g. transport arrangements for network file system [NFS], storage area networks [SAN] or network attached storage [NAS], H04L67/1097) · CPC title
Monitoring specific for caches · CPC title