Increasing data performance by transferring data between storage tiers using workload characteristics

US11656780B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11656780-B2
Application numberUS-201916242955-A
CountryUS
Kind codeB2
Filing dateJan 8, 2019
Priority dateJan 8, 2019
Publication dateMay 23, 2023
Grant dateMay 23, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A computer-implemented method, according to one embodiment, includes: receiving one or more suggestions which correspond to placement of data in storage, where the one or more suggestions are based on data workload characteristics. The one or more suggestions are used to identify portions of actual data stored in actual storage which correspond to the one or more suggestions. For each of the identified portions of the actual data stored in the first tier, the one or more suggestions is further used to determine whether to transfer the given identified portion of the actual data to the second tier. Moreover, in response to determining to transfer at least one of the identified portions of the actual data to the second tier, one or more instructions are sent to transfer the at least one of the identified portions of the actual data from the first tier to the second tier.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method, comprising: receiving one or more suggestions which correspond to placement of data in storage, wherein the one or more suggestions are based on data workload characteristics; using the one or more suggestions to identify portions of actual data stored in actual storage which correspond to the one or more suggestions, wherein the actual storage includes: a first tier having two or more shared nodes, and a second tier having at least one shared nothing node, wherein each of the two or more shared nodes are coupled to a same one or more data storage components in the actual storage, wherein the first tier is configured to stripe data across the two or more shared nodes, with a proviso that the second tier is not configured to stripe data across two or more of the at least one shared nothing node; for each of the identified portions of the actual data stored in the first tier, using the one or more suggestions to determine whether to transfer the given identified portion of the actual data to the second tier; and in response to determining to transfer at least one of the identified portions of the actual data to the second tier, sending one or more instructions to transfer the at least one of the identified portions of the actual data from the first tier to the second tier. 2. The computer-implemented method of claim 1 , wherein each of the at least one shared nothing node in the second tier includes special purpose hardware, wherein the special purpose hardware is a graphics processing unit. 3. The computer-implemented method of claim 2 , comprising: sending the one or more suggestions to an administrator for approval; in response to the administrator approving the one or more suggestions, receiving an indication that the one or more suggestions have been approved; in response to the administrator not approving the one or more suggestions, receiving one or more alternative suggestions which correspond to placement of data in storage; using the one or more alternative suggestions to identify portions of the actual data stored in the actual storage which correspond to the one or more alternative suggestions; for each of the identified portions of the actual data stored in the first tier, using the one or more alternative suggestions to determine whether to transfer the given identified portion of the actual data to the second tier; and in response to determining to transfer at least one of the identified portions of the actual data to the second tier, sending one or more instructions to transfer the at least one of the identified portions of the actual data from the first tier to the second tier. 4. The computer-implemented method of claim 1 , wherein each of the at least one shared nothing node implement a general parallel file system for shared nothing clusters (GPFS-SNC) mode. 5. The computer-implemented method of claim 1 , wherein the first and second tiers are included in a same namespace. 6. The computer-implemented method of claim 1 , comprising: using the one or more suggestions to identify portions of newly received data which correspond thereto; for each of the identified portions of the newly received data, using the one or more suggestions to determine whether to store the given identified portion of the newly received data in the second tier; in response to determining to store the given identified portion of the newly received data in the second tier, sending one or more instructions to store the given identified portion of the newly received data in the second tier; and in response to determining to not store the given identified portion of the newly received data in the second tier, sending one or more instructions to store the given identified portion of the newly received data in the first tier. 7. The computer-implemented method of claim 1 , wherein the data workload characteristics are generated using corresponding file types and/or corresponding portions of a file. 8. A computer program product comprising a computer readable storage medium having program instructions embodied therewith, the program instructions readable and/or executable by a processor to cause the processor to: receive, by the processor, one or more suggestions which correspond to placement of data in storage, wherein the one or more suggestions are based on data workload characteristics; use, by the processor, the one or more suggestions to identify portions of actual data stored in actual storage which correspond to the one or more suggestions, wherein the actual storage includes: a first tier having two or more shared nodes, and a second tier having at least one shared nothing node, wherein each of the two or more shared nodes are coupled to a same one or more data storage components in the actual storage, wherein the first tier is configured to stripe data across the two or more shared nodes, with a proviso that the second tier is not configured to stripe data across two or more of the at least one shared nothing node; for each of the identified portions of the actual data stored in the first tier, use, by the processor, the one or more suggestions to determine whether to transfer the given identified portion of the actual data to the second tier; and in response to determining to transfer at least one of the identified portions of the actual data to the second tier, send, by the processor, one or more instructions to transfer the at least one of the identified portions of the actual data from the first tier to the second tier. 9. The computer program product of claim 8 , wherein each of the at least one shared nothing node in the second tier includes special purpose hardware, wherein the special purpose hardware is a solid state drive cache, wherein each of the at least one shared nothing node are coupled to a different data storage component in the second tier of the actual storage by a respective Peripheral Component Interconnect Express bus. 10. The computer program product of claim 8 , the program instructions readable and/or executable by the processor to cause the processor to: send, by the processor, the one or more suggestions to an administrator for approval; in response to the administrator approving the one or more suggestions, receive, by the processor, an indication that the one or more suggestions have been approved, and one or more supplemental suggestions, wherein the one or more supplemental suggestions are used along with the one or more suggestions to identify the portions of actual data stored in the actual storage; in response to the administrator not approving the one or more suggestions, receive, by the processor, one or more alternative suggestions which correspond to placement of data in storage; use, by the processor, the one or more alternative suggestions to identify portions of the actual data stored in the actual storage which correspond to the one or more alternative suggestions; for each of the identified portions of the actual data stored in the first tier, use, by the processor, the one or more alternative suggestions to determine whether to transfer the given identified portion of the actual data to the second tier; and in response to determining to transfer at least one of the identified portions of the actual data to the second tier, send, by the processor, one or more instructions to transfer the at least one of the identified portions of the actual data from the first tier to the second tier. 11. The computer program product of claim 8 , wherein the first and second tiers are included in a same namespace. 12. The computer program product of claim 8 , the program instructions r

Assignees

Inventors

Classifications

  • by facilitating the interaction with a user or administrator · CPC title

  • G06F3/0649Primary

    Lifecycle management · CPC title

  • Improving or facilitating administration, e.g. storage management · CPC title

  • Hybrid storage combining heterogeneous device types, e.g. hierarchical storage, hybrid arrays · CPC title

  • Distributed or networked storage systems, e.g. storage area networks [SAN], network attached storage [NAS] · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11656780B2 cover?
A computer-implemented method, according to one embodiment, includes: receiving one or more suggestions which correspond to placement of data in storage, where the one or more suggestions are based on data workload characteristics. The one or more suggestions are used to identify portions of actual data stored in actual storage which correspond to the one or more suggestions. For each of the id…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F3/0649. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 23 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).