Storage system configuration based on workload characteristics and performance metrics

US11354061B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11354061-B2
Application numberUS-202016746238-A
CountryUS
Kind codeB2
Filing dateJan 17, 2020
Priority dateJan 17, 2020
Publication dateJun 7, 2022
Grant dateJun 7, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

One or more aspects of the present disclosure relate to providing storage system configuration recommendations. System configurations of one or more storage devices can be determined based on their respective collected telemetry information. Performance of storage devices having different system configurations can be predicted based on one or more of: the collected telemetry information and each of the different system configurations. In response to receiving one or more requested performance characteristics and workload conditions, one or more recommended storage device configurations can be provided for each request based on the predicted performance characteristics, the requested performance characteristics, and the workload conditions.

First claim

Opening claim text (preview).

What is claimed is: 1. An apparatus comprising at least one processor configured to: determine system configurations of each of one or more storage devices based on their respective collected telemetry information; predict performance characteristics of each of a plurality of storage devices having different system configurations based on one or more of: the collected telemetry information and each of the different system configurations, wherein the apparatus is further configured to: identify input/output (I/O) data types related to an I/O workload received by a storage array, generate response time (RT) data pairs, mapping each uniquely available system configuration and one or more of the I/O workload and I/O data types to respective response times, and in response to receiving one or more requested performance characteristics and current workload conditions, provide one or more recommended storage device configurations for each request based on the predicted performance characteristics, the requested performance characteristics, and the workload conditions. 2. The apparatus of claim 1 , wherein the telemetry information is collected from at least one or more of: one or more field-deployed storage devices or one or more lab operated storage device. 3. The apparatus of claim 1 , wherein collecting the telemetry information includes controlling a telemetry collection device in each storage device to collect and transmit the telemetry information during one or more predetermined time-windows. 4. The apparatus of claim 3 , wherein: the telemetry information includes one or more of: each storage device's system configuration, each storage device's input/output (I/O) workloads, and each storage device's performance characteristics associated with the workload conditions; and wherein each storage device's performance characteristics are defined at least by their respective response times. 5. The apparatus of claim 4 further configured to: segment the telemetry information based on one or more of: industry, workload types, performance characteristics, input/output (I/O) operations, and service level (SL) performance tiers of each storage device; wherein each storage device's response times are segmented into each storage device's SL performance tiers; and wherein workload conditions are characterized at least by amounts of read and write (R/W) I/O operations of each storage device's workload. 6. The apparatus of claim 4 further configured to: for one or more sets of storage devices, predict performance characteristics by at least using one or more prediction models built for each storage device, wherein each prediction model is built by processing each storage device's telemetry information using one or more machine learning techniques; and wherein: the one or more machine learning techniques includes at least a non-linear regression learning technique, amongst other machine learning techniques, and the non-linear regression technique includes at least one or more of a random forest and a neural network, amongst other machine learning techniques. 7. The apparatus of claim 6 further configured to: predict the performance characteristics based on at least one or more of each storage device's system configurations and workloads present in the collected telemetry information, amongst other data; process the telemetry information using the one or more machine learning techniques augmented with each storage device's uncollected telemetry information and validated telemetry information. 8. The apparatus of claim 7 , wherein each storage device's uncollected and validated telemetry information includes at least one or more of each storage device's validated system configurations and validated workloads not present in each storage device's collected telemetry information. 9. The apparatus of claim 1 further configured to: for one or more sets of storage device, build one or more system configuration prediction models to determine probabilities of each recommended storage device configuration meeting each storage device's SL performance tiers under one or more different workload conditions. 10. The apparatus of claim 9 , wherein: each different workload condition is based on one or more anticipated workloads; and each of the one or more anticipated workloads is predicted using one or more machine learning techniques to process one or more of: a customer type associated with each requested performance characteristic, the collected telemetry information, and uncollected telemetry information. 11. A method comprising: determining system configurations of each of one or more storage devices based on their respective collected telemetry information; predicting performance characteristics of each of a plurality of storage devices having different system configurations based on one or more of: the collected telemetry information and each of the different system configurations, wherein predicting performance characteristics further includes: identifying input/output (I/O) data types related to an I/O workload received by a storage array, generating response time (RT) data pairs, mapping each uniquely available system configuration and one or more of the I/O workload and I/O data types to respective response times, and in response to receiving one or more requested performance characteristics and current workload conditions, providing one or more recommended storage device configurations for each request based on the predicted performance characteristics, the requested performance characteristics, and the workload conditions. 12. The method of claim 11 , wherein the telemetry information is collected from at least one or more of: one or more field-deployed storage devices or one or more lab operated storage devices. 13. The method of claim 11 , wherein collecting the telemetry information includes controlling a telemetry collection device in each storage device to collect and transmit the telemetry information during one or more predetermined time-windows. 14. The method of claim 13 , wherein: the telemetry information includes one or more of: each storage device's system configuration, each storage device's input/output (I/O) workloads, and each storage device's performance characteristics associated with the workload conditions; and wherein each storage device's performance characteristics are defined at least by their respective response times. 15. The method of claim 14 further comprising: segmenting the telemetry information based on one or more of: industry, workload types, performance characteristics, input/output (I/O) operations, and service level (SL) performance tiers of each storage device; wherein each storage device's response times are segmented into each storage device's SL performance tiers; and wherein workload conditions are characterized at least by amounts of read and write (R/W) I/O operations of each storage device's workload. 16. The method of claim 14 , wherein: predicting each storage device's performance characteristics includes using one or more prediction models built for each storage device, wherein each prediction model is built by processing each storage device's telemetry information using one or more machine learning techniques; the one or more machine learning techniques includes at least a non-linear regression learning technique, amongst other machine learning techniques; and the non-linear regression technique includes at least one or more of a random forest and a neural network, amongst other machine learning techniques.

Assignees

Inventors

Classifications

  • Dynamic search techniques; Heuristics; Dynamic trees; Branch-and-bound · CPC title

  • Probabilistic graphical models, e.g. probabilistic networks · CPC title

  • Combinations of networks · CPC title

  • Feedforward networks · CPC title

  • Supervised learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11354061B2 cover?
One or more aspects of the present disclosure relate to providing storage system configuration recommendations. System configurations of one or more storage devices can be determined based on their respective collected telemetry information. Performance of storage devices having different system configurations can be predicted based on one or more of: the collected telemetry information and eac…
Who is the assignee on this patent?
Emc Ip Holding Co Llc
What technology area does this patent fall under?
Primary CPC classification G06F3/0653. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jun 07 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 9 related publications on this page (citations in our corpus or others sharing the same primary CPC).