Neural network model for predicting usage in a hyper-converged infrastructure

US12106203B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12106203-B2
Application numberUS-201816033460-A
CountryUS
Kind codeB2
Filing dateJul 12, 2018
Priority dateJul 12, 2018
Publication dateOct 1, 2024
Grant dateOct 1, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods for analyzing the usage of a set of workloads in a hyper-converged infrastructure are disclosed. A neural network model is trained based upon historical usage data of the set of workloads. The neural network model can make usage predictions of future demands on the set of workloads to minimize over-allocation or under-allocation of resources to the workloads.

First claim

Opening claim text (preview).

Therefore, the following is claimed: 1. A method, comprising: obtaining, by at least one computing device, a training dataset for a neural network executed by the at least one computing device, the training dataset comprising historical data associated with resource consumption of a set of workloads within a hyper-converged infrastructure, wherein the neural network comprises a long short term memory recurrent neural network operating on a time window input, the neural network further utilizing gates comprising a sigmoid neural net layer and a pairwise multiplication operation; training the neural network based upon the training dataset, the training dataset comprising historical usage of the set of workloads, using a regression analysis applying a least-square error function; monitoring, by the at least one computing device, usage demand for a particular time period, the usage demand associated with the set of workloads within the hyper-converged infrastructure executed on a plurality of host devices; providing the usage demand to the neural network; generating a usage prediction based upon the usage demand, the usage prediction comprising a predicted resource consumption for the set of workloads over a time period; adjusting an allocation of physical resources within the hyper-converged infrastructure for the set of workloads based upon the usage prediction; and validating the allocation of the physical resources by performing an ongoing validation of the usage prediction relative to an actual usage demand being placed upon the set of workloads by calculating a similarity measure utilizing a dynamic time warping signal processing algorithm, wherein the neural network can be retrained in response to the similarity measure indicating a difference between the actual usage demand and the usage prediction being greater than a threshold. 2. The method of claim 1 , wherein the resource consumption comprises memory, central processing unit (CPU), and storage usage of the set of workloads. 3. The method of claim 1 , wherein the neural network is trained utilizing a regression model. 4. The method of claim 1 , further comprising: obtaining, using the at least one computing device, the actual usage demand for the time period. 5. The method of claim 1 , further comprising: retraining the neural network based upon the similarity between the actual usage demand and the usage prediction. 6. The method of claim 1 , wherein adjusting the allocation of physical resources within the hyper-converged infrastructure comprises increasing or decreasing CPU, memory, or storage resources based upon the usage prediction. 7. The method of claim 1 , wherein the set of workloads comprises one or more workloads, each of the one or more workloads comprising a virtual machine (VM) or an application running on a VM managed by a hypervisor using physical resources of one or more hosts. 8. A system comprising: at least one computing device; an application executed by the at least one computing device, the application causing the at least one computing device to at least: obtain a training dataset for a neural network executed by the at least one computing device, the training dataset comprising historical data associated with resource consumption of a set of workloads within a hyper-converged infrastructure, wherein the neural network comprises a long short term memory recurrent neural network operating on a time window input, the neural network further utilizing gates comprising a sigmoid neural net layer and a pairwise multiplication operation; train the neural network based upon the training dataset, the training dataset comprising historical usage of the set of workloads, using a regression analysis applying a least-square error function; monitor usage demand for a particular time period, the usage demand associated with the set of workloads within the hyper-converged infrastructure executed on a plurality of host devices; provide the usage demand to the neural network; generate a usage prediction based upon the usage demand, the usage prediction comprising a predicted resource consumption for the set of workloads over a time period; adjust an allocation of physical resources within the hyper-converged infrastructure for the set of workloads based upon the usage prediction; and validate the allocation of the physical resources by performing an ongoing validation of the usage prediction relative to an actual usage demand being placed upon the set of workloads by calculating a similarity measure utilizing a dynamic time warping signal processing algorithm, wherein the neural network can be retrained in response to the similarity measure indicating a difference between the actual usage demand and the usage prediction being greater than a threshold. 9. The system of claim 8 , wherein the resource consumption comprises memory, central processing unit (CPU), and storage usage of the set of workloads. 10. The system of claim 8 , wherein the neural network is trained utilizing a regression model. 11. The system of claim 8 , wherein the application causes the at least one computing device to at least: obtain the actual usage demand for the time period. 12. The system of claim 8 , wherein the application causes the at least one computing device to at least: retrain the neural network based upon the similarity between the actual usage demand and the usage prediction. 13. The system of claim 8 , wherein the allocation of physical resources within the hyper-converged infrastructure is adjusted by increasing or decreasing CPU, memory, or storage resources based upon the usage prediction. 14. The system of claim 8 , wherein the set of workloads comprises one or more workloads, each of the one or more workloads comprising a virtual machine (VM) or an application running on a VM managed by a hypervisor using physical resources of one or more hosts. 15. A non-transitory computer-readable medium embodying a program executed by at least one computing device, the program causing the at least one computing device to at least: obtain a training dataset for a neural network executed by the at least one computing device, the training dataset comprising historical data associated with resource consumption of a set of workloads within a hyper-converged infrastructure, wherein the neural network comprises a long short term memory recurrent neural network operating on a time window input, the neural network further utilizing gates comprising a sigmoid neural net layer and a pairwise multiplication operation; train the neural network based upon the training dataset, the training dataset comprising historical usage of the set of workloads, using a regression analysis applying a least-square error function; monitor usage demand for a particular time period, the usage demand associated with the set of workloads within the hyper-converged infrastructure executed on a plurality of host devices; provide the usage demand to the neural network; generate a usage prediction based upon the usage demand, the usage prediction comprising a predicted resource consumption for the set of workloads over a time period; adjust an allocation of physical resources within the hyper-converged infrastructure for the set of workloads based upon the usage prediction; and validate the allocation of the physical resources by performing an ongoing validation of the usage prediction relative to an actual usage demand being placed upon the set of workloads by calculating a similarity measure utilizing a dynamic time warping signal processing algorithm, wherein the neural network

Assignees

Inventors

Classifications

  • Supervised learning · CPC title

  • characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU] · CPC title

  • Machine learning · CPC title

  • G06N3/08Primary

    Learning methods · CPC title

  • considering the load · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12106203B2 cover?
Systems and methods for analyzing the usage of a set of workloads in a hyper-converged infrastructure are disclosed. A neural network model is trained based upon historical usage data of the set of workloads. The neural network model can make usage predictions of future demands on the set of workloads to minimize over-allocation or under-allocation of resources to the workloads.
Who is the assignee on this patent?
VMware LLC
What technology area does this patent fall under?
Primary CPC classification G06N3/08. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Oct 01 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 10 related publications on this page (citations in our corpus or others sharing the same primary CPC).