Resource allocation in a cloud computing system based on predictions of workload probability parameters

US12106147B2 · US · B2

Patent metadata
Publication number: US-12106147-B2
Application number: US-202117352634-A
Country: US
Kind code: B2
Filing date: Jun 21, 2021
Priority date: Jun 21, 2021
Publication date: Oct 1, 2024
Grant date: Oct 1, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Disclosed herein are system, method, and computer program product embodiments for allocating resources based on predictions of workload probability parameters. The method can include collecting a first set of historical workload data generated by operating a first set of one or more applications at a first number of past time instances; predicting probability parameters of a second set of future workload data for operating a second set of one or more applications at a second number of future time instances; and determining future resources allocated to operating the second set of one or more applications for the second number of future time instances, based on allocated current resources, a lower bound of resources to satisfy a quality of service (QoS) for operating the second set of one or more applications, an upper bound of resources to satisfy the QoS, and the predicted probability parameters.
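As a rough illustration of the collecting and predicting steps described in the abstract, the sketch below estimates probability parameters (a mean and a standard deviation) from a short window of past workload samples and reuses them for each future time instance. This is only a stand-in under stated assumptions: the sample statistics replace whatever forecasting model the patent actually uses, and the names `WorkloadParams`, `predict_params`, and `cpu_history`, as well as the sample values, are hypothetical.

```python
from statistics import mean, stdev
from typing import List, NamedTuple, Sequence


class WorkloadParams(NamedTuple):
    """Hypothetical container for predicted probability parameters."""
    mean: float
    std: float


def predict_params(history: Sequence[float], horizon: int) -> List[WorkloadParams]:
    """Naive stand-in forecaster (an assumption, not the patented model).

    Reuses the sample mean and sample standard deviation of the
    historical workload for every future time instance.
    """
    if len(history) < 2:
        raise ValueError("need at least two past time instances")
    if horizon < 1:
        raise ValueError("horizon must be at least 1")
    mu = mean(history)
    sigma = stdev(history)  # sample standard deviation (n - 1 denominator)
    return [WorkloadParams(mu, sigma) for _ in range(horizon)]


# Made-up CPU usage (percent) at 5 past time instances.
cpu_history = [40.0, 50.0, 60.0, 50.0, 50.0]

# Predict parameters for 5 future time instances.
forecast = predict_params(cpu_history, horizon=5)
```

A learned sequence model could replace `predict_params` without changing the interface, as long as it still returns one set of parameters per future time instance.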

First claim

Opening claim text (preview).

What is claimed is:

1. A computer-implemented method for operating applications on a computing system, comprising: collecting, by a monitoring operator of a workload manager operated by the computing system, a first set of historical workload data generated by operating a first set of one or more applications at a first number of past time instances, wherein the first set of historical workload data are collected for a set of computing nodes within the computing system, and wherein the monitoring operator monitors historical resources used by the set of computing nodes to operate the first set of one or more applications at the first number of past time instances; predicting, by the workload manager, probability parameters including a probability density function of a second set of future workload data for operating a second set of one or more applications at a second number of future time instances by the set of computing nodes; and determining, by the workload manager, future resources allocated to operating the second set of one or more applications for the second number of future time instances, based on allocated current resources, a lower bound number of computing nodes to satisfy a quality of service (QOS) for operating the second set of one or more applications, an upper bound number of computing nodes to satisfy the QoS, and the predicted probability parameters, wherein the future resources are determined as a solution for reducing a first probability for allocating a third number of computing nodes over the upper bound number of computing nodes and reducing a second probability for allocating a fourth number of computing nodes below the lower bound number of computing nodes based on an equation related to the first probability and the second probability; scheduling the determined future resources of the computing system for the second number of future time instances; and operating the second set of one or more applications on the scheduled future resources to generate workload data.

2. The method of claim 1, wherein the first set of one or more applications include same applications as the second set of one or more applications.

3. The method of claim 1, wherein the second set of one or more applications is different from the first set of one or more applications, and a first application of the first set of one or more applications is an old version of a second application of the second set of one or more applications.

4. The method of claim 1, wherein the predicting the probability parameters includes predicting the probability parameters of the second set of future workload data based on recurrent neural networks (RNN).

5. The method of claim 1, wherein the probability parameters include an average of the second set of future workload data, or a standard deviation of the second set of future workload data.

6. The method of claim 1, wherein the first set of historical workload data and the second set of future workload data include a CPU usage, a memory utilization, a network bandwidth, a latency, a delay, or a throughput.

7. The method of claim 1, wherein the first number of past time instances includes 5 past time instances, and the second number of future time instances includes 5 future time instances.

8. The method of claim 1, wherein the first number of past time instances or the second number of future time instances includes a first time instance and a second time instance separated from the first time instance by 1 hour.

9. The method of claim 1, wherein the first set of historical workload data and the second set of future workload data follow a Gaussian distribution or a Poisson distribution.

10. The method of claim 1, wherein the future resources and the current resources include a number of computing nodes of the computing system, a number of storage units of the computing system, or an allocation of network bandwidth.

11. The method of claim 1, wherein the determining future resources includes determining an offset resource to increase or decrease the current resources.

12. A computing system, comprising: a storage device configured to store a first set of historical workload data generated by operating one or more applications at a first number of past time instances, wherein the first set of historical workload data are collected by a monitoring operator for a set of computing nodes within the computing system, and wherein the monitoring operator monitors historical resources used by the set of computing nodes to operate the one or more applications at the first number of past time instances; at least one processor coupled to the storage device; a workload forecasting operator operated by the at least one processor, configured to predict probability parameters including a probability density function of a second set of future workload data for operating the one or more applications at a second number of future time instances by the set of computing nodes; and a resource allocation scaler operated by the at least one processor, configured to determine future resources allocated to operating the one or more applications for the second number of future time instances, based on allocated current resources, a lower bound number of computing nodes to satisfy a quality of service (QOS) for operating the one or more applications, an upper bound number of computing nodes to satisfy the QoS, and the predicted probability parameters, wherein the future resources are determined as a solution for reducing a first probability for allocating a third number of computing nodes over the upper bound number of computing nodes and reducing a second probability for allocating a fourth number of computing nodes below the lower bound number of computing nodes based on an equation related to the first probability and the second probability; and a scheduler operated by the at least one processor, configured to schedule the determined future resources of the computing system for the second number of future time instances; and a processing operator operated by the at least one processor, configured to operate the second set of one or more applications on the scheduled future resources to generate workload data.

13. The system of claim 12, wherein the workload forecasting operator is configured to predict the probability parameters of the second set of future workload data based on recurrent neural networks (RNN).

14. The system of claim 12, wherein the probability parameters include an average of the second set of future workload data, or a standard deviation of the second set of future workload data.

15. The system of claim 12, wherein the first set of historical workload data and the second set of future workload data include a CPU usage, a memory utilization, a network bandwidth, a latency, a delay, or a throughput.

16. The system of claim 12 wherein the first set of historical workload data and the second set of future workload data follow a Gaussian distribution or a Poisson distribution.

17. A non-transitory computer-readable medium having instructions stored thereon that, when executed by at least one computing device, cause the at least one computing device to perform operations comprising: collecting a first set of historical workload data generated by operating one or more applications at a first number of past time instances, wherein the first set of historical workload data are collected by a monitoring operator for a set of computing nodes within a computing system, and wherein the monitoring operator monitors historical resources used by the set of computi…
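The determining step in the independent claims can be illustrated with a small sketch. The claim text does not give the equation relating the two probabilities, so everything concrete here is an assumption: the future workload is taken to be Gaussian with predicted mean `mu` and standard deviation `sigma`, each node is assumed to meet the QoS while carrying between `min_load` and `max_load` units of work, and the objective is simply the sum of the over-allocation and under-allocation probabilities. The function name `choose_nodes` and all numbers are hypothetical.

```python
from math import erf, sqrt
from typing import Tuple


def normal_cdf(x: float, mu: float, sigma: float) -> float:
    """Cumulative distribution function of a Gaussian N(mu, sigma^2)."""
    return 0.5 * (1.0 + erf((x - mu) / (sigma * sqrt(2.0))))


def choose_nodes(mu: float, sigma: float, min_load: float, max_load: float,
                 current_nodes: int, max_nodes: int = 1000) -> Tuple[int, int, float]:
    """Pick a node count that keeps both allocation risks small.

    Assumed model (not taken from the patent): for workload W,
      lower bound of nodes = W / max_load  (fewer nodes would miss the QoS)
      upper bound of nodes = W / min_load  (more nodes would be wasted)
    Returns (nodes, offset relative to current_nodes, combined risk).
    """
    if sigma <= 0 or not 0 < min_load < max_load:
        raise ValueError("need sigma > 0 and 0 < min_load < max_load")
    best_n, best_risk = 1, float("inf")
    for n in range(1, max_nodes + 1):
        # Probability that n exceeds the upper bound: W < n * min_load.
        p_over = normal_cdf(n * min_load, mu, sigma)
        # Probability that n falls below the lower bound: W > n * max_load.
        p_under = 1.0 - normal_cdf(n * max_load, mu, sigma)
        risk = p_over + p_under  # assumed objective: plain sum
        if risk < best_risk:
            best_n, best_risk = n, risk
    return best_n, best_n - current_nodes, best_risk


# Made-up example: 1000 +/- 100 requests/s, 50 to 150 requests/s per node.
nodes, offset, risk = choose_nodes(mu=1000.0, sigma=100.0,
                                   min_load=50.0, max_load=150.0,
                                   current_nodes=8)
# Under these assumed numbers: nodes == 10 and offset == 2 (scale up by two).
```

The returned offset corresponds loosely to the "offset resource" of claim 11: a positive value suggests scaling up from the current allocation, a negative value scaling down. A weighted sum or another objective could be substituted if under-provisioning is considered costlier than over-provisioning.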

Assignees

SAP SE

Inventors

Classifications

  • Characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU]

  • Supervised learning

  • Probabilistic graphical models, e.g. probabilistic networks

  • Learning methods

  • Probabilistic or stochastic networks

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12106147B2 cover?
Disclosed herein are system, method, and computer program product embodiments for allocating resources based on predictions of workload probability parameters. The method can include collecting a first set of historical workload data generated by operating a first set of one or more applications at a first number of past time instances; predicting probability parameters of a second set of futur…
Who is the assignee on this patent?
SAP SE
What technology area does this patent fall under?
Primary CPC classification G06F9/5005. Mapped technology areas include Physics.
When was this patent published?
Publication date: Oct 1, 2024 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).