Predictive model-based intelligent system for automatically scaling and managing provisioned computing resources

US2019243691A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2019243691-A1
Application numberUS-201815887889-A
CountryUS
Kind codeA1
Filing dateFeb 2, 2018
Priority dateFeb 2, 2018
Publication dateAug 8, 2019
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system for automatically scaling provisioned resources includes an input interface and a processor. The input interface is configured to receive an estimate of a required number of processing threads. The processor is configured to determine required resources for processing the required number of processing threads using a model; provision the required resources; indicate to execute client tasks using the provisioned resources; determine server telemetry or logging data for the provisioned resources; provide the server telemetry or the logging data to the model; determine a resource utilization score based at least in part on the server telemetry or the logging data; determine a provisioning performance reward based at least in part on the resource utilization score; and adjust model parameters using the provisioning performance reward.

First claim

Opening claim text (preview).

What is claimed is: 1 . A system for automatically scaling provisioned resources, comprising: an input interface configured to: receive an estimate of a required number of processing threads; and a processor configured to: determine required resources for processing the required number of processing threads using a model; provision the required resources; indicate to execute client tasks using the provisioned resources; determine server telemetry or logging data for the provisioned resources; provide the server telemetry or the logging data to the model; determine a resource utilization score based at least in part on the server telemetry or the logging data; determine a provisioning performance reward based at least in part on the is resource utilization score; and adjust model parameters using the provisioning performance reward. 2 . The system of claim 1 , wherein indicating to execute client tasks causes the client tasks to execute. 3 . The system of claim 1 , wherein the required number of processing threads comprises a required number of processing threads for a server type of a plurality of server types. 4 . The system of claim 1 , wherein the client tasks comprise client tasks for a client of a plurality of clients. 5 . The system of claim 1 , wherein the resource utilization score is based at least in part on a nonlinear function of the performance metric data. 6 . The system of claim 1 , wherein the resource utilization score is based at least in part on asymmetric function of the server telemetry or the logging data. 7 . The system of claim 1 , wherein the provisioner performance reward comprises: a first value in response to the resource utilization score indicating the provisioned resources are underutilized; a second value in response to the resource utilization score indicating the provisioned resources are correctly utilized; and a third value in response to the resource utilization score indicating the provisioned resources are overutilized. 8 . The system of claim 1 , wherein a next determination of required resources is based at least in part on the server telemetry or the logging data provided to the model. 9 . The system of claim 1 , wherein the model comprises a neural network. 10 . The system of claim 9 , wherein the neural network comprises one or more of: a neuron layer, a plurality of neuron layers, a dense neuron layer, a dropout neuron layer, a feedforward layer, a recurrent layer, a long short term memory layer, a rectified linear unit activation function, or a tan h activation function. 11 . The system of claim 1 , wherein adjusting the model using the resource utilization score comprises using reinforcement learning. 12 . The system of claim 1 , wherein the estimate of the required number of processing threads is based at least in part on historical usage data. 13 . The system of claim 1 , wherein the estimate of the required number of processing threads is determined using a randomized packing algorithm. 14 . The system of claim 1 , wherein a determination of required resources is made at a regular interval. 15 . The system of claim 14 , wherein the regular interval comprises every minute, every 5 minutes, every 10 minutes, or every hour. 16 . The system of claim 1 , wherein determining the server telemetry or the logging data comprises determining a set of performance metrics. 17 . The system of claim 16 , wherein determining the server telemetry or the logging data comprises processing the set of performance metrics. 18 . The system of claim 17 , wherein processing the set of performance metrics comprises one or more of: scaling one or more performance metrics, computing a linear function of one or more performance metrics, aggregating performance metrics, computing a nonlinear function of one or more performance metrics, thresholding one or more performance metrics, processing the set of performance metrics using a model, processing the set of performance metrics using a machine learning model, or processing the set of metrics using a neural network. 19 . A method for automatically scaling provisioned resources, comprising: receiving an estimate of a required number of processing threads; determining, using a processor, required resources for processing the required number of processing threads using a model; provisioning the required resources; indicating to execute client tasks using the provisioned resources; determining server telemetry or logging data for the provisioned resources; providing the server telemetry or the logging data to the model; determining a resource utilization score based at least in part on the server telemetry or the logging data; determining a provisioning performance reward based at least in part on the resource is utilization score; and adjusting model parameters using the provisioning performance reward. 20 . A computer program product for automatically scaling provisioned resources, the computer program product being embodied in a non-transitory computer readable storage medium and comprising computer instructions for: receiving an estimate of a required number of processing threads; determining required resources for processing the required number of processing threads using a model; provisioning the required resources; indicating to execute client tasks using the provisioned resources; determining server telemetry or logging data for the provisioned resources; providing the server telemetry or the logging data to the model; determining a resource utilization score based at least in part on the server telemetry or the logging data; determining a provisioning performance reward based at least in part on the resource utilization score; and adjusting model parameters using the provisioning performance reward.

Assignees

Inventors

Classifications

  • G06F9/5077Primary

    Logical partitioning of resources; Management or configuration of virtualized resources (specific details on emulation or internal functioning of virtual machines G06F9/455) · CPC title

  • Learning methods · CPC title

  • considering the load · CPC title

  • Recurrent networks, e.g. Hopfield networks · CPC title

  • Combinations of networks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2019243691A1 cover?
A system for automatically scaling provisioned resources includes an input interface and a processor. The input interface is configured to receive an estimate of a required number of processing threads. The processor is configured to determine required resources for processing the required number of processing threads using a model; provision the required resources; indicate to execute client t…
Who is the assignee on this patent?
Workday Inc
What technology area does this patent fall under?
Primary CPC classification G06F9/5077. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Aug 08 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).