Virtual mode execution manager
US-12118376-B2 · Oct 15, 2024 · US
US2023367656A1 · US · A1
| Field | Value |
|---|---|
| Publication number | US-2023367656-A1 |
| Application number | US-202318195620-A |
| Country | US |
| Kind code | A1 |
| Filing date | May 10, 2023 |
| Priority date | May 10, 2022 |
| Publication date | Nov 16, 2023 |
| Grant date | — |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Provided are a provisioning method for cloud service and a system thereof. The provision method according to some embodiments may include collecting a resource utilization rate of each of a plurality of cloud nodes, receiving a regular instance request from a client, determining whether there is a cloud node among the plurality of cloud nodes that satisfies a first condition, in which the collected resource utilization rate is less than a first threshold value, designating a specific cloud node from among the plurality of cloud nodes based on a determination that there is no cloud node that satisfies the first condition, and terminating at least some of spot instances pre-provisioned on the specific cloud node and provisioning a requested regular instance.
Opening claim text (preview).
What is claimed is: 1 . A provisioning method performed by at least one computing device for a cloud service, the provisioning method comprising: collecting a resource utilization rate of each of a plurality of cloud nodes, wherein the resource utilization rate includes a resource utilization rate for at least one of a processor or a memory; receiving a regular instance request from at least one client; determining whether there is a cloud node among the plurality of cloud nodes that satisfies a first condition, in which the collected resource utilization rate is less than a first threshold value; designating a specific cloud node from among the plurality of cloud nodes based on a determination that there is no cloud node that satisfies the first condition; and terminating at least some of spot instances pre-provisioned on the specific cloud node and provisioning a requested regular instance. 2 . The provisioning method of claim 1 , wherein the collecting the resource utilization rate comprises: collecting a resource utilization rate of the specific cloud node through a node agent operating on the specific cloud node; and collecting a resource utilization rate of a specific cloud instance through an instance agent operating on the specific cloud instance of the specific cloud node. 3 . The provisioning method of claim 1 , wherein the provisioning the requested regular instance comprises: designating a termination target from a list of the pre-provisioned spot instances; predicting a resource utilization rate of the specific cloud node according to termination of the designated spot instance; and terminating the designated spot instance based on a determination that the predicted resource utilization rate is less than the first threshold value. 4 . The provisioning method of claim 3 , wherein the termination target is designated based on a resource utilization rate of a spot instance. 5 . The provisioning method of claim 3 , wherein the termination target is designated based on at least one of a bid price or a used period of a spot instance. 6 . The provisioning method of claim 3 , wherein the provisioning the requested regular instance further comprises: further designating another termination target from the list of the pre-provisioned spot instances based on a determination that the predicted resource utilization rate is equal to or greater than the first threshold value. 7 . The provisioning method of claim 1 , further comprising: receiving a spot instance request from a client; determining whether there is a cloud node among the plurality of cloud nodes that satisfies a second condition, in which the collected resource utilization rate is less than a second threshold value; and performing provisioning for a requested spot instance based on a result of the determining whether there is the cloud node that satisfies the second condition, wherein the second threshold value is set to a value greater than the first threshold value. 8 . The provisioning method of claim 1 , further comprising: receiving, from a client, a plurality of spot instance requests within a current time window; and performing, at a last point of the current time window, provisioning for requested plurality of spot instances. 9 . The provisioning method of claim 8 , wherein the performing the provisioning for the requested plurality of spot instances comprises: excluding, from a provisioning target, an instance that exceeds an allowable waiting time of a corresponding client among the requested plurality of spot instances. 10 . The provisioning method of claim 8 , wherein the performing the provisioning for the requested plurality of spot instances comprises: determining at least one spot instance among the requested plurality of spot instances based on a bid price; and provisioning the at least one determined spot instance. 11 . The provisioning method of claim 8 , wherein the performing the provisioning for the requested plurality of spot instances comprises: determining at least one cloud node among the plurality of cloud nodes that satisfies a second condition, in which the collected resource utilization rate is less than a second threshold value; calculating a total number of spot instances that can be provisioned in the determined at least one cloud node based on a resource utilization rate of the determined at least one cloud node; and provisioning the calculated total number of spot instances among the requested plurality of spot instances. 12 . The provisioning method of claim 11 , wherein the calculating the total number of spot instances that can be provisioned comprises: calculating a resource availability rate of a first cloud node, among the determined at least one cloud node, based on a resource utilization rate of the first cloud node and the second threshold value; calculating a representative resource utilization rate for a spot instance based on a resource utilization rate of a spot instance operating in the first cloud node; and calculating a number of spot instances that can be provisioned in the first cloud node based on the resource availability rate and the representative resource utilization rate. 13 . A provisioning method performed by at least one computing device for a cloud service, the provisioning method comprising: collecting a resource utilization rate of each of a plurality of cloud nodes, wherein the resource utilization rate includes a resource utilization rate for at least one of a processor or a memory; receiving a regular instance request from at least one client; determining whether there is a cloud node among the plurality of cloud nodes that satisfies a first condition, in which the collected resource utilization rate is less than a first threshold value; determining a source cloud node from among the plurality of cloud nodes based on a determination that there is no cloud node that satisfies the first condition; migrating at least some of spot instances of the source cloud node to a target cloud node; and provisioning a requested regular instance at the source cloud node. 14 . The provisioning method of claim 13 , wherein the determining the source cloud node comprises: determining whether a first cloud node among the plurality of cloud nodes satisfies the first condition according to spot instance migration or termination; and determining the first cloud node as the source cloud node based on a determination that the first cloud node satisfies the first condition. 15 . The provisioning method of claim 14 , wherein the determining the source cloud node further comprises: determining, based on the first cloud node not satisfying the first condition, whether a second cloud node among the plurality of cloud nodes satisfies the first condition according to the spot instance migration or the termination. 16 . The provisioning method of claim 13 , wherein the migrating the at least some of the spot instances of the source cloud node comprises: determining the target cloud node from among the plurality of cloud nodes based on a number of spot instances that can be provisioned in the target cloud node; and migrating the at least some of spot instances of the source cloud node to the determined target cloud node. 17 . The provisioning method of claim 16 , wherein the migrating to the determined target cloud node comprises: migrating spot instances of the source cloud node as many as the number of spot instances that can be provisioned in the determined target cloud node; wherein s
involving task migration · CPC title
Mechanisms to release resources · CPC title
Monitor · CPC title
Workload threshold · CPC title
to service a request · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.