Cloud computing resource based on type of tasks and resource load rate

US12293223B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12293223-B2
Application numberUS-202117199121-A
CountryUS
Kind codeB2
Filing dateMar 11, 2021
Priority dateSep 11, 2018
Publication dateMay 6, 2025
Grant dateMay 6, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

This application provides a resource scheduling method, including: when a first scheduling request message is obtained, determining a first resource server from a resource pool based on a first quantity of resources requested by the first scheduling request message, and scheduling the first quantity of resources in the first resource server, where the resource pool includes at least one resource server, and the first scheduling request message requests a resource for a first-type task; and when a second scheduling request message is obtained, schedule a resource for a task corresponding to the second scheduling request message, determining a second resource server from the resource pool based on a second quantity of resources requested by the second scheduling request message, and scheduling a third quantity of resources in the second resource server, wherein the second scheduling request message requests a resource for a second-type task.

First claim

Opening claim text (preview).

What is claimed is: 1. A cloud computing resource scheduling method, performed by a resource scheduling apparatus comprising a processor and a memory, the method comprising: obtaining a first scheduling request message; determining a first resource server from a resource pool based on a first quantity of resources requested by the first scheduling request message; scheduling the first quantity of resources in the first resource server, wherein the resource pool comprises at least one resource server, and the first scheduling request message is used to request a resource for a first-type task, wherein the first-type task is a service level agreement (SLA) sensitive task; obtaining a second scheduling request message; placing the second scheduling request message in a waiting queue, wherein the waiting queue comprises a scheduling request message of at least one second-type task, wherein the second-type task is a service level agreement (SLA) insensitive task; determining, based on a resource load rate of the resource pool, to schedule a resource for a task corresponding to the second scheduling request message, the resource load rate indicating a resource utilization rate of the resource pool; determining a second resource server from the resource pool based on a second quantity of resources requested by the second scheduling request message; and scheduling a third quantity of resources in the second resource server, wherein the third quantity of resources is less than the second quantity of resources, wherein the second scheduling request message is used to request a resource for a second-type task, and wherein the second-type task is executed by the second resource server with the third quantity of resources, wherein the determining, based on the resource load rate of the resource pool, to schedule the resource for the task corresponding to the second scheduling request message comprises: determining that the resource load rate of the resource pool is less than a first threshold, and the task corresponding to the second scheduling request message is a task that is in the waiting queue and that requests a minimum quantity of resources among tasks corresponding to second scheduling request messages in the waiting queue. 2. The method according to claim 1 , further comprising: determining that the resource load rate of the resource pool is greater than or equal to the first threshold; selecting M second-type tasks from a plurality of tasks executed by the at least one resource server, and releasing resources occupied by the M second-type tasks, wherein M is an integer greater than 0. 3. The method according to claim 1 , further comprising: determining that a quantity of idle resources in the second resource server is less than a second threshold; selecting N second-type tasks from the plurality of tasks executed by the second resource server; and releasing resources occupied by the N second-type tasks, wherein N is an integer greater than 0. 4. The method according to claim 3 , further comprising: placing the N second-type tasks in the waiting queue, wherein the waiting queue comprises the scheduling request message of the at least one second-type task. 5. The method according to claim 1 , wherein the determining the first resource server from the resource pool based on the first quantity of resources requested by the first scheduling request message comprises: selecting, from the at least one resource server comprised in the resource pool, a resource server whose quantity of idle resources is greater than the first quantity of resources as the first resource server. 6. The method according to claim 1 , wherein the determining the second resource server from the resource pool based on the second quantity of resources requested by the second scheduling request message comprises: selecting, from the at least one resource server comprised in the resource pool, a resource server whose quantity of idle resources is greater than the third quantity of resources as the second resource server. 7. The method according to claim 1 , wherein the first-type task is a scheduled with resources based on a quantity of requested resources. 8. The method according to claim 1 , wherein the second-type task is scheduled with resources based on resource usage. 9. A cloud computing resource scheduling apparatus, comprising at least one processor, wherein the at least one processor is coupled to at least one memory; and wherein the at least one processor is configured to execute a computer program or an instruction stored in the at least one memory, so that the apparatus is configured to perform a resource scheduling method comprising: obtaining a first scheduling request message; determining a first resource server from a resource pool based on a first quantity of resources requested by the first scheduling request message; scheduling the first quantity of resources in the first resource server, wherein the resource pool comprises at least one resource server, and the first scheduling request message is used to request a resource for a first-type task, wherein the first-type task is a service level agreement (SLA) sensitive task; obtaining a second scheduling request message; placing the second scheduling request message in a waiting queue, wherein the waiting queue comprises a scheduling request message of at least one second-type task, wherein the second-type task is a service level agreement (SLA) insensitive task; determining, based on a resource load rate of the resource pool, to schedule a resource for a task corresponding to the second scheduling request message, the resource load rate indicating a resource utilization rate of the resource pool; determining a second resource server from the resource pool based on a second quantity of resources requested by the second scheduling request message; and scheduling a third quantity of resources in the second resource server, wherein the third quantity of resources is less than the second quantity of resources, wherein the second scheduling request message is used to request a resource for a second-type task, and wherein the second-type task is executed by the second resource server with the third quantity of resources, wherein the determining, based on the resource load rate of the resource pool, to schedule the resource for the task corresponding to the second scheduling request message comprises: determining that the resource load rate of the resource pool is less than a first threshold, and the task corresponding to the second scheduling request message is a task that is in the waiting queue and that requests a minimum quantity of resources among tasks corresponding to second scheduling request messages in the waiting queue. 10. The apparatus according to claim 9 , wherein the resource scheduling method further comprises: determining that the resource load rate of the resource pool is greater than or equal to the first threshold; selecting M second-type tasks from a plurality of tasks executed by the at least one resource server, and releasing resources occupied by the M second-type tasks, wherein M is an integer greater than 0. 11. The apparatus according to claim 9 , wherein the resource scheduling method further comprises: determining that a quantity of idle resources in the second resource server is less than a second threshold; selecting N second-type tasks from the plurality of tasks executed by the second resource server; and releasing resources occupied by the N second-type tasks, wherein N is an integer greater than 0. 12. The apparatus according to claim 11 , wherein the resource schedulin

Assignees

Inventors

Classifications

  • by program, e.g. task dispatcher, supervisor, operating system · CPC title

  • G06F9/4812Primary

    by interrupt, e.g. masked · CPC title

  • using data related to the state of servers by a load balancer · CPC title

  • based on parameters of servers, e.g. available memory or workload (monitoring of computer activity G06F11/30) · CPC title

  • Pool · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12293223B2 cover?
This application provides a resource scheduling method, including: when a first scheduling request message is obtained, determining a first resource server from a resource pool based on a first quantity of resources requested by the first scheduling request message, and scheduling the first quantity of resources in the first resource server, where the resource pool includes at least one resourc…
Who is the assignee on this patent?
Huawei Cloud Computing Tech Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06F9/4812. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 06 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).