Method and system for multi-tenant resource distribution

US10609129B2 · US · B2

Patent metadata
Field: Value
Publication number: US-10609129-B2
Application number: US-201615142371-A
Country: US
Kind code: B2
Filing date: Apr 29, 2016
Priority date: Feb 19, 2016
Publication date: Mar 31, 2020
Grant date: Mar 31, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

In a distributed computing network, requests for allocation of resources to tenant workloads and messages identifying resource availability are received and aggregated. Resources are allocated to the workloads in accordance with a distribution policy defining values for resource entitlements of the tenants. The values include pre-emption quantities. In response to determining that a quantity of resources allocated for workloads of a first tenant is less than the tenant's pre-emption quantity, processing of another workload from a second tenant is interrupted to re-allocate resources from the second tenant's workload to the first tenant's workload.
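The pre-emption rule in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `Workload`, `Tenant`, and `preempt_if_below` are hypothetical, resources are modelled as plain integer units, and the sketch simply interrupts the second tenant's first running workload, a choice the abstract does not specify.

```python
from dataclasses import dataclass, field


@dataclass
class Workload:
    name: str
    units: int = 0         # resource units currently held (assumed integer units)
    running: bool = False


@dataclass
class Tenant:
    name: str
    preemption_quantity: int   # entitlement value from the distribution policy
    workloads: list = field(default_factory=list)

    @property
    def allocated(self) -> int:
        # Only running workloads hold resources.
        return sum(w.units for w in self.workloads if w.running)


def preempt_if_below(first: Tenant, second: Tenant, pending: Workload) -> bool:
    """Re-allocate resources from `second` to `first` when `first` is short.

    Returns True if a workload of `second` was interrupted.
    """
    if first.allocated >= first.preemption_quantity:
        return False                       # entitlement already met
    # Assumption: interrupt the first running workload found.
    victim = next((w for w in second.workloads if w.running), None)
    if victim is None:
        return False                       # nothing to pre-empt
    freed = victim.units
    victim.units = 0
    victim.running = False                 # interrupted before completion
    pending.units += freed                 # hand the freed units over
    pending.running = True
    if pending not in first.workloads:
        first.workloads.append(pending)
    return True


tenant_a = Tenant("A", preemption_quantity=4)
tenant_b = Tenant("B", preemption_quantity=2,
                  workloads=[Workload("b1", 3, True), Workload("b2", 3, True)])
waiting = Workload("a1")
preempted = preempt_if_below(tenant_a, tenant_b, waiting)
```

In this example tenant A holds nothing while its pre-emption quantity is 4, so one of tenant B's workloads is interrupted and its three units move to A's waiting workload.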

First claim

Opening claim text (preview).

What is claimed is:

1. A method of resource allocation in a distributed computing system comprising a plurality of tenants organized in a tiered hierarchy, the plurality of tenants including a root tenant associated with at least two sub-tenants, and each sub-tenant of the two sub-tenants associated with one or more corresponding leaf tenants, the method comprising: receiving, at a distributed resource manager, a plurality of requests from said plurality of tenants for allocation of quantities of resources to workloads of said tenants; allocating, at the distributed resource manager, said quantities of resources to said workloads of said plurality of tenants in accordance with a distribution policy, the distribution policy defining a resource entitlement for each of said one or more corresponding leaf tenants and defining a hierarchical resource entitlement for each of the at least two sub-tenants, said resource entitlement for each of the plurality of tenants including a guaranteed quantity of resources for said each of the plurality of tenants; and in response to determining, at the distributed resource manager, that a first quantity of resources allocated for workloads of a first sub-tenant of said plurality of tenants is less than the guaranteed quantity of resources for said first sub-tenant: selecting, by the distributed resource manager, a second sub-tenant from among said plurality of tenants based on a comparison of said guaranteed quantity of resources for said second sub-tenant and a quantity of resources allocated to workloads of said second sub-tenant, and interrupting processing of a workload of said second sub-tenant prior to completing processing of said workload of said second sub-tenant and re-allocating the quantity of resources allocated to said workload of said second sub-tenant to a workload of said workloads of said first sub-tenant.

2. The method of claim 1, wherein said resource entitlement for the each of the plurality of tenants comprises a reserve quantity of resources for the each of the plurality of tenants, and wherein allocating quantities of resources comprises allocating said reserve quantity of resources to workloads of each of said plurality of tenants independently of a quantity of resources requested by said workloads of each of said plurality of tenants.

3. The method of claim 1, wherein said resource entitlement for the each of the plurality of tenants includes a maximum quantity of resources for the each of the plurality of tenants, and wherein said allocating comprises allocating quantities of resources to workloads of each tenant its maximum quantity of resources, wherein said maximum quantity of resources is less than a quantity of idle resources available for allocation to workloads of said plurality of tenant.

4. The method of claim 1, wherein said distribution policy comprises resource allocation weightings for users belonging to ones of said tenants.

5. The method of claim 4, wherein said allocating quantities of resources comprises allocating said quantities of resources to workloads for said users according to said resource allocation weightings.

6. The method of claim 1, wherein said guaranteed quantity of resources for each tenant comprises a proportional share of available resources.

7. The method of claim 1, wherein said guaranteed quantity of resources for each tenant comprises an absolute value defining a number of resource units.

8. The method of claim 1, wherein said guaranteed quantity of resources for each tenant comprises a proportional value of available resources and an absolute value defining a number of resource units.

9. The method of claim 1, wherein said distributed computing system comprises resources in a plurality of resource pools, and wherein said allocating comprises allocating resources of a first resource pool to a first set of said plurality of tenants and allocating resources of a second resource pool to a second set of said plurality of tenants.

10. The method of claim 9, wherein at least one tenant is part of both said first set of said plurality of tenants and said second set of said plurality of tenants.

11. The method of claim 9, further comprising defining a data structure defining a distribution policy for each tenant of each one of said first and second resource pools.

12. A master server of a distributed computing system comprising a plurality of resource servers, the master server comprising: a resource collection module for receiving messages identifying available resources associated with said resource servers; a demand collection module for receiving messages identifying resource requests from a plurality of tenants of said distributed computing system, the plurality of tenants organized in a tiered hierarchy, the plurality of tenants including a root tenant associated with at least two sub-tenants, and each sub-tenant of the two sub-tenants associated with one or more corresponding leaf tenants; a data structure comprising a distribution policy for said available resources, said distribution policy containing a resource entitlement for each of said one or more corresponding leaf tenants and defining a hierarchical resource entitlement for each of the at least two sub-tenants, said resource entitlement for each of the plurality of tenants including a guaranteed quantity of resources for said each of the plurality of tenants; a distributed resource manager for: determining that a first quantity of resources allocated for workloads of a first sub-tenant of said plurality of tenants is less than the guaranteed quantity of resources for said first sub-tenant; selecting a second sub-tenant from among said plurality of tenants based on a comparison of said guaranteed quantity of resources for said second sub-tenant and a quantity of resources allocated to workloads of said second sub-tenant; and interrupting processing a workload of said second sub-tenant prior to completing processing of said workload of said second sub-tenant and re-allocating the quantity of resources allocated to said workload of said second sub-tenant to a workload of said workloads of said first sub-tenant.

13. The master server of claim 12, wherein said resource entitlement for the each of the plurality of tenants comprises a reserve quantity of resources for the each of the plurality of tenants, and wherein allocating said quantities of resources comprises allocating said reserve quantity of resources to workloads of each of said plurality of tenants independently of a quantity of resources requested by said workloads of each of said tenants.

14. The master server of claim 12, wherein said resource entitlement for the each of the plurality of tenants includes a maximum quantity of resources for the each of the plurality of tenants, wherein said quantities of resources to workloads of each tenant are allocated its maximum quantity of resources, and wherein said maximum quantity of resources is less than a quantity of idle resources available for allocation to workloads of the each of the plurality of tenants.

15. The master server of claim 12, wherein said distribution policy comprises resource allocation weightings for users belonging to ones of said tenants.

16. The master server of claim 12, wherein said guaranteed quantity of resources for each tenant comprises a proportional share of available resources.

17. The master server of claim 12, wherein said guaranteed quantity of resources for each tenant comprises an absolute value defining a number of resource units.

18. The master serv
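The tiered hierarchy and the selection step of claim 1 can be pictured with a short Python sketch. Everything in it is an illustrative assumption rather than the claimed method itself: `TenantNode`, `is_below_guarantee`, and `select_second_sub_tenant` are hypothetical names, a sub-tenant's allocation is taken to be the sum over its leaf tenants, and the "comparison" is read as picking the sub-tenant with the largest surplus over its guaranteed quantity. The claim only requires that the selection be based on such a comparison.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class TenantNode:
    name: str
    guaranteed: int = 0    # guaranteed quantity, in resource units
    allocated: int = 0     # units held by this tenant's own workloads
    children: list = field(default_factory=list)   # sub-tenants or leaf tenants

    def total_allocated(self) -> int:
        # Assumption: a tenant's usage includes that of all its descendants.
        return self.allocated + sum(c.total_allocated() for c in self.children)


def is_below_guarantee(tenant: TenantNode) -> bool:
    return tenant.total_allocated() < tenant.guaranteed


def select_second_sub_tenant(root: TenantNode,
                             first: TenantNode) -> Optional[TenantNode]:
    """Pick the sub-tenant (other than `first`) with the largest surplus."""
    over = [s for s in root.children
            if s is not first and s.total_allocated() > s.guaranteed]
    # Assumption: the largest surplus wins; returns None if nobody is over.
    return max(over, key=lambda s: s.total_allocated() - s.guaranteed,
               default=None)


eng = TenantNode("eng", guaranteed=6, children=[
    TenantNode("eng-build", allocated=2), TenantNode("eng-test", allocated=1)])
sales = TenantNode("sales", guaranteed=4, children=[
    TenantNode("sales-reports", allocated=7)])
ops = TenantNode("ops", guaranteed=2, children=[
    TenantNode("ops-backup", allocated=3)])
root = TenantNode("root", children=[eng, sales, ops])

victim = select_second_sub_tenant(root, eng) if is_below_guarantee(eng) else None
```

Here `eng` holds 3 units against a guarantee of 6, so it qualifies as the first sub-tenant; `sales` exceeds its guarantee by 3 units and `ops` by 1, so under the largest-surplus assumption `sales` is selected and one of its workloads would be interrupted.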

Assignees

Huawei Tech Co Ltd

Inventors

Classifications

  • considering the load · CPC title

  • Logical partitioning of resources; Management or configuration of virtualized resources (specific details on emulation or internal functioning of virtual machines G06F9/455) · CPC title

  • based on parameters of servers, e.g. available memory or workload (monitoring of computer activity G06F11/30) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10609129B2 cover?
In a distributed computing network, requests for allocation of resources to tenant workloads and messages identifying resource availability are received and aggregated. Resources are allocated to the workloads in accordance with a distribution policy defining values for resource entitlements of the tenants. The values include pre-emption quantities. In response to determining that a quantity of…
Who is the assignee on this patent?
Huawei Tech Co Ltd
What technology area does this patent fall under?
Primary CPC classification H04L67/1008. Mapped technology areas include Electricity.
When was this patent published?
Publication date Mar 31, 2020 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).