Method, device and computer program product for managing dedicated processing resources

US11663050B2 · US · B2

Patent metadata
Publication number: US-11663050-B2
Application number: US-201916384209-A
Country: US
Kind code: B2
Filing date: Apr 15, 2019
Priority date: Apr 20, 2018
Publication date: May 30, 2023
Grant date: May 30, 2023

Abstract

Official abstract text for this publication.

A resource management method comprises: in response to receiving, from an application operating on a client, a resource allocation request indicating an amount of dedicated processing resources required by the application, acquiring a mapping between a group of physical dedicated processing resources provided by a group of servers and a group of logical dedicated processing resources, the group of physical dedicated processing resources being divided into the group of logical dedicated processing resources; determining allocation statuses of the group of logical dedicated processing resources; determining, based at least on the mapping and the allocation statuses, a first amount of logical dedicated processing resources to be allocated to the application from the group of logical dedicated processing resources; and indicating the first amount of logical dedicated processing resources to the application, to allow the application to utilize physical dedicated processing resources provided by at least one of the group of servers.
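The flow in the abstract (partition physical resources into logical ones, keep a mapping and per-resource allocation statuses, pick an amount that satisfies the request, then free it again) can be illustrated with a minimal Python sketch. This is not the patented implementation: the class and method names (`LogicalGpu`, `Controller`, `partition`, `allocate`, `release`), the ID format, and the first-fit selection are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass(frozen=True)
class LogicalGpu:
    """One logical slice of a physical resource (hypothetical shape)."""
    logical_id: str
    physical_id: str
    server: str


@dataclass
class Controller:
    # Mapping entries: logical resource -> physical resource and hosting server.
    mapping: Dict[str, LogicalGpu] = field(default_factory=dict)
    # Allocation status per logical resource; None means available.
    status: Dict[str, Optional[str]] = field(default_factory=dict)

    def partition(self, server: str, physical_id: str, slices: int) -> None:
        """Divide one physical resource into `slices` logical resources."""
        for i in range(slices):
            lid = f"{physical_id}/l{i}"  # assumed naming scheme
            self.mapping[lid] = LogicalGpu(lid, physical_id, server)
            self.status[lid] = None

    def allocate(self, app: str, amount: int) -> List[LogicalGpu]:
        """First-fit choice (an assumption) of `amount` free logical resources."""
        free = [lid for lid, owner in self.status.items() if owner is None]
        if len(free) < amount:
            raise RuntimeError("not enough free logical resources")
        chosen = free[:amount]
        for lid in chosen:
            self.status[lid] = app  # update allocation statuses
        return [self.mapping[lid] for lid in chosen]

    def release(self, app: str) -> None:
        """Mark every logical resource held by `app` as available again."""
        for lid, owner in self.status.items():
            if owner == app:
                self.status[lid] = None


ctl = Controller()
ctl.partition("server-a", "gpu0", 2)
ctl.partition("server-b", "gpu1", 2)
grant = ctl.allocate("app-1", 3)
print([(g.logical_id, g.server) for g in grant])
ctl.release("app-1")
```

The returned entries carry the server name, which is what would let a client contact the right server for each granted logical resource.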

First claim

Opening claim text (preview).

What is claimed is:

1. A method, comprising:
    logically partitioning, by a controller, a group of physical dedicated processing resources, which reside on a group of servers, into a group of logical dedicated processing resources, wherein at least one physical dedicated processing resource is logically partitioned into multiple logical dedicated processing resources to enable concurrent use of the at least one physical dedicated processing resource by a plurality of applications, and wherein at least one physical dedicated processing resource of the group of physical dedicated processing resources comprises a graphics processing unit (GPU) device comprising GPU compute and storage resources that are logically partitioned into a plurality of logical GPU compute and storage resources;
    receiving, by the controller, a resource allocation request from a client driver operating on a client device, the resource allocation request indicating an amount of dedicated processing resources for allocating to an application executing on the client device;
    in response to the resource allocation request, the controller performing a process comprising:
    acquiring a mapping between the group of physical dedicated processing resources and the group of logical dedicated processing resources, wherein the mapping comprises at least one entry which maps at least one logical dedicated processing resource to at least one physical dedicated processing resource and which identifies a server of the group of servers on which the at least one physical dedicated processing resource resides;
    determining allocation statuses of the group of logical dedicated processing resources;
    determining, based at least on the mapping and the allocation statuses, a first amount of logical dedicated processing resources to be allocated to the application from the group of logical dedicated processing resources, the first amount of logical dedicated processing resources meeting the requested amount of dedicated processing resources;
    indicating the first amount of logical dedicated processing resources to the client driver, to allow the client driver to communicate with at least one server of the group of servers to establish at least one data path between the client device and the at least one server of the group of servers and enable the application to utilize physical dedicated processing resources provided by the at least one server of the group of servers, the physical dedicated processing resources corresponding to the first amount of logical dedicated processing resources; and
    in response to receiving, by the controller, notification of completion of execution of the application and release of the first amount of logical dedicated processing resources, the controller updating the allocation statuses of the group of logical dedicated processing resources to indicate availability of the first amount of logical dedicated processing resources;
    wherein the controller comprises at least one processing device comprising a processor coupled to a memory.

2. The method according to claim 1, wherein the allocation statuses indicate a second amount of logical dedicated processing resources available for allocation in the group of logical dedicated processing resources, and determining the first amount of logical dedicated processing resources comprises: determining, from the second amount of logical dedicated processing resources, the first amount of logical dedicated processing resources meeting the amount of dedicated processing resources.

3. The method according to claim 1, wherein determining the first amount of logical dedicated processing resources comprises: determining the first amount of logical dedicated processing resources, such that the first amount of logical dedicated processing resources include a plurality of logical dedicated processing resources mapped to a same physical dedicated processing resource in the group of physical dedicated processing resources.

4. The method of claim 1, wherein determining the first amount of logical dedicated processing resources comprises: determining the first amount of logical dedicated processing resources, such that the first amount of logical dedicated processing resources include a plurality of logical dedicated processing resources provided by a same server in the group of servers.

5. The method of claim 1, wherein determining the first amount of logical dedicated processing resources comprises: determining the first amount of logical dedicated processing resources, such that the first amount of logical dedicated processing resources is uniformly mapped to the group of physical dedicated processing resources.

6. The method of claim 1, wherein determining the first amount of logical dedicated processing resources comprises: determining an estimate about execution efficiencies of the application over different physical dedicated processing resources; and determining, based on the estimate, the first amount of logical dedicated processing resources to be allocated to the application.

7. The method of claim 1, wherein indicating the first amount of logical dedicated processing resources to the application comprises: determining whether the first amount of logical dedicated processing resources include a plurality of logical dedicated processing resources mapped to a same physical dedicated processing resource; in response to determining that the first amount of logical dedicated processing resources include a plurality of logical dedicated processing resources mapped to a same physical dedicated processing resource, aggregating the plurality of logical dedicated processing resources into a single logical dedicated processing resource; indicating the first amount of logical dedicated processing resources by indicating the single logical dedicated processing resource to the application.

8. The method of claim 1, further comprising: in response to the first amount of logical dedicated processing resources being indicated to the application, updating respective allocation statuses of the first amount of logical dedicated processing resources.

9. The method of claim 1, further comprising the controller generating the mapping and storing the mapping in a storage device that is accessible by the controller.

10. A device, comprising:
    at least one processing unit;
    at least one memory coupled to the at least one processing unit and storing instructions executed by the at least one processing unit, the instructions, when executed by the at least one processing unit, causing the device to perform actions for managing resources, the actions comprising:
    logically partitioning a group of physical dedicated processing resources, which reside on a group of servers, into a group of logical dedicated processing resources, wherein at least one physical dedicated processing resource is logically partitioned into multiple logical dedicated processing resources to enable concurrent use of the at least one physical dedicated processing resource by a plurality of applications, and wherein at least one physical dedicated processing resource of the group of physical dedicated processing resources comprises a graphics processing unit (GPU) device comprising GPU compute and storage resources that are logically partitioned into a plurality of logical GPU compute and storage resources;
    receiving a resource allocation request from a client driver operating on a client device, the resource allocation request indicating an amount of dedicated processing resources for allocating to an application executing on the client device;
    in response to the resource allocation request, performing a process which comprises: acquiring a m
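Claims 3 to 5 and 7 describe different ways of choosing and reporting the logical resources: several on the same physical resource, several on the same server, a uniform spread over physical resources, and aggregation of co-located logical resources into one. The sketch below is one possible reading of those options, not the claimed method itself. The tuple layout `(logical_id, physical_id, server)`, the function names `pack`, `spread` and `aggregate`, and the round-robin interpretation of "uniformly mapped" are all assumptions.

```python
from collections import defaultdict
from itertools import zip_longest

# Each free logical resource is a tuple (logical_id, physical_id, server).
# This layout is hypothetical and chosen only for the example.


def group_by(free, key_index):
    """Group free resources by physical_id (index 1) or server (index 2)."""
    groups = defaultdict(list)
    for item in free:
        groups[item[key_index]].append(item)
    return groups


def pack(free, amount, key_index):
    """Prefer resources that share a physical device or a server (claims 3, 4)."""
    groups = sorted(group_by(free, key_index).values(), key=len, reverse=True)
    flat = [item for group in groups for item in group]
    return flat[:amount]


def spread(free, amount):
    """Round-robin over physical devices, one reading of claim 5."""
    groups = group_by(free, 1).values()
    rows = zip_longest(*groups)
    interleaved = [item for row in rows for item in row if item is not None]
    return interleaved[:amount]


def aggregate(chosen):
    """Merge logical resources on the same physical device into one entry (claim 7)."""
    merged = defaultdict(list)
    for logical_id, physical_id, server in chosen:
        merged[(physical_id, server)].append(logical_id)
    return [
        {"physical": pid, "server": srv, "logical": lids}
        for (pid, srv), lids in merged.items()
    ]


free = [
    ("gpu0/l0", "gpu0", "server-a"),
    ("gpu1/l0", "gpu1", "server-a"),
    ("gpu1/l1", "gpu1", "server-a"),
    ("gpu2/l0", "gpu2", "server-b"),
]
same_device = pack(free, 2, key_index=1)
same_server = pack(free, 3, key_index=2)
uniform = spread(free, 3)
print(aggregate(same_device))
```

With this sample data, `same_device` holds both slices of `gpu1`, `same_server` holds three slices on `server-a`, and `uniform` holds one slice from each of the three physical devices.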

Classifications

  • Server selection for load balancing · CPC title

  • G06F9/5077 · Primary

    Logical partitioning of resources; Management or configuration of virtualized resources (specific details on emulation or internal functioning of virtual machines G06F9/455) · CPC title

  • G06T1/20 · Primary

    Processor architectures; Processor configuration, e.g. pipelining · CPC title

  • Memory management · CPC title

  • considering hardware capabilities · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11663050B2 cover?
A resource management method comprises: in response to receiving, from an application operating on a client, a resource allocation request indicating an amount of dedicated processing resources required by the application, acquiring a mapping between a group of physical dedicated processing resources provided by a group of servers and a group of logical dedicated processing resources, the group…
Who is the assignee on this patent?
EMC IP Holding Co LLC
What technology area does this patent fall under?
Primary CPC classification G06F9/5077. Mapped technology areas include Physics.
When was this patent published?
Publication date: May 30, 2023 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).