Method, apparatus, and computer program product for processing computing task

US2019324809A1 · US · A1

Patent metadata
Publication number: US-2019324809-A1
Application number: US-201916382327-A
Country: US
Kind code: A1
Filing date: Apr 12, 2019
Priority date: Apr 20, 2018
Publication date: Oct 24, 2019
Grant date: (none shown)

Abstract

Official abstract text for this publication.

Implementations of the present disclosure relate to a method, apparatus and computer program product for processing a computing task. The method comprises: obtaining status information of multiple computing resources; in response to receiving a neural network model-based computing task, determining configuration information of multiple layers associated with the neural network model; obtaining parameter data associated with at least one part of the multiple layers on the basis of the configuration information; and based on the status information and the parameter data, selecting from the multiple computing resources a group of computing resources for processing the computing task. According to the example implementations of the present disclosure, multiple computing resources may be utilized sufficiently, and it may be guaranteed that a load balance may be stricken between the multiple computing resources.
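The four steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes that "configuration information" is a list of layer descriptions with a weight shape, that "parameter data" is the byte size of each layer's parameters, and that "status information" is simply free memory per resource. The names `parameter_sizes`, `select_group`, `shape` and `bytes_per_param` are hypothetical.

```python
from math import prod


def parameter_sizes(layer_configs, bytes_per_param=4):
    """Derive per-layer parameter sizes (bytes) from layer configuration.

    Assumption: each config is a dict with a 'name' and a weight 'shape'.
    """
    return {cfg["name"]: prod(cfg["shape"]) * bytes_per_param
            for cfg in layer_configs}


def select_group(free_memory, sizes):
    """Pick resources, most free memory first, until the group can hold
    all parameters. 'free_memory' maps resource name -> free bytes and
    stands in for the status information (an assumption of this sketch).
    """
    needed = sum(sizes.values())
    group, covered = [], 0
    ranked = sorted(free_memory.items(), key=lambda kv: kv[1], reverse=True)
    for name, free in ranked:
        if covered >= needed:
            break
        group.append(name)
        covered += free
    if covered < needed:
        raise RuntimeError("not enough free memory across all resources")
    return group


# Hypothetical status information and model configuration.
status = {"gpu0": 100, "gpu1": 400, "gpu2": 250}
config = [
    {"name": "conv1", "shape": [3, 3, 4]},   # 36 parameters
    {"name": "fc1", "shape": [10, 10]},      # 100 parameters
]

sizes = parameter_sizes(config)
group = select_group(status, sizes)
print(sizes, group)
```

The sketch selects a group by total capacity only. The dependent claims describe a finer, layer-by-layer selection.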

First claim

Opening claim text (preview).

What is claimed is:

1. A method for processing a computing task, comprising: obtaining status information of multiple computing resources; in response to receiving a neural network model-based computing task, determining configuration information of multiple layers associated with the neural network model; obtaining parameter data associated with at least one part of the multiple layers on the basis of the configuration information; and based on the status information and the parameter data, selecting from the multiple computing resources a group of computing resources for processing the computing task.

2. The method of claim 1, wherein selecting a group of computing resources on the basis of the status information comprises: regarding a first layer among the at least one part of the multiple layers, determining a first size of parameters associated with the first layer on the basis of the parameter data; based on the status information, selecting a first computing resource matching the first size from the multiple computing resources to process the parameters associated with the first layer.

3. The method of claim 2, wherein selecting a group of computing resources on the basis of the status information comprises: determining corresponding sizes of parameters associated with the at least one part of the multiple layers on the basis of the parameter data; ranking the at least one part of the multiple layers on the basis of the corresponding sizes; and selecting the first layer on the basis of the rankings.

4. The method of claim 2, further comprising: determining a first resource allocation required for processing the parameters associated with the first layer; and updating the status information on the basis of the first resource allocation.

5. The method of claim 4, further comprising: regarding a second layer among the at least one part of the multiple layers, determining a second size of parameters associated with the second layer on the basis of the parameter data; and based on the updated status information, selecting a second computing resource matching the second size from the multiple computing resources to process the parameters associated with the second layer.

6. The method of claim 5, further comprising: determining a number of computing resources required for processing the at least one part of the multiple layers; and in response to determining that the selected group of computing resources reach the number, selecting a computing resource from the selected group of computing resources to process parameters associated with a third layer among the at least one part of the multiple layers.

7. The method of claim 1, wherein determining status information of the multiple computing resources comprises: monitoring resource information of the multiple computing resources; and determining status information of the multiple computing resources on the basis of the resource information.

8. The method of claim 7, wherein the resource information of the multiple computing resources comprises at least any one of the following indicators: processing capacity information, memory resource information and bandwidth resource information of the multiple computing resources.

9. The method of claim 8, wherein determining the status information of the multiple computing resources on the basis of the resource information further comprises: regarding a given computing resource among the multiple computing resources, determining an importance of a corresponding indicator among the indicators for the given computing resource on the basis of the computing task; and determining status information of the given computing resource on the basis of the corresponding indicator and the importance of the corresponding indicator.

10. The method of claim 1, wherein the multiple computing resources are multiple graphics processing units.

11. An apparatus for processing a computing task, comprising: at least one processor; a volatile memory; and a memory coupled to the at least one processor and having instructions stored thereon, the instructions, when executed by the at least one processor, causing the apparatus to perform acts comprising: obtaining status information of multiple computing resources; in response to receiving a neural network model-based computing task, determining configuration information of multiple layers associated with the neural network model; obtaining parameter data associated with at least one part of the multiple layers on the basis of the configuration information; and based on the status information and the parameter data, selecting from the multiple computing resources a group of computing resources for processing the computing task.

12. The apparatus of claim 11, wherein selecting a group of computing resources on the basis of the status information comprises: regarding a first layer among the at least one part of the multiple layers, determining a first size of parameters associated with the first layer on the basis of the parameter data; based on the status information, selecting a first computing resource matching the first size from the multiple computing resources to process the parameters associated with the first layer.

13. The apparatus of claim 12, wherein selecting a group of computing resources on the basis of the status information comprises: determining corresponding sizes of parameters associated with the at least one part of the multiple layers on the basis of the parameter data; ranking the at least one part of the multiple layers on the basis of the corresponding sizes; and selecting the first layer on the basis of the rankings.

14. The apparatus of claim 12, wherein the acts further comprise: determining a first resource allocation required for processing the parameters associated with the first layer; and updating the status information on the basis of the first resource allocation.

15. The apparatus of claim 14, wherein the acts further comprise: regarding a second layer among the at least one part of the multiple layers, determining a second size of parameters associated with the second layer on the basis of the parameter data; and based on the updated status information, selecting a second computing resource matching the second size from the multiple computing resources to process the parameters associated with the second layer.

16. The apparatus of claim 15, wherein the acts further comprise: determining a number of computing resources required for processing the at least one part of the multiple layers; and in response to determining that the selected group of computing resources reach the number, selecting a computing resource from the selected group of computing resources to process parameters associated with a third layer among the at least one part of the multiple layers.

17. The apparatus of claim 11, wherein determining status information of the multiple computing resources comprises: monitoring resource information of the multiple computing resources; and determining status information of the multiple computing resources on the basis of the resource information.

18. The apparatus of claim 17, wherein the resource information of the multiple computing resources comprises at least any one of the following indicators: processing capacity information, memory resource information and bandwidth resource information of the multiple computing resources; and wherein determining the status in
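Claims 2 to 6 read like a greedy, layer-by-layer assignment, and claims 8 and 9 describe a status built from weighted indicators. The Python sketch below is one possible reading, not the method as claimed. It assumes that a resource "matching" a size is the one with the most free memory that still fits the layer, and that the status of a resource is a weighted sum of normalized indicators. The names `status_score`, `assign_layers` and `max_resources` are hypothetical.

```python
def status_score(indicators, weights):
    """Combine indicators (e.g. compute, memory, bandwidth) into one status
    value. Weights express the importance of each indicator for the task;
    a weighted sum is an assumption of this sketch."""
    return sum(weights[key] * indicators[key] for key in weights)


def assign_layers(layer_sizes, free_memory, max_resources):
    """Assign each layer's parameters to a resource.

    Layers are ranked by parameter size, largest first. After every
    assignment the free memory (the status) is updated. Once
    'max_resources' distinct resources are in use, later layers are
    placed only on resources already selected.
    """
    free = dict(free_memory)          # working copy of the status
    used = []                         # resources selected so far
    assignment = {}
    ranked = sorted(layer_sizes.items(), key=lambda kv: kv[1], reverse=True)
    for layer, size in ranked:
        candidates = used if len(used) >= max_resources else list(free)
        best = max(candidates, key=lambda name: free[name])
        if free[best] < size:
            raise RuntimeError(f"no resource can hold layer {layer!r}")
        assignment[layer] = best
        free[best] -= size            # update status after the allocation
        if best not in used:
            used.append(best)
    return assignment


# Hypothetical parameter sizes (bytes) and free memory per GPU.
layers = {"fc1": 300, "conv1": 100, "fc2": 200}
gpus = {"gpu0": 350, "gpu1": 320, "gpu2": 150}

plan = assign_layers(layers, gpus, max_resources=2)
print(plan)

score = status_score(
    {"compute": 0.5, "memory": 0.8, "bandwidth": 0.2},
    {"compute": 0.2, "memory": 0.7, "bandwidth": 0.1},
)
print(score)
```

In this run the cap of two resources keeps `conv1` off `gpu2`, even though `gpu2` has the most free memory at that point, because the group has already reached the required number.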

Assignees

Inventors

Classifications

  • Architecture, e.g. interconnection topology · CPC title

  • Learning methods · CPC title

  • the resource being a machine, e.g. CPUs, Servers, Terminals · CPC title

  • G06F9/5005 (Primary)

    to service a request · CPC title

  • Neural networks · CPC title

Patent family

Related publications grouped by family.

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2019324809A1 cover?
Implementations of the present disclosure relate to a method, apparatus and computer program product for processing a computing task. The method comprises: obtaining status information of multiple computing resources; in response to receiving a neural network model-based computing task, determining configuration information of multiple layers associated with the neural network model; obtaining …
Who is the assignee on this patent?
EMC IP Holding Co LLC
What technology area does this patent fall under?
Primary CPC classification G06F9/5005. Mapped technology areas include Physics.
When was this patent published?
Publication date Oct 24, 2019 (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).