Managing artificial intelligence model partitions for edge computing environment

US2022012607A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2022012607-A1
Application numberUS-202016925864-A
CountryUS
Kind codeA1
Filing dateJul 10, 2020
Priority dateJul 10, 2020
Publication dateJan 13, 2022
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Techniques are disclosed for managing artificial intelligence model partitions for execution in an information processing system with edge computing resources. For example, a method comprises the following steps. An intermediate representation of an artificial intelligence model is obtained. A computation graph is generated based on the intermediate representation. The computation graph is partitioned into a set of partitions. The method then schedules the set of partitions for respective execution on a set of computing devices in an edge computing environment, and causes deployment of the set of partitions respectively to the set of computing devices for execution in the edge computing environment.

First claim

Opening claim text (preview).

What is claimed is: 1 . An apparatus, comprising: a processor; and a memory storing computer program instructions, the processor executing the computer program instructions in the memory to control the apparatus to: obtain an intermediate representation of an artificial intelligence model; obtain a computation graph generated based on the intermediate representation; partition the computation graph into a set of partitions; schedule the set of partitions for respective execution on a set of computing devices in an edge computing environment; and cause deployment of the set of partitions respectively to the set of computing devices for execution in the edge computing environment. 2 . The apparatus of claim 1 , wherein causing deployment of the set of partitions to the set of computing devices further comprises a model scheduler dispatching a set of partition executers for execution to the set of computing devices in the edge computing environment. 3 . The apparatus of claim 2 , wherein the model scheduler is configured as a remote procedure call server and the set of partition executers are configured as remote procedure call clients. 4 . The apparatus of claim 1 , wherein scheduling the set of partitions further comprises obtaining resource requirements for execution of the set of partitions in order to generate a scheduling plan. 5 . The apparatus of claim 1 , wherein the processor further executes the computer program instructions in the memory to control the apparatus to compile each of the set of partitions to generate a runtime instance specific to the type of computing device on which each partition is to be executed. 6 . The apparatus of claim 1 , wherein the obtaining, partitioning, scheduling and causing deployment operations are performed in accordance with a model parallelism manager layer in coordination with an application manager layer interposed between the model parallelism manager layer and the edge computing environment. 7 . The apparatus of claim 1 , wherein the processor further executes the computer program instructions in the memory to control the apparatus to re-schedule and cause re-deployment of one or more of the set of partitions when one or more of the computing devices in the edge computing environment change geographic locations. 8 . A method, comprising: obtaining an intermediate representation of an artificial intelligence model; obtaining a computation graph generated based on the intermediate representation; partitioning the computation graph into a set of partitions; scheduling the set of partitions for respective execution on a set of computing devices in an edge computing environment; and causing deployment of the set of partitions respectively to the set of computing devices for execution in the edge computing environment; wherein the steps are performed by one or more processing devices. 9 . The method of claim 8 , wherein causing deployment of the set of partitions to the set of computing devices further comprises dispatching, via a model scheduler, a set of partition executers for execution to the set of computing devices in the edge computing environment. 10 . The method of claim 9 , wherein the model scheduler is configured as a remote procedure call server and the set of partition executers are configured as remote procedure call clients. 11 . The method of claim 8 , wherein scheduling the set of partitions further comprises obtaining resource requirements for execution of the set of partitions in order to generate a scheduling plan. 12 . The method of claim 8 , further comprising compiling each of the set of partitions to generate a runtime instance specific to the type of computing device on which each partition is to be executed. 13 . The method of claim 8 , wherein the obtaining, partitioning, scheduling and causing deployment steps are performed in accordance with a model parallelism manager layer in coordination with an application manager layer interposed between the model parallelism manager layer and the edge computing environment. 14 . The method of claim 8 , further comprising re-scheduling and causing re-deployment of one or more of the set of partitions when one or more of the computing devices in the edge computing environment change geographic locations. 15 . A computer program product stored on a non-transitory computer-readable medium and comprising machine executable instructions, the machine executable instructions, when executed, causing a processing device to: obtain an intermediate representation of an artificial intelligence model; obtain a computation graph generated based on the intermediate representation; partition the computation graph into a set of partitions; schedule the set of partitions for respective execution on a set of computing devices in an edge computing environment; and cause deployment of the set of partitions respectively to the set of computing devices for execution in the edge computing environment. 16 . The computer program product of claim 15 , wherein causing deployment of the set of partitions to the set of computing devices further comprises a model scheduler dispatching a set of partition executers for execution to the set of computing devices in the edge computing environment, and wherein the model scheduler is configured as a remote procedure call server and the set of partition executers are configured as remote procedure call clients. 17 . The computer program product of claim 15 , wherein scheduling the set of partitions further comprises obtaining resource requirements for execution of the set of partitions in order to generate a scheduling plan. 18 . The computer program product of claim 15 , wherein the machine executable instructions, when executed, further cause a processing device to compile each of the set of partitions to generate a runtime instance specific to the type of computing device on which each partition is to be executed. 19 . The computer program product of claim 15 , wherein the obtaining, partitioning, scheduling and causing deployment steps are performed in accordance with a model parallelism manager layer in coordination with an application manager layer interposed between the model parallelism manager layer and the edge computing environment. 20 . The computer program product of claim 15 , wherein the machine executable instructions, when executed, further cause a processing device to re-schedule and cause re-deployment of one or more of the set of partitions when one or more of the computing devices in the edge computing environment change geographic locations.

Assignees

Inventors

Classifications

  • considering hardware capabilities · CPC title

  • considering the execution order of a plurality of tasks, e.g. taking priority or time dependency constraints into consideration (scheduling strategies G06F9/4881 and subgroups) · CPC title

  • G06N5/022Primary

    Knowledge engineering; Knowledge acquisition · CPC title

  • Machine learning · CPC title

  • Neural networks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2022012607A1 cover?
Techniques are disclosed for managing artificial intelligence model partitions for execution in an information processing system with edge computing resources. For example, a method comprises the following steps. An intermediate representation of an artificial intelligence model is obtained. A computation graph is generated based on the intermediate representation. The computation graph is part…
Who is the assignee on this patent?
Emc Ip Holding Co Llc
What technology area does this patent fall under?
Primary CPC classification G06N5/022. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Jan 13 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).