Method, device and computer program product for processing machine learning model

US2020334544A1 · US · A1

Patent metadata
Field               Value
Publication number  US-2020334544-A1
Application number  US-201916542757-A
Country             US
Kind code           A1
Filing date         Aug 16, 2019
Priority date       Apr 19, 2019
Publication date    Oct 22, 2020
Grant date          (not listed)

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method comprises obtaining an intermediate representation of a machine learning model written in a source language, the intermediate representation being independent of the source language and a target language and comprising a computation graph described by a structured text, a node in the computation graph representing a function associated with the machine learning model. The method comprises sending the intermediate representation to a scheduler to obtain indication information related to a plurality of dedicated processing resources for executing the machine learning model. The method further comprises generating a plurality of runtime libraries corresponding to the plurality of dedicated processing resources to process data related to the machine learning model based on the intermediate representation and the indication information, a runtime library comprising functions represented in the target language. General applicability of the compiler is increased, and assignment of the machine learning model on different dedicated processing resources is facilitated.
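The flow described in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the JSON layout of the intermediate representation, the round-robin `schedule` stand-in, the resource identifiers and types (`res0`/`gpu`, `res1`/`fpga`), and the naming of the generated function stubs. It only shows the three steps in order: parse a language-independent computation graph, ask a scheduler for indication information, and emit one runtime library per dedicated processing resource.

```python
import json

# Hypothetical intermediate representation: a computation graph described by
# structured text (JSON is an assumption). Each node stands for a function of
# the model; "inputs" records dependencies between functions.
IR_TEXT = """
{
  "nodes": [
    {"name": "conv1", "op": "conv2d", "inputs": []},
    {"name": "relu1", "op": "relu",   "inputs": ["conv1"]},
    {"name": "fc1",   "op": "dense",  "inputs": ["relu1"]}
  ]
}
"""


def schedule(ir):
    """Stand-in scheduler (hypothetical): returns indication information,
    here the resource types plus a round-robin node placement."""
    resources = {"res0": "gpu", "res1": "fpga"}
    ids = sorted(resources)
    placement = {
        node["name"]: ids[i % len(ids)] for i, node in enumerate(ir["nodes"])
    }
    return {"resources": resources, "placement": placement}


def generate_runtime_libraries(ir, indication):
    """Build one 'runtime library' per resource: a list of target-specific
    function names chosen from the node's op and the resource type."""
    libraries = {rid: [] for rid in indication["resources"]}
    for node in ir["nodes"]:
        rid = indication["placement"][node["name"]]
        rtype = indication["resources"][rid]
        libraries[rid].append(f"{node['op']}_{rtype}")
    return libraries


ir = json.loads(IR_TEXT)
indication = schedule(ir)
libraries = generate_runtime_libraries(ir, indication)
print(libraries)
```

In this sketch the library for a resource depends only on the graph and the resource type reported by the scheduler, which mirrors the role the abstract gives to the indication information. A real compiler back end would emit compiled code rather than function names.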

First claim

Opening claim text (preview).

What is claimed is:

1. A method of processing a machine learning model, comprising: obtaining an intermediate representation of a machine learning model written in a source language, the intermediate representation being independent of the source language and a target language and comprising a computation graph described by a structured text, a node in the computation graph representing a function associated with the machine learning model; sending the intermediate representation to a scheduler to obtain indication information related to a plurality of dedicated processing resources for executing the machine learning model; and generating a plurality of runtime libraries corresponding to the plurality of dedicated processing resources to process data related to the machine learning model based on the intermediate representation and the indication information, a runtime library comprising functions represented in the target language.

2. The method according to claim 1, wherein the indication information comprises information related to types of the plurality of dedicated processing resources, and wherein generating the plurality of runtime libraries corresponding to the plurality of dedicated processing resources comprises: determining the runtime library corresponding to the type of the dedicated processing resource based on the intermediate representation and the type of the dedicated processing resource.

3. The method according to claim 1, wherein the computation graph further comprises dependencies between the functions.

4. A computer program product being tangibly stored on a non-transient computer readable medium and comprising machine executable instructions which, when executed, causing a machine to perform steps of the method according to claim 1.

5. An electronic device for processing a machine learning model, comprising: a processor; and a memory storing computer program instructions, the processor running the computer program instructions in the memory to control the electronic device to perform acts, comprising: obtaining an intermediate representation of a machine learning model written in a source language, the intermediate representation being independent of the source language and a target language and comprising a computation graph described by a structured text, a node in the computation graph representing a function associated with the machine learning model; sending the intermediate representation to a scheduler to obtain indication information related to a plurality of dedicated processing resources for executing the machine learning model; and generating a plurality of runtime libraries corresponding to the plurality of dedicated processing resources to process data related to the machine learning model based on the intermediate representation and the indication information, a runtime library comprising functions represented in the target language.

6. The electronic device according to claim 5, wherein the indication information comprises information related to types of the plurality of dedicated processing resources, and wherein generating plurality of runtime libraries corresponding to the plurality of dedicated processing resources comprises: determining the runtime library corresponding to the type of the dedicated processing resource based on the intermediate representation and the type of the dedicated processing resource.

7. The electronic device according to claim 5, wherein the computation graph further comprises dependencies between the functions.

8. A method of executing a machine learning model, comprising: receiving, at a first device, data to be processed by the machine learning model; sending the received data to a first dedicated processing resource of the first device, so that the first dedicated processing resource processes the data by executing a first group of functions among a plurality of functions related to the machine learning model, the first group of functions being comprised in a first runtime library accessible to the first device; and sending the data which have been processed by the first dedicated processing resource to a second device for processing.

9. The method according to claim 8, wherein sending the received data to the first dedicated processing resource of the first device comprises: determining whether first indication information indicating completing the receiving of the data is received; and in response to determining that the first indication information is received, sending the received data to a first dedicated processing resource of the first device.

10. The method according to claim 8, wherein sending the received data to the first dedicated processing resource of the first device comprises: sending the received data to the first dedicated processing resource; and sending, to the first dedicated processing resource, second indication information related to the first group of functions, so that the first dedicated processing resource processes the data by executing the first group of functions.

11. The method according to claim 8, wherein receiving the data comprises: receiving the data from a third device, the data being determined by a second dedicated processing resource of the third device for executing a second group of functions among the plurality of functions, the second group of functions being comprised in a second runtime library accessible to the third device.

12. The method according to claim 8, wherein receiving the data comprises: allocating a storage resource for storing the data; and storing the received data in the storage resource.

13. The method according to claim 8, wherein sending the data which have been processed by the first dedicated processing resource to the second device for processing comprises: obtaining the processed data from the first dedicated processing resource; storing the processed data in the storage resource; sending the processed data to a second device; and in response to completing the sending of the processed data, sending, to the second device, second indication information indicating the completion.

14. A computer program product being tangibly stored on a non-transient computer readable medium and comprising machine executable instructions which, when executed, causing a machine to perform steps of the method according to claim 8.

15. An electronic device for executing a machine learning model, comprising: a processor; and a memory storing computer program instructions, the processor running the computer program instructions in the memory to control the electronic device to perform steps according to claim 8.

16. The electronic device according to claim 15, wherein sending the received data to the first dedicated processing resource of the first device comprises: determining whether first indication information indicating completing the receiving of the data is received; and in response to determining that the first indication information is received, sending the received data to a first dedicated processing resource of the first device.

17. The electronic device according to claim 15, wherein sending the received data to the first dedicated processing resource of the first device comprises: sending the received data to the first dedicated processing resource; and sending, to the first dedicated processing resource, second indication information related to the first group of functions, so that the first dedicated processing resource processes the data by
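
The execution method of claims 8 to 13 can be pictured as a chain of devices, each running its own group of functions and forwarding the result. The Python sketch below is only an illustration under stated assumptions: the `Device` class, its method names, the in-memory list used as the storage resource, and the toy functions `scale`, `shift`, and `square` are all hypothetical, and a plain method call stands in for sending data and completion indications between devices.

```python
class Device:
    """Hypothetical device holding a runtime library and a group of functions."""

    def __init__(self, name, runtime_library, group, next_device=None):
        self.name = name
        self.runtime_library = runtime_library  # function name -> callable
        self.group = group                      # functions this device executes
        self.next_device = next_device          # the "second device", if any
        self.buffer = []                        # storage resource for received data
        self.output = None                      # final result on the last device

    def receive(self, item):
        # Store received data in the allocated storage resource.
        self.buffer.append(item)

    def receive_complete(self):
        # Completion indication: only now is the stored data handed to the
        # dedicated processing resource.
        data = list(self.buffer)
        self.buffer.clear()
        processed = self._run_on_resource(data)
        if self.next_device is None:
            self.output = processed
            return
        # Forward the processed data, then signal that sending is complete.
        for item in processed:
            self.next_device.receive(item)
        self.next_device.receive_complete()

    def _run_on_resource(self, data):
        # Stand-in for the dedicated processing resource: apply this device's
        # group of functions, in order, to every item.
        for fn_name in self.group:
            fn = self.runtime_library[fn_name]
            data = [fn(x) for x in data]
        return data


library = {
    "scale": lambda x: x * 2,
    "shift": lambda x: x + 1,
    "square": lambda x: x * x,
}
second = Device("second", library, ["square"])
first = Device("first", library, ["scale", "shift"], next_device=second)

first.receive(1)
first.receive(2)
first.receive_complete()
print(second.output)  # [9, 25]
```

Here both devices share one library object for brevity. In the claims each device has access to its own runtime library containing its group of functions.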

Assignees

Inventors

Classifications

  • Convolutional networks [CNN, ConvNet] · CPC title

  • G06N20/00 (Primary)

    Machine learning · CPC title

  • G06F8/443 (Primary)

    Optimisation · CPC title

  • Scheduling strategies for dispatcher, e.g. round robin, multi-level priority queues · CPC title

  • Architecture, e.g. interconnection topology · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2020334544A1 cover?
A method comprises obtaining an intermediate representation of a machine learning model written in a source language, the intermediate representation being independent of the source language and a target language and comprising a computation graph described by a structured text, a node in the computation graph representing a function associated with the machine learning model. The method compri…
Who is the assignee on this patent?
EMC IP Holding Co LLC
What technology area does this patent fall under?
Primary CPC classification G06N20/00. Mapped technology areas include Physics.
When was this patent published?
Publication date: Oct 22, 2020 (kind code A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).