What technology area does this patent fall under?

Primary CPC classification G06F9/5016. Mapped technology areas include Physics.

When was this patent published?

Publication date Thu Jul 18 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Method, device and computer program product for determining resource amount for dedicated processing resources

US2019220316A1 · US · A1

Patent metadata
Field	Value
Publication number	US-2019220316-A1
Application number	US-201916239803-A
Country	US
Kind code	A1
Filing date	Jan 4, 2019
Priority date	Jan 18, 2018
Publication date	Jul 18, 2019
Grant date	—

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Embodiments of the present disclosure relate to a method, device and computer program product for determining a resource amount of dedicated processing resources. The method comprises obtaining a structural representation of a neural network for deep learning processing, the structural representation indicating a layer attribute of the neural network that is associated with the dedicated processing resources; and determining the resource amount of the dedicated processing resources required for the deep learning processing based on the structural representation. In this manner, the resource amount of the dedicated processing resources required by the deep learning processing may be better estimated to improve the performance and resource utilization rate of the dedicated processing resource scheduling.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method of determining a resource amount of dedicated processing resources, comprising: obtaining a structural representation of a neural network for deep learning processing, the structural representation indicating a layer attribute of the neural network that is associated with the dedicated processing resources; and determining the resource amount of the dedicated processing resources required for the deep learning processing based on the structural representation. 2 . The method according to claim 1 , wherein obtaining the structural representation comprises: obtaining a specific structural representation of the neural network, the specific structural representation having a specific form of representation for a deep learning application performing the deep learning processing; and normalizing the specific structural representation as the structural representation. 3 . The method according to claim 2 , wherein normalizing the specific structural representation as the structural representation comprises: determining a layer attribute associated with a neural network layer of the neural network based on the specific structural representation; and normalizing the specific structural representation as the structural representation based on the layer attribute. 4 . The method according to claim 3 , wherein determining the layer attribute comprises determining at least one of: an identifier of the neural network layer; a type of the neural network layer; an upstream neural network layer of the neural network layer; a downstream neural network layer of the neural network layer; and a configurable attribute of the neural network layer. 5 . The method according to claim 1 , wherein obtaining the structural representation comprises: obtaining a file containing the structural representation; and parsing the file to obtain the structural representation. 6 . The method according to claim 1 , wherein obtaining the structural representation comprises: requesting the structural representation from a deep learning application performing the deep learning processing; and in response to receiving a response from the deep learning application, obtaining the structural representation from the response. 7 . The method according to claim 1 , wherein determining the resource amount comprises: determining at least one of a memory resource amount and a computing resource amount of the dedicated processing resources required for the deep learning processing. 8 . The method according to claim 7 , wherein determining the memory resource amount comprises: determining a layer memory resource amount for each neural network layer of the neural network based on the structural representation, the layer memory resource amount indicating a memory of the dedicated processing resources required for a corresponding neural network layer; and determining the memory resource amount based on a sum of the layer memory resource amounts. 9 . The method according to claim 7 , wherein determining the computing resource amount comprises: determining a layer computing resource amount for each neural network layer of the neural network based on the structural representation, the layer computing resource amount indicating a computing capability of the dedicated processing resources required for a corresponding neural network layer; selecting, from the layer computing resource amounts, a target layer computing resource amount requiring a computing capability above a predetermined threshold; and determining the computing resource amount based on the target layer computing resource amount. 10 . A device for determining a resource amount of dedicated processing resources, comprising: at least one processing unit; and at least one memory coupled to the at least one processing unit and storing instructions executed by the at least one processing unit, the instructions, when executed by the at least one processing unit, causing the device to implement acts, comprising: obtaining a structural representation of a neural network for deep learning processing, the structural representation indicating a layer attribute of the neural network that is associated with the dedicated processing resources; and determining the resource amount of the dedicated processing resources required for the deep learning processing based on the structural representation. 11 . The device according to claim 10 , wherein obtaining the structural representation comprises: obtaining a specific structural representation of the neural network, the specific structural representation having a specific form of representation for a deep learning application performing the deep learning processing; and normalizing the specific structural representation as the structural representation. 12 . The device according to claim 11 , wherein normalizing the specific structural representation as the structural representation comprises: determining a layer attribute associated with a neural network layer of the neural network based on the specific structural representation; and normalizing the specific structural representation as the structural representation based on the layer attribute. 13 . The device according to claim 12 , wherein determining the layer attribute comprises determining at least one of: an identifier of the neural network layer; a type of the neural network layer; an upstream neural network layer of the neural network layer; a downstream neural network layer of the neural network layer; and a configurable attribute of the neural network layer. 14 . The device according to claim 10 , wherein obtaining the structural representation comprises: obtaining a file containing the structural representation; and parsing the file to obtain the structural representation. 15 . The device according to claim 10 , wherein obtaining the structural representation comprises: requesting the structural representation from a deep learning application performing the deep learning processing; and in response to receiving a response from the deep learning application, obtaining the structural representation from the response. 16 . The device according to claim 10 , wherein determining the resource amount comprises: determining at least one of a memory resource amount and a computing resource amount of the dedicated processing resources required for the deep learning processing. 17 . The device according to claim 16 , wherein determining the memory resource amount comprises: determining a layer memory resource amount for each neural network layer of the neural network based on the structural representation, the layer memory resource amount indicating a memory of the dedicated processing resources required for a corresponding neural network layer; and determining the memory resource amount based on a sum of the layer memory resource amounts. 18 . The device according to claim 16 , wherein determining the computing resource amount comprises: determining a layer computing resource amount for each neural network layer of the neural network based on the structural representation, the layer computing resource amount indicating a computing capability of the dedicated processing resources required for a corresponding neural network layer; selecting, from the layer computing resource amounts, a target layer computing resource amount requiring a computing capability above a predetermined threshold; and determining the computing resource amo

Assignees

Dell Products Lp

Inventors

Classifications

G06N3/063
using electronic means · CPC title
G06F9/5027
the resource being a machine, e.g. CPUs, Servers, Terminals · CPC title
G06N3/04
Architecture, e.g. interconnection topology · CPC title
G06N3/084
Backpropagation, e.g. using gradient descent · CPC title
G06N3/10
Interfaces, programming languages or software development kits, e.g. for simulating neural networks · CPC title

Patent family

Related publications grouped by family.

View patent family 67213958

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2019220316A1 cover?: Embodiments of the present disclosure relate to a method, device and computer program product for determining a resource amount of dedicated processing resources. The method comprises obtaining a structural representation of a neural network for deep learning processing, the structural representation indicating a layer attribute of the neural network that is associated with the dedicated proces…
Who is the assignee on this patent?: Dell Products Lp
What technology area does this patent fall under?: Primary CPC classification G06F9/5016. Mapped technology areas include Physics.
When was this patent published?: Publication date Thu Jul 18 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).