Deep neural network perforance analysis on shared memory accelerator systems

US2019080232A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2019080232-A1
Application numberUS-201715699320-A
CountryUS
Kind codeA1
Filing dateSep 8, 2017
Priority dateSep 8, 2017
Publication dateMar 14, 2019
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A Deep Neural Networks (DNN) analysis method, system, and computer program product include characterizing a space of possible configurations for a DNN, evaluating a metric-of-interest for a configuration of the possible configurations, and searching the space to identify a configuration of the possible configurations that maximizes the metric-of-interest.

First claim

Opening claim text (preview).

What is claimed is: 1 . A computer-implemented Deep Neural Networks analysis method, the method comprising: characterizing a space of possible configurations for a deep neural network (DNN); evaluating a metric-of-interest for a configuration of the possible configurations; and searching the space to identify a configuration of the possible configurations that maximizes the metric-of-interest. 2 . The computer-implemented method of claim 1 , wherein the characterizing characterizes the space based on an input description of the DNN. 3 . The computer-implemented method of claim 1 , wherein the DNN comprises a sequence of convolutional and fully connected layers whose parameters are represented using a tuple <in,out,ij,mb,kij>. 4 . The computer-implemented method of claim 1 , wherein, for each combination of spatial work division across cores of the computer, the configuration with a best record is used, and passed on to a next stage. 5 . The computer-implemented method of claim 1 , wherein the possible configurations comprise at least one of a computation configuration and a data-partitioning configuration. 6 . The computer-implemented method of claim 1 , wherein the evaluating evaluates the metric-of-interest based on a predetermined hardware specification. 7 . The computer-implemented method of claim 6 , wherein the predetermined hardware specification comprises: a plurality of processor cores; a plurality of memory elements; and a plurality of data-links. 8 . The computer-implemented method of claim 7 , further comprising periodically sending control information and receiving status updates to and from the cores and a memory enabling the system to realize all computations in the DNN. 9 . The computer-implemented method of claim 1 , embodied in a cloud-computing environment. 10 . A computer program product for Deep Neural Networks analysis, the computer program product comprising a computer-readable storage medium having program instructions embodied therewith, the program instructions executable by a computer to cause the computer to perform: characterizing a space of possible configurations for a deep neural network (DNN); evaluating a metric-of-interest for a configuration of the possible configurations; and searching the space to identify a configuration of the possible configurations that maximizes the metric-of-interest. 11 . The computer program product of claim 10 , wherein the characterizing characterizes the space based on an input description of the DNN. 12 . The computer program product of claim 10 , wherein the DNN comprises a sequence of convolutional and fully connected layers whose parameters are represented using a tuple <in,out,ij,mb,kij>. 13 . The computer program product of claim 10 , wherein, for each combination of spatial work division across cores, the configuration with a best record is used, and passed on to a next stage. 14 . The computer program product of claim 10 , wherein the possible configurations comprise at least one of a computation configuration and a data-partitioning configuration. 15 . The computer program product of claim 10 , wherein the evaluating evaluates the metric-of-interest based on a predetermined hardware specification. 16 . The computer program product of claim 15 , wherein the predetermined hardware specification comprises: a plurality of processor cores; a plurality of memory elements; and a plurality of data-links. 17 . The computer program product of claim 16 , wherein the computer program product further stores instructions to cause the computer to perform: periodically sending control information and receiving status updates to and from the cores and a memory enabling the system to realize all computations in the DNN. 18 . A Deep Neural Networks analysis system, said system comprising: a processor; and a memory, the memory storing instructions to cause the processor to perform: characterizing a space of possible configurations for a deep neural network (DNN); evaluating a metric-of-interest for a configuration of the possible configurations; and searching the space to identify a configuration of the possible configurations that maximizes the metric-of-interest. 19 . The system of claim 18 , wherein the characterizing characterizes the space based on an input description of the DNN. 20 . The system of claim 18 , embodied in a cloud-computing environment.

Assignees

Inventors

Classifications

  • Activation functions · CPC title

  • Combinations of networks · CPC title

  • G06N3/084Primary

    Backpropagation, e.g. using gradient descent · CPC title

  • Shells for specifying net layout · CPC title

  • using electronic means · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2019080232A1 cover?
A Deep Neural Networks (DNN) analysis method, system, and computer program product include characterizing a space of possible configurations for a DNN, evaluating a metric-of-interest for a configuration of the possible configurations, and searching the space to identify a configuration of the possible configurations that maximizes the metric-of-interest.
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06N3/084. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Mar 14 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).