System and method for learning the structure of deep convolutional neural networks

US11010658B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11010658-B2
Application numberUS-201715853403-A
CountryUS
Kind codeB2
Filing dateDec 22, 2017
Priority dateDec 22, 2017
Publication dateMay 18, 2021
Grant dateMay 18, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A recursive method and apparatus produce a deep convolution neural network (CNN). The method iteratively processes an input directed acyclic graph (DAG) representing an initial CNN, a set of nodes, a set of exogenous nodes, and a resolution based on the CNN. An iteration for a node may include recursively performing the iteration upon each node in a descendant node set to create a descendant DAG, and upon each node in ancestor node sets to create ancestor DAGs, the ancestor node sets being a remainder of nodes in the temporary DAG after removing nodes of the descendent node set. The descendant and ancestor DAGs are merged, and a latent layer is created that includes a latent node for each ancestor node set. Each latent node is set to be a parent of sets of parentless nodes in a combined descendant DAG and ancestors DAGs before returning.

First claim

Opening claim text (preview).

What is claimed is: 1. An apparatus for producing a deep convolution neural network (CNN), the apparatus comprising: memory and processing circuitry coupled to the memory, the processing circuitry to: iteratively process an input directed acyclic graph (DAG), a set of nodes, a set of exogenous nodes, and a resolution based on the CNN, an iteration for a node in the set of nodes of the CNN including the processing circuitry to further: increase a resolution of the DAG by creating a temporary DAG used for a current recursive iteration based on the DAG and the resolution; recursively perform the iteration upon each node in a descendant node set to create a descendant DAG, the descendent node set being autonomous nodes having a lowest topological order; recursively perform the iteration upon each node in ancestor node sets to create ancestor DAGs, the ancestor node sets being the remainder of nodes in the temporary DAG after nodes of the descendent node set are removed, wherein the descendant node set is a common child of the ancestor node sets; merge the descendant DAG and the ancestor DAGs into a processed DAG; create a latent layer in the processed DAG that includes a latent node for each ancestor node set for the current resolution; set each latent node to be a parent of sets of parentless nodes in a combined descendant DAG and ancestors DAGs; and return the processed DAG. 2. The apparatus of claim 1 , wherein the increasing of the resolution of the DAG comprises having the processing circuitry to further: for each pair of connected nodes in the temporary DAG: disconnect a pair of the connected nodes when the connected nodes are independent of one another; and direct node edges of the disconnected nodes to a common neighbor node when the common neighbor node is not independent of the disconnected nodes. 3. The apparatus of claim 2 , wherein the determination of when the pair of the connected nodes are independent is done by a statistical test and relates to a statistical dependency of node activations. 4. The apparatus of claim 3 , wherein the statistical test is a conditional independence (CI) test (CIT). 5. The apparatus of claim 4 , wherein the CIT is a partial correlation test or a conditional mutual information test. 6. The apparatus of claim 5 , wherein the CIT is a binary test yielding a binary value of zero or one of a form: CI ⁡ ( W X ⁡ ( i ) , W Y ⁡ ( i ) | W Z ⁡ ( i ) ) = CI ⁡ ( 𝒟 X ⁡ ( i ) , 𝒟 Y ⁡ ( i ) | 𝒟 Z ⁡ ( i ) ) = meas ⁡ ( 𝒟 X ⁡ ( i ) , 𝒟 Y ⁡ ( i ) | 𝒟 Z ⁡ ( i ) ) > γ where: W X (i), W Y (i), and W Z (i) are activations selected by a window from feature maps X, Y, and Z at location i; X (i) and Y (i)) are input variables; Z (i) is a condition set of variables; meas( ) evaluates a level of correlation; and γ is a

Assignees

Inventors

Classifications

  • Probabilistic graphical models, e.g. probabilistic networks · CPC title

  • G06N3/045Primary

    Combinations of networks · CPC title

  • modifying the architecture, e.g. adding, deleting or silencing nodes or connections · CPC title

  • G06N3/0464Primary

    Convolutional networks [CNN, ConvNet] · CPC title

  • Weakly supervised learning, e.g. semi-supervised or self-supervised learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11010658B2 cover?
A recursive method and apparatus produce a deep convolution neural network (CNN). The method iteratively processes an input directed acyclic graph (DAG) representing an initial CNN, a set of nodes, a set of exogenous nodes, and a resolution based on the CNN. An iteration for a node may include recursively performing the iteration upon each node in a descendant node set to create a descendant DA…
Who is the assignee on this patent?
Intel Corp
What technology area does this patent fall under?
Primary CPC classification G06N3/045. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 18 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).