Reducing computations in a neural network

US10460230B2 · US · B2

Patent metadata
Field               Value
Publication number  US-10460230-B2
Application number  US-201615014686-A
Country             US
Kind code           B2
Filing date         Feb 3, 2016
Priority date       Jun 4, 2015
Publication date    Oct 29, 2019
Grant date          Oct 29, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Reducing computations in a neural network may include determining a group including a plurality of convolution kernels of a convolution stage of a neural network. The convolution kernels of the group are similar to one another. A base convolution kernel for the group may be determined. Scaling factors for a plurality of input feature maps processed by the group may be calculated. The convolution stage of the neural network may be modified to calculate a composite input feature map using the scaling factors and apply the base convolution kernel to the composite input feature map.
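
The saving described in the abstract rests on the linearity of convolution: if every kernel in a group is a scaled copy of one base kernel, then summing the per-map convolutions gives the same result as scaling the input feature maps, summing them into one composite map, and convolving once. The following is a minimal NumPy sketch of that identity, not the patent's implementation. The function names (`conv2d_valid`, `original_stage`, `modified_stage`), the 3x3 kernel size, and the assumption that the kernels are exact multiples of the base kernel are all illustrative choices made here.

```python
import numpy as np

def conv2d_valid(x, k):
    """Plain 'valid' 2-D cross-correlation of one feature map with one kernel."""
    kh, kw = k.shape
    oh, ow = x.shape[0] - kh + 1, x.shape[1] - kw + 1
    out = np.zeros((oh, ow))
    for i in range(kh):
        for j in range(kw):
            out += k[i, j] * x[i:i + oh, j:j + ow]
    return out

def original_stage(feature_maps, kernels):
    """Unmodified stage: one convolution per input feature map, then a sum."""
    return sum(conv2d_valid(x, k) for x, k in zip(feature_maps, kernels))

def modified_stage(feature_maps, base_kernel, scales):
    """Modified stage: scale and sum the inputs, then convolve once."""
    composite = sum(s * x for s, x in zip(scales, feature_maps))
    return conv2d_valid(composite, base_kernel)

# Hypothetical data: three kernels that are exact multiples of one base kernel.
rng = np.random.default_rng(0)
base_kernel = rng.normal(size=(3, 3))
scales = np.array([1.0, 0.5, -2.0])
kernels = [s * base_kernel for s in scales]
feature_maps = [rng.normal(size=(8, 8)) for _ in scales]

y_ref = original_stage(feature_maps, kernels)               # 3 convolutions
y_fast = modified_stage(feature_maps, base_kernel, scales)  # 1 convolution
print(np.allclose(y_ref, y_fast))
```

With exactly proportional kernels the two outputs agree up to floating-point rounding. When the kernels are only approximately proportional, the modified stage is an approximation, which is presumably why the claims also mention retraining the modified network.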

First claim

Opening claim text (preview).

What is claimed is:

1. A method, comprising: determining, using a processor, a group comprising a plurality of convolution kernels of a convolution stage of a neural network in which the convolution kernels of the group are similar to one another, the determining of the group by: determining a similarity metric between a first convolution kernel and a second convolution kernel of the plurality of convolution kernels by calculating an element-wise ratio of the first convolution kernel to the second convolution kernel and calculating a standard deviation for the element-wise ratio of the first convolution kernel and the second convolution kernel; and including the first convolution kernel and the second convolution kernel in the group responsive to determining that the similarity metric meets a similarity criterion; determining a base convolution kernel for the group; determining scaling factors based on the base convolution kernel for the group for a plurality of input feature maps processed by the group, each input feature map corresponding to a scaling factor and a convolution kernel; and modifying the convolution stage of the neural network so that, when the neural network is executed, the convolution stage of the neural network performs: scaling each of a plurality of input feature maps using the scaling factors, generating a composite input feature map as a sum of the scaled input feature maps, applying the base convolution kernel to the composite input feature map, and summing the composite input feature map with any of a second plurality of input feature maps which are processed by a convolution kernel other than the base convolution kernel.

2. The method of claim 1, wherein determining the group of convolution kernels further comprises: selecting a convolution kernel from among a plurality of convolution kernels that operate on the feature map as a primary convolution kernel; and calculating an element-wise ratio of the primary convolution kernel to other convolution kernels of the plurality of convolution kernels not yet assigned to a group.

3. The method of claim 1, wherein determining the base convolution kernel for the group comprises: selecting a convolution kernel of the group as the base convolution kernel according to a base selection metric.

4. The method of claim 1, wherein determining the base convolution kernel for the group comprises: determining the base convolution kernel as a function of the convolution kernels of the group.

5. The method of claim 1, further comprising: retraining the neural network comprising the modified convolution stage without permitting changes to membership in the group of convolution kernels.

6. The method of claim 1, further comprising: executing the neural network comprising the modified convolution stage.

7. A system, comprising: a processor programmed to initiate executable operations comprising: determining a group comprising a plurality of convolution kernels of a convolution stage of a neural network in which the convolution kernels of the group are similar to one another, the determining of the group by: determining a similarity metric between a first convolution kernel and a second convolution kernel of the plurality of convolution kernels by calculating an element-wise ratio of the first convolution kernel to the second convolution kernel and calculating a standard deviation for the element-wise ratio of the first convolution kernel and the second convolution kernel; and including the first convolution kernel and the second convolution kernel in the group responsive to determining that the similarity metric meets a similarity criterion; determining a base convolution kernel for the group; determining scaling factors based on the base convolution kernel for the group for a plurality of input feature maps processed by the group, each input feature map corresponding to a scaling factor and a convolution kernel; and modifying the convolution stage of the neural network so that, when the neural network is executed, the convolution stage of the neural network performs: scaling each of a plurality of input feature maps using the scaling factors, generating a composite input feature map as a sum of the scaled input feature maps, applying the base convolution kernel to the composite input feature map, and summing the composite input feature map with any of a second plurality of input feature maps which are processed by a convolution kernel other than the base convolution kernel.

8. The system of claim 7, wherein determining the group of convolution kernels further comprises: selecting a convolution kernel from among a plurality of convolution kernels that operate on the feature map as a primary convolution kernel; and calculating an element-wise ratio of the primary convolution kernel to other convolution kernels of the plurality of convolution kernels not yet assigned to a group.

9. The system of claim 7, wherein determining the base convolution kernel for the group comprises: selecting a convolution kernel of the group as the base convolution kernel according to a base selection metric.

10. The system of claim 7, wherein determining the base convolution kernel for the group comprises: determining the base convolution kernel as a function of the convolution kernels of the group.

11. The system of claim 7, wherein the processor is further programmed to initiate executable operations comprising: retraining the neural network comprising the modified convolution stage without permitting changes to membership in the group of convolution kernels.

12. A non-transitory computer-readable storage medium having instructions stored thereon which, when executed by a processor, perform a method comprising: determining, using the processor, a group comprising a plurality of convolution kernels of a convolution stage of a neural network in which the convolution kernels of the group are similar to one another, the determining of the group by: determining a similarity metric between a first convolution kernel and a second convolution kernel of the plurality of convolution kernels by calculating an element-wise ratio of the first convolution kernel to the second convolution kernel and calculating a standard deviation for the element-wise ratio of the first convolution kernel and the second convolution kernel; and including the first convolution kernel and the second convolution kernel in the group responsive to determining that the similarity metric meets a similarity criterion; determining a base convolution kernel for the group; determining scaling factors based on the base convolution kernel for the group for a plurality of input feature maps processed by the group, each input feature map corresponding to a scaling factor and a convolution kernel; and modifying the convolution stage of the neural network so that, when the neural network is executed, the convolution stage of the neural network performs: scaling each of a plurality of input feature maps using the scaling factors, generating a composite input feature map as a sum of the scaled input feature maps, applying the base convolution kernel to the composite input feature map, and summing the composite input feature map with any of a second plurality of input feature maps which are processed by a convolution kernel other than the base convolution kernel.

13. The non-transitory computer-readable storage medium of claim 12, wherein determining the group of convolution kernels further comprises: selecting a convolution kernel from among a plurality of convolution kernels that operate on the featu…
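
The grouping step in the independent claims (element-wise ratio of two kernels, then the standard deviation of that ratio) can be sketched as follows. This is an illustrative reading, not the patented implementation. The claims say only that the metric must meet "a similarity criterion", so the `threshold` comparison, the greedy primary-kernel loop modeled on claim 2, and the use of the mean ratio as a scaling factor are all assumptions made here. The sketch also assumes kernels contain no zero elements, since the ratio is undefined otherwise.

```python
import numpy as np

def similarity_metric(k1, k2):
    """Standard deviation of the element-wise ratio k1 / k2.

    A value near 0 means k1 is close to a scalar multiple of k2.
    Assumes k2 has no zero elements.
    """
    return float(np.std(k1 / k2))

def group_kernels(kernels, threshold):
    """Greedy grouping: take the first unassigned kernel as the primary
    kernel and pull in every unassigned kernel whose metric against it is
    at or below the (assumed) threshold."""
    unassigned = list(range(len(kernels)))
    groups = []
    while unassigned:
        primary = unassigned.pop(0)
        group = [primary]
        for idx in unassigned[:]:
            if similarity_metric(kernels[primary], kernels[idx]) <= threshold:
                group.append(idx)
                unassigned.remove(idx)
        groups.append(group)
    return groups

def scaling_factors(kernels, group, base_idx):
    """Assumed choice: mean element-wise ratio of each kernel to the base."""
    return [float(np.mean(kernels[i] / kernels[base_idx])) for i in group]

# Hypothetical 2x2 kernels: 0, 1 and 3 are multiples of one another; 2 is not.
base = np.array([[1.0, 2.0], [3.0, 4.0]])
kernels = [base, 2.0 * base, np.array([[4.0, 1.0], [1.0, 4.0]]), -0.5 * base]

groups = group_kernels(kernels, threshold=0.05)
factors = scaling_factors(kernels, groups[0], base_idx=groups[0][0])
print(groups)   # [[0, 1, 3], [2]]
print(factors)  # [1.0, 2.0, -0.5]
```

Here the first kernel of each group doubles as the base kernel. Claims 3 and 4 leave room for other choices, such as selecting by a base selection metric or computing the base kernel as a function of the group's kernels.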

Assignees

Samsung Electronics Co Ltd

Inventors

Classifications

  • G06N3/045 · Primary

    Combinations of networks · CPC title

  • modifying the architecture, e.g. adding, deleting or silencing nodes or connections · CPC title

  • G06N3/0454 · Primary

    Physics · mapped topic

  • G06V10/82 · Primary

    using neural networks · CPC title

  • G06N3/0464 · Primary

    Convolutional networks [CNN, ConvNet] · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10460230B2 cover?
Reducing computations in a neural network may include determining a group including a plurality of convolution kernels of a convolution stage of a neural network. The convolution kernels of the group are similar to one another. A base convolution kernel for the group may be determined. Scaling factors for a plurality of input feature maps processed by the group may be calculated. The convolutio…
Who is the assignee on this patent?
Samsung Electronics Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06N3/045. Mapped technology areas include Physics.
When was this patent published?
Publication date Oct 29, 2019 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).