Reducing computations in a neural network

US2016358068A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2016358068-A1
Application numberUS-201615014686-A
CountryUS
Kind codeA1
Filing dateFeb 3, 2016
Priority dateJun 4, 2015
Publication dateDec 8, 2016
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Reducing computations in a neural network may include determining a group including a plurality of convolution kernels of a convolution stage of a neural network. The convolution kernels of the group are similar to one another. A base convolution kernel for the group may be determined. Scaling factors for a plurality of input feature maps processed by the group may be calculated. The convolution stage of the neural network may be modified to calculate a composite input feature map using the scaling factors and apply the base convolution kernel to the composite input feature map.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method, comprising: determining, using a processor, a group comprising a plurality of convolution kernels of a convolution stage of a neural network, wherein the convolution kernels of the group are similar to one another; determining a base convolution kernel for the group; determining scaling factors for a plurality of input feature maps processed by the group; and modifying the convolution stage to calculate a composite input feature map using the scaling factors and apply the base convolution kernel to the composite input feature map. 2 . The method of claim 1 , wherein determining the group of convolution kernels comprises: determining a similarity metric between a first convolution kernel and a second convolution kernel; and including the first convolution kernel and the second convolution kernel in the group responsive to determining that the similarity metric meets a similarity criterion. 3 . The method of claim 2 , wherein determining a similarity metric comprises: calculating a ratio of the first convolution kernel to the second convolution kernel; and calculating a standard deviation for the ratio of the first convolution kernel and the second convolution kernel. 4 . The method of claim 1 , wherein determining the group of convolution kernels comprises: selecting a convolution kernel from among a plurality of convolution kernels that operate on the feature map as a primary convolution kernel; and calculating a ratio of the primary convolution kernel to other convolution kernels of the plurality of convolution kernels not yet assigned to a group. 5 . The method of claim 1 , wherein determining the base convolution kernel for the group comprises: selecting a convolution kernel of the group as the base convolution kernel according to a base selection metric. 6 . The method of claim 1 , wherein determining the base convolution kernel for the group comprises: determining the base convolution kernel as a function of the convolution kernels of the group. 7 . The method of claim 1 , further comprising: retraining the neural network comprising the modified convolution stage without permitting changes to membership in the group of convolution kernels. 8 . The method of claim 1 , further comprising: executing the neural network comprising the modified convolution stage, wherein executing the neural network comprises: scaling each of the plurality of input feature maps using the scaling factors, generating the composite input feature map as a sum of the scaled input feature maps, and applying the base convolution kernel to the composite input feature map. 9 . A system, comprising: a processor programmed to initiate executable operations comprising: determining a group comprising a plurality of convolution kernels of a convolution stage of a neural network, wherein the convolution kernels of the group are similar to one another; determining a base convolution kernel for the group; determining scaling factors for a plurality of input feature maps processed by the group; and modifying the convolution stage to calculate a composite input feature map using the scaling factors and apply the base convolution kernel to the composite input feature map. 10 . The system of claim 9 , wherein determining the group of convolution kernels comprises: determining a similarity metric between a first convolution kernel and a second convolution kernel; and including the first convolution kernel and the second convolution kernel in the group responsive to determining that the similarity metric meets a similarity criterion. 11 . The system of claim 10 , wherein determining a similarity metric comprises: calculating a ratio of the first convolution kernel to the second convolution kernel; and calculating a standard deviation for the ratio of the first convolution kernel and the second convolution kernel. 12 . The system of claim 9 , wherein determining the group of convolution kernels comprises: selecting a convolution kernel from among a plurality of convolution kernels that operate on the feature map as a primary convolution kernel; and calculating a ratio of the primary convolution kernel to other convolution kernels of the plurality of convolution kernels not yet assigned to a group. 13 . The system of claim 9 , wherein determining the base convolution kernel for the group comprises: selecting a convolution kernel of the group as the base convolution kernel according to a base selection metric. 14 . The system of claim 9 , wherein determining the base convolution kernel for the group comprises: determining the base convolution kernel as a function of the convolution kernels of the group. 15 . The system of claim 9 , wherein the processor is further programmed to initiate executable operations comprising: retraining the neural network comprising the modified convolution stage without permitting changes to membership in the group of convolution kernels. 16 . A non-transitory computer-readable storage medium having instructions stored thereon which, when executed by a processor, perform a method comprising: determining, using the processor, a group comprising a plurality of convolution kernels of a convolution stage of a trained neural network, wherein the convolution kernels of the group are similar to one another; determining a base convolution kernel for the group; determining scaling factors for a plurality of input feature maps processed by the group; and modifying the convolution stage to calculate a composite input feature map using the scaling factors and apply the base convolution kernel to the composite input feature map. 17 . The non-transitory computer-readable storage medium of claim 16 , wherein determining the group of convolution kernels comprises: determining a similarity metric between a first convolution kernel and a second convolution kernel; and including the first convolution kernel and the second convolution kernel in the group responsive to determining that the similarity metric meets a similarity criterion. 18 . The non-transitory computer-readable storage medium of claim 17 , wherein determining a similarity metric comprises: calculating a ratio of the first convolution kernel to the second convolution kernel; and calculating a standard deviation for the ratio of the first convolution kernel and the second convolution kernel. 19 . The non-transitory computer-readable storage medium of claim 16 , wherein determining the group of convolution kernels comprises: selecting a convolution kernel from among a plurality of convolution kernels that operate on the feature map as a primary convolution kernel; and calculating a ratio of the primary convolution kernel to other convolution kernels of the plurality of convolution kernels not yet assigned to a group. 20 . The non-transitory computer-readable storage medium of claim 16 , wherein the method further comprises: retraining the neural network comprising the modified convolution stage without permitting changes to membership in the group of convolution kernels.

Assignees

Inventors

Classifications

  • G06N3/045Primary

    Combinations of networks · CPC title

  • modifying the architecture, e.g. adding, deleting or silencing nodes or connections · CPC title

  • G06V10/82Primary

    using neural networks · CPC title

  • G06N3/0464Primary

    Convolutional networks [CNN, ConvNet] · CPC title

  • Physics · mapped topic

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2016358068A1 cover?
Reducing computations in a neural network may include determining a group including a plurality of convolution kernels of a convolution stage of a neural network. The convolution kernels of the group are similar to one another. A base convolution kernel for the group may be determined. Scaling factors for a plurality of input feature maps processed by the group may be calculated. The convolutio…
Who is the assignee on this patent?
Samsung Electronics Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06N3/045. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Dec 08 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).