Method and device for generating neuron network compensated for loss due to pruning

US2023124054A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2023124054-A1
Application numberUS-202217964452-A
CountryUS
Kind codeA1
Filing dateOct 12, 2022
Priority dateOct 19, 2021
Publication dateApr 20, 2023
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Embodiments relate to a method and device for generating a neural network that compensates for information loss due to pruning, including obtaining a trained neural network; pruning at least one neuron in the trained neural network; and updating one or more parameter values of a next layer in a pruned neural network based on one or more parameter values of at least one neuron among remaining neurons in a pruned target layer having the pruned neuron.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method for generating a neural network that compensates for information loss due to pruning performed by a processor, the method comprising: obtaining a trained neural network; pruning at least one neuron in the trained neural network; and updating one or more parameter values of a next layer in a pruned neural network based on one or more parameter values of at least one neuron among remaining neurons in a pruned target layer having the pruned neuron. 2 . The method according to claim 1 , wherein the step of updating includes the steps of: selecting another neuron or a combination of other neurons most similar to the pruned neuron among the remaining neurons in the target layer; computing a merging value based on one or more parameter values of the selected other neuron or one or more parameter values of a plurality of other neurons underlying the combination and one or more parameter values of the pruned neuron; and computing a merging matrix including the merging value based on one or more parameter values of remaining neurons in the pruned target layer and one or more parameter values of a neuron in an unpruned target layer. 3 . The method according to claim 2 , wherein the step of updating further includes the step of updating one or more parameter values of a neuron in the next layer in the pruned neural network based on one or more parameter values of the merging matrix and one or more parameter values of a neuron in the next layer in an unpruned neural network. 4 . The method according to claim 2 , wherein the other neuron most similar to the pruned neuron is a neuron having a most similar orientation among the remaining neurons in the pruned target layer, the merging value is a ratio between the parameter value of the selected neuron and the parameter value of the pruned neuron. 5 . The method according to claim 2 , wherein the combination of other neurons most similar to the pruned neuron is one in which a result of combining two or more of the remaining neurons in the pruned target layer has the most similar orientation to the pruned neuron, and the combination is a sum of the one or more parameter values. 6 . The method according to claim 5 , wherein the merging matrix includes a plurality of merging values, and each of the plurality of merging values is a coefficient of a sum of one or more parameter values for respective sub neurons of the selected combination. 7 . The method according to claim 2 , wherein the merging matrix is computed by decomposing a matrix of one or more parameter values of neurons in the unpruned target layer into a matrix of one or more parameter values of the remaining neurons in the pruned target layer and the merging matrix through a matrix decomposition way. 8 . The method according to claim 2 , wherein one or more parameter values of neuron in the next layer in the pruned neural network is updated through the following equation, W i+1 ′=Z i W i+1   [Equation] wherein W i+1 ′ is a matrix of the parameter value of neuron in the next layer in the updated, pruned neural network, Z i is the merging matrix, and W i+1 is a matrix of one or more parameter values of neuron in the next layer in the un-updated, pruned neural network. 9 . The method according to claim 1 , wherein the neural network includes at least some of a plurality of fully connected layers and a plurality of convolutional layers, the fully connected layer includes a node as the neuron, and the convolutional layer includes a filter as the neuron, the parameter of the neuron includes at least one of a node parameter and a filter parameter. 10 . The method according to claim 2 , wherein the neural network includes an activation function between the target layer and the next layer. 11 . The method according to claim 10 , wherein when the activation function is ReLU, the step of updating is performed when the merging matrix satisfies a preset specific condition, the specific condition includes that the merging matrix Z i has only non-negative component value. 12 . The method according to claim 11 , wherein the specific condition further includes that the merging matrix has at most one positive component value per column. 13 . A non-transitory computer-readable recording medium which records a program for executing the method according to claim 1 .

Assignees

Inventors

Classifications

  • Activation functions · CPC title

  • G06N3/0464Primary

    Convolutional networks [CNN, ConvNet] · CPC title

  • Matrix or vector computation {, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization (matrix transposition G06F7/78)} · CPC title

  • G06N3/082Primary

    modifying the architecture, e.g. adding, deleting or silencing nodes or connections · CPC title

  • Backpropagation, e.g. using gradient descent · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2023124054A1 cover?
Embodiments relate to a method and device for generating a neural network that compensates for information loss due to pruning, including obtaining a trained neural network; pruning at least one neuron in the trained neural network; and updating one or more parameter values of a next layer in a pruned neural network based on one or more parameter values of at least one neuron among remaining ne…
Who is the assignee on this patent?
Korea Inst Sci & Tech
What technology area does this patent fall under?
Primary CPC classification G06N3/0464. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Apr 20 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).