System and method for torque-based structured pruning for deep neural networks

US2023153625A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2023153625-A1
Application numberUS-202218052297-A
CountryUS
Kind codeA1
Filing dateNov 3, 2022
Priority dateNov 17, 2021
Publication dateMay 18, 2023
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method includes accessing a machine learning model, the machine learning model trained using a torque-based constraint. The method also includes receiving an input from an input source and providing the input to the machine learning model. The method also includes receiving an output from the machine learning model. The method also includes instructing at least one action based on the output from the machine learning model. Training the machine learning model includes applying a torque-based constraint on one or more filters of the machine learning model, adjusting, based on applying the torque-based constraint, a first set of one or more filters of the machine learning model to have a higher concentration of weights than a second set of one or more filters of the machine learning model, and pruning at least one channel of the machine learning model based on an average weight for the at least one channel.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method comprising: accessing, using at least one processor of an electronic device, a machine learning model, the machine learning model trained using a torque-based constraint; receiving, using the at least one processor, an input from an input source; providing, using the at least one processor, the input to the machine learning model; receiving, using the at least one processor, an output from the machine learning model; and instructing, using the at least one processor, at least one action based on the output from the machine learning model. 2 . The method of claim 1 , wherein the torque-based constraint is used during training of the machine learning model to adjust a concentration of weights among a plurality of filters. 3 . The method of claim 2 , wherein the adjusting causes a weight sparsity to increase based on a filter distance from a pivot point channel. 4 . The method of claim 1 , wherein the machine learning model includes channels remaining after pruning of the machine learning model using the torque-based constraint. 5 . The method of claim 4 , wherein the pruning of the machine learning model results in a removal of at least one channel based on an average weight for the at least one channel. 6 . The method of claim 1 , wherein the input is derived from image data and the output is an image classification. 7 . An apparatus comprising: at least one processing device configured to: access a machine learning model, the machine learning model trained using a torque-based constraint; receive an input from an input source; provide the input to the machine learning model; receive an output from the machine learning model; and instruct at least one action based on the output from the machine learning model. 8 . The apparatus of claim 7 , wherein the torque-based constraint is used during training of the machine learning model to adjust a concentration of weights among a plurality of filters. 9 . The apparatus of claim 8 , wherein the adjusting causes a weight sparsity to increase based on a filter distance from a pivot point channel. 10 . The apparatus of claim 7 , wherein the machine learning model includes channels remaining after pruning of the machine learning model using the torque-based constraint. 11 . The apparatus of claim 10 , wherein the pruning of the machine learning model results in a removal of at least one channel based on an average weight for the at least one channel. 12 . The apparatus of claim 7 , wherein the input is derived from image data and the output is an image classification. 13 . A non-transitory computer readable medium containing instructions that when executed cause at least one processor to: access a machine learning model, the machine learning model trained using a torque-based constraint; receive an input from an input source; provide the input to the machine learning model; receive an output from the machine learning model; and instruct at least one action based on the output from the machine learning model. 14 . The non-transitory computer readable medium of claim 13 , wherein the torque-based constraint is used during training of the machine learning model to adjust a concentration of weights among a plurality of filters. 15 . The non-transitory computer readable medium of claim 14 , wherein the adjusting causes a weight sparsity to increase based on a filter distance from a pivot point channel. 16 . The non-transitory computer readable medium of claim 13 , wherein the machine learning model includes channels remaining after pruning of the machine learning model using the torque-based constraint. 17 . The non-transitory computer readable medium of claim 16 , wherein the pruning of the machine learning model results in a removal of at least one channel based on an average weight for the at least one channel. 18 . The non-transitory computer readable medium of claim 13 , wherein the input is derived from image data and the output is an image classification. 19 . A method comprising: training, using at least one processor of an electronic device, a machine learning model, wherein the training includes: applying a torque-based constraint on one or more filters of the machine learning model; adjusting, based on applying the torque-based constraint, a first set of one or more filters of the machine learning model to have a higher concentration of weights than a second set of one or more filters of the machine learning model; and pruning at least one channel of the machine learning model based on an average weight for the at least one channel. 20 . The method of claim 19 , wherein the adjusting causes a weight sparsity to increase based on a filter distance from a pivot point channel. 21 . The method of claim 20 , wherein the pivot point channel is an output channel of the machine learning model. 22 . The method of claim 20 , wherein the pivot point channel is an input channel of the machine learning model. 23 . The method of claim 19 , wherein the torque-based constraint is applied during a gradient update, wherein the gradient update is based on L1 regularization or based on L2 regularization. 24 . The method of claim 19 , wherein the torque-based constraint is applied as an additional term, based on L1 regularization or L2 regularization, in a loss function. 25 . The method of claim 19 , further comprising fine-tuning remaining parameters of the machine learning model. 26 . The method of claim 25 , further comprising further training the pruned machine learning model by applying an additional torque-based constraint to one or more filters of the pruned machine learning model.

Assignees

Inventors

Classifications

  • Convolutional networks [CNN, ConvNet] · CPC title

  • G06N3/082Primary

    modifying the architecture, e.g. adding, deleting or silencing nodes or connections · CPC title

  • Combinations of networks · CPC title

  • Backpropagation, e.g. using gradient descent · CPC title

  • Dynamic search techniques; Heuristics; Dynamic trees; Branch-and-bound · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2023153625A1 cover?
A method includes accessing a machine learning model, the machine learning model trained using a torque-based constraint. The method also includes receiving an input from an input source and providing the input to the machine learning model. The method also includes receiving an output from the machine learning model. The method also includes instructing at least one action based on the output …
Who is the assignee on this patent?
Samsung Electronics Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06N3/082. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu May 18 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).