What technology area does this patent fall under?

Primary CPC classification G06N3/082. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Apr 04 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 10 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Domain specific model compression

US11620435B2 · US · B2

Patent metadata
Field	Value
Publication number	US-11620435-B2
Application number	US-201916597920-A
Country	US
Kind code	B2
Filing date	Oct 10, 2019
Priority date	Oct 10, 2019
Publication date	Apr 4, 2023
Grant date	Apr 4, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Domain specific model compression by providing a weighting parameter for a candidate operation of a neural network, applying the weighting parameter to an output vector of the candidate operation, performing a regularization of the weighting parameter output vector combination, compressing the neural network model according to the results of the regularization, and providing the neural network model after compression.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer implemented method for domain specific model compression, the method comprising: providing, by one or more computer processors, a pre-trained neural network model; providing, by the one or more computer processors, an additional weighting parameter for a candidate operation of the pre-trained neural network model; applying, by the one or more computer processors, the weighting parameter to an output vector of the candidate operation; performing, by the one or more computer processors, a regularization of the application of the weighting parameter to the output vector, wherein the regularization comprises: adding a penalty term to a loss function, the penalty term associated with the weighting parameter, wherein the penalty term includes one of an absolute value of the additional weighting parameter and the square of the additional weighting parameter; compressing, by the one or more computer processors, the neural network model according to a result of the regularization, by removing candidate operations having a zero-value weighting parameter; and providing, by the one or more computer processors, the neural network model after compression. 2. The computer implemented method according to claim 1 , wherein the neural network model comprises a language processing model. 3. The computer implemented method according to claim 1 , further comprising training, by the one or more computer processors, the neural network using unlabeled domain data. 4. The computer implemented method according to claim 1 , further comprising training, by the one or more computer processors, the neural network with two objectives, wherein one objective comprises a domain classification task. 5. The computer implemented method according to claim 1 , further comprising training, by the one or more computer processors, the neural network using labeled domain data. 6. The computer implemented method according to claim 1 , further comprising reducing, by the one or more computer processors, a neural network model attention head weighting value to zero. 7. The computer implemented method according to claim 1 , further comprising: training, by the one or more computer processors, the neural network with two objectives, wherein one objective comprises a domain classification task; and training, by the one or more computer processors, the neural network using labeled data. 8. A computer program product for domain specific model compression, the computer program product comprising one or more computer readable storage devices and stored program instructions on the one or more computer readable storage devices, the stored program instructions comprising: program instructions to provide a pre-trained neural network model; program instructions to provide an additional weighting parameter for a candidate operation of the neural network model; program instructions to apply the weighting parameter to an output vector of the candidate operation; program instructions to perform a regularization of the application of the weighting parameter to the output vector wherein the regularization comprises: adding a penalty term to a loss function, the penalty term associated with the weighting parameter, wherein the penalty term includes one of an absolute value of the additional weighting parameter and the square of the additional weighting parameter; program instructions to compress the neural network model according to a result of the regularization, by removing candidate operations having a zero-value weighting parameter; and program instructions to provide the neural network model after compression. 9. The computer program product according to claim 8 , wherein the neural network model comprises a language processing model. 10. The computer program product according to claim 8 , further comprising program instructions to train the neural network using unlabeled domain data. 11. The computer program product according to claim 8 , further comprising program instructions to train the neural network with two objectives, wherein one objective comprises a domain classification task. 12. The computer program product according to claim 8 , further comprising program instructions to train the neural network using labeled domain data. 13. The computer program product according to claim 8 , further comprising program instructions to reduce a neural network model attention head weighting value to zero. 14. The computer program product according to claim 8 , further comprising program instructions to: train the neural network with two objectives, wherein one objective comprises a domain classification task; and train the neural network using labeled data. 15. A computer system for domain specific model compression, the computer system comprising: one or more computer processors; one or more computer readable storage devices; and stored program instructions on the one or more computer readable storage devices for execution by the one or more computer processors, the stored program instructions comprising: program instructions to provide a pre-trained neural network model; program instructions to provide an additional weighting parameter for a candidate operation of the neural network model; program instructions to apply the weighting parameter to an output vector of the candidate operation; program instructions to perform a regularization of the application of the weighting parameter to the output vector wherein the regularization comprises: adding a penalty term to a loss function, the penalty term associated with the weighting parameter, wherein the penalty term includes one of an absolute value of the additional weighting parameter and the square of the additional weighting parameter; program instructions to compress the neural network model according to a result of the regularization, by removing candidate operations having a zero-value weighting parameter; and program instructions to provide the neural network model after compression. 16. The computer system according to claim 15 , wherein the neural network model comprises a language processing model. 17. The computer system according to claim 15 , further comprising program instructions to train the neural network using unlabeled domain data. 18. The computer system according to claim 15 , further comprising program instructions to train the neural network with two objectives, wherein one objective comprises a domain classification task. 19. The computer system according to claim 15 , further comprising program instructions to train the neural network using labeled domain data. 20. The computer system according to claim 15 , further comprising program instructions to reduce a neural network model attention head weighting value to zero.

Assignees

Inventors

Classifications

G06N3/0895
Weakly supervised learning, e.g. semi-supervised or self-supervised learning · CPC title
G06N3/09
Supervised learning · CPC title
G06N3/0495
Quantised networks; Sparse networks; Compressed networks · CPC title
G06N3/0464
Convolutional networks [CNN, ConvNet] · CPC title
G06N3/082Primary
modifying the architecture, e.g. adding, deleting or silencing nodes or connections · CPC title

Patent family

Related publications grouped by family.

View patent family 75383071

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11620435B2 cover?: Domain specific model compression by providing a weighting parameter for a candidate operation of a neural network, applying the weighting parameter to an output vector of the candidate operation, performing a regularization of the weighting parameter output vector combination, compressing the neural network model according to the results of the regularization, and providing the neural network …
Who is the assignee on this patent?: IBM
What technology area does this patent fall under?: Primary CPC classification G06N3/082. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Apr 04 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 10 related publications on this page (citations in our corpus or others sharing the same primary CPC).