Convolutional neural network tuning systems and methods
US-2019087729-A1 · Mar 21, 2019 · US
US11977974B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11977974-B2 |
| Application number | US-201715827465-A |
| Country | US |
| Kind code | B2 |
| Filing date | Nov 30, 2017 |
| Priority date | Nov 30, 2017 |
| Publication date | May 7, 2024 |
| Grant date | May 7, 2024 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A system, having a memory that stores computer executable components, and a processor that executes the computer executable components, reduces data size in connection with training a neural network by exploiting spatial locality to weight matrices and effecting frequency transformation and compression. A receiving component receives neural network data in the form of a compressed frequency-domain weight matrix. A segmentation component segments the initial weight matrix into original sub-components, wherein respective original sub-components have spatial weights. A sampling component applies a generalized weight distribution to the respective original sub-components to generate respective normalized sub-components. A transform component applies a transform to the respective normalized sub-components. A cropping component crops high-frequency weights of the respective transformed normalized sub-components to yield a set of low-frequency normalized sub-components to generate a compressed representation of the original sub-components.
Opening claim text (preview).
What is claimed is: 1. A system for compressing data during neural network training, comprising: a memory that stores computer executable components and neural network data; a processor that executes computer executable components stored in the memory, wherein the computer executable components: generate a weight matrix, wherein the weight matrix comprises respective weights to be applied to a neural network, and wherein the weight matrix is initialized based on a degree of spatial correlation at the beginning of the training of the neural network, and wherein generation comprises: providing corner weights of sub-blocks of respective original sub-components from a distribution of random numbers; and employing bilinear interpolation to fill up remaining values; and wherein the computer-executable components comprise: a receiving component that receives neural network data in the form of the weight matrix; a segmentation component that: segments the weight matrix into a plurality of original sub-components, wherein respective original sub-components in the plurality of original sub-components comprise a subset of the respective weights in the weight matrix, and wherein selected weights of the respective original sub-components within a defined region have values indicative of a degree of spatial correlation with one another; a transform component that applies a transform to the respective original sub-components resulting in data concentrated in a f first frequency segment; and a cropping component that crops a second frequency segment comprising high-frequency weights from respective transformed sub-components to generate compressed representations of original sub-components during training of the neural network, wherein the system for compressing data during neural network training results in greater compression ratio at a defined accuracy; an inverse transform component that performs an inverse transform on the data in the first frequency segment and a remaining area that is padded with zeros, wherein the first frequency segment comprises low frequency data and the inverse transform yields a data block of spatial weights that are an approximate representation of the respective original sub-component, and wherein the system: trains the neural network employing the data block of spatial weights that are an approximate representation of the respective original sub-component; determines whether a same training accuracy is achieved as a training task wherein compression is not applied and: continue neural network training if a same training accuracy is achieved; or vary the degree of spatial correlation and re-initialize the weight matrix based on the degree of spatial correlation if a same training accuracy is not achieved as a training task wherein compression is not applied. 2. The system of claim 1 , wherein low-frequency weights are located in a first region of the transformed original sub-components and high-frequency weights are located in a second region of the transformed original sub-components, wherein the first region is located in a corner of the respective transformed original sub-components. 3. The system of claim 1 , wherein the inverse transform component applies an inverse discrete cosine transform function to transform the data in the first frequency segment and a remaining area that is padded with zeros to a spatial domain. 4. The system of claim 1 , further comprising a communication component that transmits the compressed representations of original sub-components. 5. A computer-implemented method, comprising employing a processor and memory to execute computer executable components to perform the following acts comprising: generating a weight matrix, wherein the weight matrix comprises respective weights to be applied to a neural network, and wherein the weight matrix is initialized based on a degree of spatial correlation at the beginning of training of the neural network, and wherein the generating comprises: providing corner weights of sub-blocks of respective original sub-components from a distribution of random numbers; and employing bilinear interpolation to fill up remaining values; receiving neural network data in the form of the weight matrix, and wherein spatial locality is present in ones of the respective weights in a defined region; segmenting the weight matrix into a plurality of original sub-components, wherein respective original sub-components comprise a subset of the weights in the weight matrix, and wherein selected weights of the respective original sub-components within a defined region have values indicative of a degree of spatial correlation with one another; applying a transform to respective original sub-components generating a matrix of frequency domain weights determined based on the spatial weights in the respective original sub-components; and cropping high-frequency weights of the respective transformed original sub-components while retaining low frequency weights generating a set of sub-components comprising the low-frequency weights padded with zeros, wherein the computer-implemented method compresses data during neural network training resulting in lower memory and bandwidth usage by a system employing the computer-implemented method; performing an inverse transform on the set of sub-components, wherein the set of sub-components comprises low frequency data and the inverse transform yields a data block of spatial weights that are an approximate representation of the respective original sub-component, and wherein the method further comprises: training the neural network employing the data block of spatial weights that are an approximate representation of the respective original sub-component; determining whether a same training accuracy is achieved as a training task wherein compression is not applied and: continuing neural network training if a same training accuracy is achieved; or varying spatial locality and re-initializing the weight matrix based on a degree of spatial correlation if a same training accuracy is not achieved as a training task wherein compression is not applied. 6. The method of claim 5 , wherein the applying a transform comprises applying a discrete cosine transform. 7. The method of claim 5 , wherein applying an inverse transform comprises applying an inverse discrete cosine transform function to transform the set of sub-components to a spatial domain. 8. The method of claim 5 , wherein the set of sub-components area compressed representation of the weight matrix. 9. A computer program product for compressing training data, the computer program product comprising a computer readable storage medium having program instructions embodied therewith, the program instructions executable by a processor to cause the processor to: generate a weight matrix, wherein the weight matrix comprises respective weights to be applied to a neural network, and wherein the weight matrix is initialized based on a degree of spatial correlation at the beginning of the training of the neural network, and wherein generation comprises: determining corner weights of sub-blocks of respective original sub-components from a distribution of random numbers; and employing bilinear interpolation to fill up remaining values; receive neural network data in the form of the weight matrix, and wherein spatial locality is present in ones of the respective weights near one another in a defined region; segment the weight matrix into original sub-components, wherein respective original sub-components comprise a subset of weights in the weight matrix, and wherein selected weights of the respective original sub-components within a defined region have values indicative of a
Distributed learning, e.g. federated learning · CPC title
Convolutional networks [CNN, ConvNet] · CPC title
Quantised networks; Sparse networks; Compressed networks · CPC title
Learning methods · CPC title
Discrete orthonormal transforms, e.g. discrete cosine transform, discrete sine transform, and variations therefrom, e.g. modified discrete cosine transform, integer transforms approximating the discrete cosine transform (G06F17/145 takes precedence) · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.