Device and method for generating a compressed network from a trained neural network

US12024159B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12024159-B2
Application numberUS-202016983802-A
CountryUS
Kind codeB2
Filing dateAug 3, 2020
Priority dateSep 19, 2019
Publication dateJul 2, 2024
Grant dateJul 2, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A device and a method for generating a compressed network from a trained neural network are provided. The method includes: a model generating a compressing map from first training data, the compressing map representing the impact of model components of the model to first output data in response to the first training data; generating a compressed network by compressing the trained neural network in accordance with the compressing map; the trained neural network generating trained network output data in response to second training data; the compressed network generating compressed network output data in response to the second training data; training the model by comparing the trained network output data with the compressed network output data.

First claim

Opening claim text (preview).

What is claimed is: 1. A method of a computer-implemented model generating a compressed network from a trained neural network, the method comprising the following steps: generating, by a first model portion of the model, an impact map representing an impact of model components for each first output datum of first output data in response to an associated first training datum, wherein each generated impact map includes, for each of the model components, an impact in processing the associated first training datum; generating a combined impact map using each of the generated impact maps; generating, by a second model portion of the model, a compressing map from the combined impact map; generating a compressed network by compressing the trained neural network in accordance with the compressing map, wherein the generating of the compressed network includes deleting network components from the trained neural network in accordance with the compressing map when a corresponding value in the compressing map meets a predefined deleting criterion; generating, by the trained neural network, trained network output data in response to second training data; generating, by the compressed network, compressed network output data in response to the second training data; and training the model by comparing the trained network output data with the compressed network output data. 2. The method of claim 1 , wherein the training of the model includes training the first model portion and/or training the second model portion by comparing the trained network output data with the compressed network output data. 3. The method of claim 1 , wherein the predefined deleting criterion is met when the corresponding value in the compressing map is below a predefined threshold value. 4. The method of claim 1 , wherein the training of the model includes training the model to increase a total compression by reducing a sum of each value of the compressing map. 5. The method of claim 1 , wherein the first output data are generated by the trained neural network for the first training data. 6. The method of claim 1 , wherein the trained neural network is trained to provide first output data for first input data of a plurality of tasks, and wherein the compressed network provides second output data for second input data of at least one task of the plurality of tasks. 7. The method of claim 6 , wherein the first training data and/or the second training data are selected from a plurality of data using a selection model. 8. The method of claim 1 , further comprising the following steps: generating, by the compressed network generated by the trained model, third training data in response to input data; and training another model using the third training data. 9. A non-transitory computer-readable memory medium on which is stored a computer program of a computer-implemented model generating a compressed network from a trained neural network, the computer program, when executed by a computer, causing the computer to perform the following steps: generating, by a first model portion of the model, an impact map representing an impact of model components for each first output datum of first output data in response to an associated first training datum, wherein each generated impact map includes, for each of the model components, an impact in processing the associated first training datum; generating a combined impact map using each of the generated impact maps; generating, by a second model portion of the model, a compressing map from the combined impact map; generating a compressed network by compressing the trained neural network in accordance with the compressing map, wherein the generating of the compressed network includes deleting network components from the trained neural network in accordance with the compressing map when a corresponding value in the compressing map meets a predefined deleting criterion; generating, by the trained neural network, trained network output data in response to second training data; generating, by the compressed network, compressed network output data in response to the second training data; and training the model by comparing the trained network output data with the compressed network output data. 10. A system, comprising: a device including a compressed network generated by a trained model, the device configured to process digital input data; and at least one sensor configured to provide digital input data for the device; wherein the trained model is trained by generating, by a first model portion of the model, an impact map representing an impact of model components for each first output datum of first output data in response to an associated first training datum, wherein each generated impact map includes, for each of the model components, an impact in processing the associated first training datum; generating a combined impact map using each of the generated impact maps; generating, by a second model portion of the model, a compressing map from the combined impact map; generating a compressed network by compressing a trained neural network in accordance with the compressing map, wherein the generating of the compressed network includes deleting network components from the trained neural network in accordance with the compressing map when a corresponding value in the compressing map meets a predefined deleting criterion; generating, by the trained neural network, trained network output data in response to second training data; generating, by the compressed network, compressed network output data in response to the second training data; and training the model by comparing the trained network output data with the compressed network output data. 11. A vehicle, comprising: at least one sensor configured to provide digital input data; and a driving assistance system including a compressed network generated from a trained neural network using a trained model, wherein the trained neural network is configured to process the digital input data, wherein the compressed network is configured to provide digital output data for the digital input data provided by the at least one sensor, and wherein the driving assistance system is configured to control the vehicle using the digital output data, and wherein the trained model is trained by: generating, by a first model portion of the model, an impact map representing an impact of model components for each first output datum of first output data in response to an associated first training datum, wherein each generated impact map includes, for each of the model components, an impact in processing the associated first training datum; generating a combined impact map using each of the generated impact maps; generating, by a second model portion of the model, a compressing map from the combined impact map; generating a compressed network by compressing the trained neural network in accordance with the compressing map, wherein the generating of the compressed network includes deleting network components from the trained neural network in accordance with the compressing map when a corresponding value in the compressing map meets a predefined deleting criterion; generating, by the trained neural network, trained network output data in response to second training data; generating, by the compressed network, compressed network output data in response to the second training data; and training the model by comparing the trained network output data with the compressed network output data.

Assignees

Inventors

Classifications

  • Convolutional networks [CNN, ConvNet] · CPC title

  • Quantised networks; Sparse networks; Compressed networks · CPC title

  • Supervised learning · CPC title

  • G06N3/082Primary

    modifying the architecture, e.g. adding, deleting or silencing nodes or connections · CPC title

  • Combinations of networks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12024159B2 cover?
A device and a method for generating a compressed network from a trained neural network are provided. The method includes: a model generating a compressing map from first training data, the compressing map representing the impact of model components of the model to first output data in response to the first training data; generating a compressed network by compressing the trained neural network…
Who is the assignee on this patent?
Bosch Gmbh Robert
What technology area does this patent fall under?
Primary CPC classification G06N3/082. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 02 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).