Multivariate initialization for convolutional layers

US12555198B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12555198-B2
Application numberUS-202318373768-A
CountryUS
Kind codeB2
Filing dateSep 27, 2023
Priority dateSep 27, 2023
Publication dateFeb 17, 2026
Grant dateFeb 17, 2026

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method of initializing and training a neural network having a plurality of layers includes defining a first function configured to generate a filter based on a plurality of variance values associated with respective pairs of parameters of the plurality of layers, calculating the plurality of variance values based on depths of respective layers of the plurality of layers such that the variance values increase as the depths increase, calculating a covariance matrix using the first function, the covariance matrix having a block structure and each block of the covariance matrix corresponding to a covariance between a respective parameter and other parameters of the plurality of layers, providing, as input, the covariance matrix to the neural network to initialize the neural network for training, and generating, using the neural network, an output based on the covariance matrix.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method of initializing and training a neural network having a plurality of layers, the method comprising: defining a first function configured to generate a filter based on a plurality of variance values associated with respective pairs of parameters of the plurality of layers; calculating the plurality of variance values based on depths of respective layers of the plurality of layers such that the variance values increase as the depths increase; calculating a covariance matrix using the first function, wherein the covariance matrix has a block structure and each block of the covariance matrix corresponds to a covariance between a respective parameter and other parameters of the plurality of layers; providing, as input, the covariance matrix to the neural network to initialize the neural network for training; and generating, using the neural network, an output based on the covariance matrix. 2 . The method of claim 1 , wherein the first function corresponds to a Gaussian function. 3 . The method of claim 1 , wherein the filter is a super-Gaussian filter. 4 . The method of claim 1 , wherein the covariance corresponds to a covariance between a filter pixel and other filter pixels. 5 . The method of claim 1 , wherein calculating the plurality of variance values includes calculating the plurality of variance values further based on a velocity that indicates a rate at which the variance values increase from an initial variance as the depths increase. 6 . The method of claim 1 , wherein calculating the plurality of variance values includes calculating the plurality of variance values further based on an acceleration parameter. 7 . The method of claim 1 , wherein calculating the plurality of variance values includes calculating the plurality of variances in accordance with σ ← σ 0 + v σ ⁢ d + 1 2 ⁢ a σ ⁢ d 2 , wherein σ is a calculated variance value, σ 0 is an initial variance, v is a velocity at which variance grows from the initial variance as a depth d increases, and α is an acceleration parameter. 8 . The method of claim 1 , wherein the covariance matrix is used to define a multivariate probability distribution, and wherein initializing the neural network includes sampling the multivariate probability distribution to obtain initial filters for the neural network. 9 . A computing device configured to initialize and train a neural network having a plurality of layers, the computing device including a processing device configured to execute instructions stored in memory to: define a first function configured to generate a filter based on a plurality of variance values associated with respective pairs of parameters of the plurality of layers; calculate the plurality of variance values based on depths of respective layers of the plurality of layers such that the variance values increase as the depths increase; calculate a covariance matrix using the first function, wherein the covariance matrix has a block structure and each block of the covariance matrix corresponds to a covariance between a respective parameter and other parameters of the plurality of layers; provide, as input, the covariance matrix to the neural network to initialize the neural network for training; and generate, using the neural network, an output based on the covariance matrix. 10 . The computing device of claim 9 , wherein the first function corresponds to a Gaussian function. 11 . The computing device of claim 9 , wherein the filter is a super-Gaussian filter. 12 . The computing device of claim 9 , wherein the covariance corresponds to a covariance between a filter pixel and other filter pixels. 13 . The computing device of claim 9 , wherein, to calculate the plurality of variance values, the processing device is configured to execute instructions to calculate the plurality of variance values further based on a velocity that indicates a rate at which the variance values increase from an initial variance as the depths increase. 14 . The computing device of claim 9 , wherein, to calculate the plurality of variance values, the processing device is configured to execute instructions to calculate the plurality of variance values further based on an acceleration parameter. 15 . The computing device of claim 9 , wherein, to calculate the plurality of variance values, the processing device is configured to execute instructions to calculate the plurality of variances in accordance with σ ← σ 0 + v σ ⁢ d + 1 2 ⁢ a σ ⁢ d 2 , wherein σ is a calculated variance value, σ 0 is an initial variance, v is a velocity at which variance grows from the initial variance as a depth d increases, and α is an acceleration parameter. 16 . The computing device of claim 9 , wherein the processing device is configured to execute instructions to use the covariance matrix to define a multivariate probability distribution, and wherein initializing the neural network includes sampling the multivariate probability distribution to obtain initial filters for the neural network. 17 . A computer-controlled machine comprising the computing device of claim 9 , wherein the computer-controlled machine further comprises an actuator configured to control an operation of the computer-controlled machine based on an output of the neural network. 18 . A system configured to train a neural network having a plurality of layers, the system comprising: data storage that stores training data for training the neural network; memory that stores a data representation of the neural network; and a processing device configured to iteratively train the neural network using the training data to obtain a trained neural network, wherein iteratively training the neural network includes initializing the neural network by defining a first function configured to generate a filter

Assignees

Inventors

Classifications

  • G06T5/20Primary

    using local operators · CPC title

  • Artificial neural networks [ANN] · CPC title

  • Training; Learning · CPC title

  • Combinations of networks · CPC title

  • Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12555198B2 cover?
A method of initializing and training a neural network having a plurality of layers includes defining a first function configured to generate a filter based on a plurality of variance values associated with respective pairs of parameters of the plurality of layers, calculating the plurality of variance values based on depths of respective layers of the plurality of layers such that the variance…
Who is the assignee on this patent?
Bosch Gmbh Robert
What technology area does this patent fall under?
Primary CPC classification G06T5/20. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 17 2026 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).