Neural network system that executes batch normalization

US2024256867A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2024256867-A1
Application numberUS-202118560798-A
CountryUS
Kind codeA1
Filing dateSep 24, 2021
Priority dateJun 9, 2021
Publication dateAug 1, 2024
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A neural network system includes: a first layer generating first layer outputs of pieces of training data; a second layer; and a numerical conversion layer. During training, the numerical conversion layer: receives the first layer outputs from the first layer, calculates a numerical conversion parameter corresponding to each of the pieces of training data, numerically converts each of components of each of the first layer outputs using the numerical conversion parameter to generate a numerical conversion layer output, and inputs the same to the second layer. The numerical conversion parameter corresponding to one of the pieces of training data is: calculated from the first layer outputs of the pieces of training data except the one of the pieces of training data, or calculated by weighting one of the first layer outputs of the pieces of training data including the one of the pieces of training data.

First claim

Opening claim text (preview).

1 . A neural network system implemented by one or more computers, comprising: a first layer that generates first layer outputs of a plurality of pieces of training data, each of the first layer outputs having a plurality of components; a second layer; and a numerical conversion layer disposed between the first layer and the second layer, wherein during training of the neural network system, the numerical conversion layer: receives the first layer outputs from the first layer, calculates a numerical conversion parameter corresponding to each of the pieces of training data, numerically converts each of the components of each of the first layer outputs using the numerical conversion parameter to generate a numerical conversion layer output, and inputs the numerical conversion layer output to the second layer, and the numerical conversion parameter corresponding to one of the pieces of training data is: calculated from the first layer outputs of the pieces of training data except the one of the pieces of training data, or calculated by weighting each of the first layer outputs of the pieces of training data including the one of the pieces of training data, and a weight of one of the first layer outputs of the one of the pieces of training data is smaller than a weight of the other first layer outputs of the other pieces of training data. 2 . The neural network system according to claim 1 , wherein the numerical conversion parameter corresponding to the one of the pieces of training data is calculated from first layer outputs of a plurality of pieces of training data selected, from a batch including a set of the pieces of training data including the one of the pieces of training data, by a predetermined selection method to exclude the one of the pieces of training data. 3 . The neural network system according to claim 2 , wherein the components of each of the first layer outputs are indexed by dimensions, and the numerical conversion layer calculates the numerical conversion parameter by: with respect to the pieces of training data per the batch, for each of the dimensions, calculating a mean of the components of each of the first layer outputs of the pieces of training data selected by the selection method, as a pseudo mean of the components of each of the first layer outputs, and calculating, for each of the dimensions, a variance of the components of each of the first layer outputs using the components of each of the first layer outputs and the pseudo mean with respect to the pieces of training data per the batch. 4 . The neural network system according to claim 3 , wherein the numerical conversion layer generates the numerical conversion layer output by, for each of the pieces of training data, numerically converting the components of each of the first layer outputs of the pieces of training data using the pseudo mean and the variance for each of the dimensions corresponding to each of the components. 5 . The neural network system according to claim 4 , wherein the numerical conversion layer generates the numerical conversion layer output by of transforming the components numerically converted based on values of a set of transformation parameters for each of the dimensions. 6 . The neural network system according to claim 5 , wherein after the neural network system is trained, the numerical conversion layer: receives a new first layer output for a new neural network input generated by the first layer, generates a new numerically converted layer output by numerically converting each of components of the new first layer output using a precalculated numerical conversion parameter, generates a new numerical conversion layer output by converting, for each of the dimensions, each of components of the new numerically converted layer output based on a set of transformation parameters for each of the dimensions, and newly inputs the new numerical conversion layer output to the second layer. 7 . The neural network system according to claim 6 , wherein the precalculated numerical conversion parameter is calculated from the first layer outputs generated by the first layer during the training of the neural network system. 8 . The neural network system according to claim 6 , wherein the precalculated numerical conversion parameter is calculated from the new first layer output generated by the first layer after the neural network system is trained. 9 . The neural network system according to claim 7 , wherein a new neural network input processed by the neural network system after the neural network system is trained is an input of a different type from the pieces of training data used to train the neural network system. 10 . The neural network system according to claim 2 , wherein the components of each of the first layer outputs are indexed by a feature index and a spatial location index, the numerical conversion layer calculates the numerical conversion parameter by: with respect to the pieces of training data per the batch, for each combination of the feature index and the spatial location index, calculating a mean of the components of each of the first layer outputs of the pieces of training data selected by the selection method, with respect to the pieces of training data per the batch, for each of the feature index, calculating an arithmetic mean of the mean with respect to the combination including the feature index, for each combination of the feature index and the spatial location index, calculating a variance of the components of each of the first layer outputs using the components of each of the first layer outputs and the arithmetic mean with respect to the pieces of training data per the batch, and for each of the feature index, calculating an arithmetic variance of the variance with respect to the combination including the feature index. 11 . The neural network system according to claim 10 , wherein the numerical conversion layer generates the numerical conversion layer output by, for each of the pieces of training data, numerically converting the components of each of the first layer outputs of the pieces of training data using the arithmetic mean and the arithmetic variance. 12 . The neural network system according to claim 11 , wherein the numerical conversion layer generates the numerical conversion layer output by converting the components numerically converted based on a set of transformation parameters for each of the feature index. 13 . The neural network system according to claim 12 , wherein after the neural network system is trained, the numerical conversion layer: receives a new first layer output for a new neural network input generated by the first layer, generates a new numerically converted layer output by numerically converting each of components of the new first layer output using a precalculated numerical conversion parameter, generates a new numerical conversion layer output by converting, for each of the feature index, each of components of the new numerically converted layer output based on a set of transformation parameters for each of the feature index, and newly inputs the new numerical conversion layer output to the second layer. 14 . The neural network system according to claim 2 , wherein the components of each of the first layer outputs are indexed by a feature index and a spatial location index, and the numerical conversion layer calculates the numerical conversion parameter by: with respect to the pieces of training data per the batch, for each of the feature index, calculating a mean of the components of each of the first layer outputs of the pi

Assignees

Inventors

Classifications

  • Clustering; Classification · CPC title

  • Backpropagation, e.g. using gradient descent · CPC title

  • Combinations of networks · CPC title

  • G06N3/08Primary

    Learning methods · CPC title

  • G06N3/04Primary

    Architecture, e.g. interconnection topology · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2024256867A1 cover?
A neural network system includes: a first layer generating first layer outputs of pieces of training data; a second layer; and a numerical conversion layer. During training, the numerical conversion layer: receives the first layer outputs from the first layer, calculates a numerical conversion parameter corresponding to each of the pieces of training data, numerically converts each of component…
Who is the assignee on this patent?
Konica Minolta Inc
What technology area does this patent fall under?
Primary CPC classification G06N3/08. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Aug 01 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).