Audio signal processing device, audio signal processing method, and control program
US-2020090029-A1 · Mar 19, 2020 · US
US11748602B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11748602-B2 |
| Application number | US-202017134445-A |
| Country | US |
| Kind code | B2 |
| Filing date | Dec 27, 2020 |
| Priority date | Dec 27, 2017 |
| Publication date | Sep 5, 2023 |
| Grant date | Sep 5, 2023 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
An integrated circuit chip device and related products are provided. The integrated circuit chip device is used for performing a multiplication operation, a convolution operation or a training operation of a neural network. The device has the advantages of small calculation amount and low power consumption.
Opening claim text (preview).
The invention claimed is: 1. An integrated circuit chip device for training a neural network having n layers, n being an integer greater than or equal to 2, the integrated circuit chip device comprising: a main processing circuit; and a plurality of basic processing circuits; wherein: the main processing circuit comprises a data type conversion circuit configured to convert data between a floating point data type and a fixed point data type; the plurality of basic processing circuits are arranged as an array, each basic processing circuit is connected to an adjacent basic processing circuit, the main processing circuit is connected to a first quantity of basic processing circuits in a first row, the first quantity of basic processing circuits in an m th row, and m basic processing circuits in a first column; the integrated circuit chip device is configured to: receive a training instruction; determine input data and weight group data of a first layer according to the training instruction; and perform a forward computation of an i th layer of the neural network on the input data and the weight group data of the first layer to obtain an i th output result of the forward computation, i being an integer greater than or equal to 1 and smaller than or equal to n; the main processing circuit is configured to: obtain an i th output result gradient according to the i th output result; obtain an i th backward computation of backward computations of the i th layer according to the training instruction; obtain an i th backward computation complexity according to the i th output result gradient, input data of the i th layer, weight group data of the i th layer, and the i th backward computation; determine an i th back data type corresponding to the i th output result gradient, the input data of the i th layer, and the weight group data of the i th layer according to the i th backward computation complexity; classify the i th output result gradient, the input data of the i th layer, and the weight group data of the i th layer into a broadcasting data block and a distribution data block according to a type of the i th backward computation; partition the distribution data block of the i th back data type to obtain a plurality of basic data blocks; distribute the plurality of basic data blocks to at least one branch processing circuit of the plurality of basic processing circuits connected to the main processing circuit; and broadcast the broadcasting data block of the i th back data type to the basic processing circuits connected to the main processing circuit; at least one of the plurality of basic processing circuits is configured to: perform computations of the neural network in parallel according to the broadcasting data block of the i th back data type and the basic data blocks of the i th back data type to obtain computation results; and transfer the computation results to the main processing circuit through the basic processing circuits connected to the main processing circuit; the main processing circuit is further configured to: process the computation results to obtain a weight group gradient of the i th layer and an input data gradient of the i th layer; and update the weight group data of the i th layer according to the weight group gradient of the i th layer, wherein the i th back data type includes a fixed point type or a floating point type; the integrated circuit device is further configured to: perform backward computations of an (i−1) th layer by using the input data gradient of the i th layer as an (i−1) th output result gradient of the (i−1) th layer to obtain a weight group gradient of the (i−1) th layer; and update weight group data of a corresponding layer according to the weight group gradient of the (i−1) th layer, wherein the weight group data includes at least two weights; and the main processing circuit is configured to: when the i th backward computation is a multiplication computation, classify both the input data of the i th layer and the weight group data of the i th layer into distribution data blocks, and the i th output result gradient as a broadcasting data block; and when the i th backward computation is a convolution computation, classify both the input data of the i th layer and the weight group data of the i th layer into broadcasting data blocks, and the i th output result gradient into a distribution data block. 2. The integrated circuit chip device of claim 1 , wherein the main processing circuit is configured to compare the i th backward computation complexity with a preset threshold, wherein: when the i th backward computation complexity is greater than the preset threshold, the main processing circuit is configured to determine the i th back data type as the fixed point type; and when the i th backward computation complexity is less than or equal to the preset threshold, the main processing circuit is configured to determine the i th back data type as the floating point type. 3. The integrated circuit chip device of claim 2 , wherein the main processing circuit is configured to determine whether the i th output result gradient, the input data of the i th layer, and the weight group data of the i th layer belong to an (i+1) th back data type, wherein: when the (i+1) th back data type differs from the i th back data type, the main processing circuit is configured to convert the i th output result gradient, the input data of the i th layer, and the weight group data of the i th layer belonging to the (i+1) th back data type to an i th output result gradient, an input data of the i th layer, and weight group data of the i th layer belonging to the i th back data type using the data type conversion circuit. 4. The integrated circuit chip device of claim 1 , wherein: the main processing circuit is configured to: when the i th backward computation is a convolution computation, determine convolution input data as the input data of the i th layer and a convolution kernel as the i th output result gradient; the i th backward computation complexity is computed as: i th backward computation complexity=α*C1*kH*kW*M*N*W*C2*H, wherein: α is a convolution coefficient greater than 1; C1, kH, kW, M are values of four dimensions of the convolution kernel; and N, W, C2, and H are values of four dimensions of the convolution input data; when the i th backward computation complexity is greater than a preset threshold, the main processing circuit is configured to: determine the i th back data type as the floating point type; and determine whether the convolution input data and the convolution kernel are floating point data, wherein: when the convolution input data and the convolution kernel are not floating point data, the main processing circuit is configured to: convert the convolution input data into floating point data and the convolution kernel into floating point data; and perform the convolution computation on the convolution input data of the floating point type and the convolution kernel of the floating point type. 5. The integrated circuit chip device of claim 1 , wherein: the main processing circuit is configured to: when the i th backward computation is a matrix-multiply-matrix computation, determine input data as the input data of the i th layer and a weight as the i th output result gradient; the i th backward computation complexity is computed as complexity=β*F*G*E*F1, wherein: β is a matrix coefficient greater than or equal to 1; F and G are row and column values of the input data of the i th layer; and E and F1 are row and column values of the weight; when the i th backward computation complexity is greater than a preset threshold, the
Convolutional networks [CNN, ConvNet] · CPC title
Supervised learning · CPC title
using electronic means · CPC title
Architecture, e.g. interconnection topology · CPC title
Sum of products (for applications thereof, see the relevant places, e.g. G06F17/10, H03H17/00) · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.