What technology area does this patent fall under?

Primary CPC classification: G06N3/08 (Learning methods). Mapped technology areas include Physics.

When was this patent published?

Publication date: Mar 12, 2020 (A1). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Methods and apparatus for enhancing a neural network using binary tensor and scale factor pairs

US2020082264A1 · US · A1

Patent metadata
Field	Value
Publication number	US-2020082264-A1
Application number	US-201816609735-A
Country	US
Kind code	A1
Filing date	May 22, 2018
Priority date	May 23, 2017
Publication date	Mar 12, 2020
Grant date	—

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods and apparatus are disclosed for enhancing a neural network using binary tensor and scale factor pairs. For one example, a method of optimizing a trained convolutional neural network (CNN) includes initializing an approximation residue as a trained weight tensor for the trained CNN. A plurality of binary tensors and scale factor pairs are determined. The approximation residue is updated using the binary tensors and scale factor pairs.
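The loop the abstract describes can be sketched in a few lines of NumPy. This is a minimal illustration, not the patent's implementation: the function name `binary_expand` and the parameter `num_terms` are hypothetical, the {-1, +1} encoding of a "binary tensor" is an assumption, and the sign / mean-magnitude choice for each pair is one reading of the claim language.

```python
import numpy as np

def binary_expand(weights, num_terms):
    """Greedy sketch: approximate `weights` as sum_k scale_k * binary_k.

    Assumption: each binary tensor holds values in {-1, +1}.
    """
    # The approximation residue starts out as the trained weight tensor.
    residue = np.asarray(weights, dtype=np.float64).copy()
    pairs = []
    for _ in range(num_terms):
        # Binary tensor: sign of the current residue (zeros mapped to +1).
        binary = np.where(residue >= 0, 1.0, -1.0)
        # Scale factor: average magnitude of the current residue.
        scale = np.abs(residue).mean()
        pairs.append((binary, scale))
        # Update the residue with the newly chosen pair.
        residue = residue - scale * binary
    return pairs, residue

# Stand-in for a trained floating-point filter bank (random data, illustration only).
rng = np.random.default_rng(0)
w = rng.normal(size=(8, 3, 3, 3))

pairs, residue = binary_expand(w, num_terms=4)
approx = sum(scale * binary for binary, scale in pairs)
```

With this choice of pair, each step lowers the squared norm of the residue by the number of elements times the squared scale factor, so adding terms never makes the approximation worse.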

First claim

Opening claim text (preview).

1. A method of optimizing a trained convolutional neural network (CNN) comprising: initializing an approximation residue as a trained weight tensor for the trained CNN; determining a plurality of binary tensors and scale factor pairs; and updating the approximation residue using the binary tensors and scale factor pairs.
2. The method of claim 1, further comprising: iteratively determining additional binary tensors and scale factor pairs and updating the approximation residue using the additional binary tensors and scale factor pairs.
3. The method of claim 1, wherein iteratively determining additional binary tensors and scale factor pairs and updating the approximation residue using the additional binary tensors and scale factor pairs is repeated to find a maximum network efficiency.
4. The method of claim 1, further comprising: approximating trained filters of the trained CNN by determining a basis of binary tensors and a series of scale factors.
5. The method of claim 4, wherein determining the basis of binary tensors and the series of scale factors includes determining a plurality of binary approximations, wherein each approximation is a combination of different binary tensors and each binary tensor is paired with a scale factor.
6. The method of claim 1, wherein determining the plurality of binary tensors and scale factor pairs includes: learning heuristically the binary tensors and scale factors by selecting a first binary tensor and scale factor to be an optimum and using a previously selected optimum for each of a plurality of additional selections until all of the trained filters are approximated.
7. The method of claim 6, wherein each binary tensor represents a sign of a respective approximation residue and each scale factor represents a corresponding average magnitude.
8. The method of claim 6, wherein each binary tensor represents a sign of a respective approximation residue and each scale factor is refined using a least squares regression of all the binary tensors.
9. The method of claim 1, wherein the trained weight tensor comprises a floating-point weight tensor.
10. The method of claim 1, wherein the binary tensors and scale factor pairs comprise a binary structure.
11. The method of claim 10, further comprising: using the binary structure directly in a pre-trained filter according to a CNN model to produce binary weight models via tensor expansion.
12. The method of claim 1, further comprising: approximating the pre-trained filter with a linear span of a certain binary basis before initializing the approximation residue.
13. The method of claim 1, further comprising: grouping identical binary tensors to pursue a maximal network efficiency.
14. A machine-readable medium comprising instructions which when operated on by the machine cause the machine to perform a method comprising: initializing an approximation residue as a trained weight tensor for the trained CNN; determining a plurality of binary tensors and scale factor pairs; and updating the approximation residue using the binary tensors and scale factor pairs.
15. An apparatus comprising: a memory to store input initial, intermediate, and final results; a neural network; and a processor to initialize an approximation residue as a trained weight tensor for the trained CNN; determine a plurality of binary tensors and scale factor pairs; and update the approximation residue using the binary tensors and scale factor pairs.
16-29. (canceled)
30. The machine-readable medium of claim 14, wherein the method further comprises: iteratively determining additional binary tensors and scale factor pairs and updating the approximation residue using the additional binary tensors and scale factor pairs.
31. The machine-readable medium of claim 30, wherein iteratively determining additional binary tensors and scale factor pairs and updating the approximation residue using the additional binary tensors and scale factor pairs is repeated to find a maximum network efficiency.
32. The apparatus of claim 15, wherein the processor is further operable to iteratively determine additional binary tensors and scale factor pairs and update the approximation residue using the additional binary tensors and scale factor pairs.
33. The apparatus of claim 32, wherein the processor iteratively determines additional binary tensors and scale factor pairs and updates the approximation residue using the additional binary tensors and scale factor pairs is repeated to find a maximum network efficiency.
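Claims 7 and 8 describe two ways of choosing scale factors for sign-of-residue binary tensors: the average magnitude of the residue, or a least squares regression over all the binary tensors. The NumPy sketch below shows one possible reading, in which the scale factors are refit jointly after a greedy pass. The helper name `refine_scales`, the {-1, +1} encoding, and the decision to refit once at the end rather than at every step are assumptions made for illustration; the claim text does not settle them.

```python
import numpy as np

def refine_scales(weights, binaries):
    """Refit all scale factors jointly: minimize ||w - sum_k a_k * B_k||_2 over a."""
    target = np.asarray(weights, dtype=np.float64).ravel()
    # One flattened binary tensor per column of the design matrix.
    basis = np.stack(
        [np.asarray(b, dtype=np.float64).ravel() for b in binaries], axis=1
    )
    scales, *_ = np.linalg.lstsq(basis, target, rcond=None)
    return scales

# Stand-in for a trained weight tensor (random data, illustration only).
rng = np.random.default_rng(1)
w = rng.normal(size=(4, 3, 3))

# Greedy pass: sign of the residue with an average-magnitude scale factor.
residue, binaries, greedy_scales = w.copy(), [], []
for _ in range(3):
    b = np.where(residue >= 0, 1.0, -1.0)
    a = np.abs(residue).mean()
    binaries.append(b)
    greedy_scales.append(a)
    residue = residue - a * b

# Least squares refit of the scale factors over the same binary tensors.
refined = refine_scales(w, binaries)
greedy_err = np.linalg.norm(residue)
refined_err = np.linalg.norm(w - sum(a * b for a, b in zip(refined, binaries)))
```

Because the refit searches over every possible set of scale factors for the same binary tensors, `refined_err` cannot exceed `greedy_err`; how much it improves depends on the weights.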

Assignees

Intel Corp

Inventors

Classifications

G06N3/082
modifying the architecture, e.g. adding, deleting or silencing nodes or connections · CPC title
G06N3/084
Backpropagation, e.g. using gradient descent · CPC title
G06N3/04
Architecture, e.g. interconnection topology · CPC title
G06N3/08 · Primary
Learning methods · CPC title
G06N3/044
Recurrent networks, e.g. Hopfield networks · CPC title

Patent family

Related publications grouped by family.

Patent family 64397018

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2020082264A1 cover? Methods and apparatus are disclosed for enhancing a neural network using binary tensor and scale factor pairs. For one example, a method of optimizing a trained convolutional neural network (CNN) includes initializing an approximation residue as a trained weight tensor for the trained CNN. A plurality of binary tensors and scale factor pairs are determined. The approximation residue is updated …
Who is the assignee on this patent? Intel Corp

Related patents

Hybrid Filter Banks for Artificial Neural Networks

Neural network processing method and evaluation method, and data analysis method and device

Error tolerant neural network model compression

Information processing apparatus, information processing method, and program

Quantized neural network training and inference
