Efficient method for VLSI implementation of useful neural network activation functions

US12400112B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12400112-B2
Application numberUS-202017115285-A
CountryUS
Kind codeB2
Filing dateDec 8, 2020
Priority dateDec 8, 2020
Publication dateAug 26, 2025
Grant dateAug 26, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A neural inference chip is provided, including at least one neural inference core. The at least one neural inference core is adapted to apply a plurality of synaptic weights to a plurality of input activations to produce a plurality of intermediate outputs. The at least one neural inference core comprises a plurality of activation units configured to receive the plurality of intermediate outputs and produce a plurality of activations. Each of the plurality of activation units is configured to apply a configurable activation function to its input. The configurable activation function has at least a re-ranging term and a scaling term, the re-ranging term determining the range of the activations and the scaling term determining the scale of the activations. Each of the plurality of activations units is configured to obtain the re-ranging term and the scaling term from one or more look up tables.

First claim

Opening claim text (preview).

What is claimed is: 1. A neural inference chip comprising: at least one neural inference core adapted to apply a plurality of synaptic weights to a plurality of input data tensors to produce a plurality of intermediate output data tensors, the at least one neural inference core comprising a plurality of activation units configured to receive the plurality of intermediate output data tensors and produce a plurality of activations, each of the plurality of activation units being configured to apply a configurable activation function to its input data tensor, the configurable activation function having at least a re-ranging term and a scaling term, the re-ranging term applying a bias that determines a range of the plurality of activations and the scaling term applying a slope that determines a scale of the plurality of activations, wherein the re-ranging term and the scaling term are obtained separately from predetermined sets, each predetermined set being associated with one of the re-ranging term or the scaling term, each of the plurality of activations units obtaining the re-ranging term and the scaling term from one or more look up tables. 2. The neural inference chip of claim 1 , wherein the plurality of activations have flexible precision. 3. The neural inference chip of claim 1 , wherein the plurality of activations have a floating point value. 4. The neural inference chip of claim 1 , wherein the plurality of input data tensors have 16 -bit precision. 5. The neural inference chip of claim 1 , wherein the plurality of input data tensors have 32 -bit precision. 6. The neural inference chip of claim 1 , wherein each predetermined set is a lookup table. 7. The neural inference chip of claim 1 , wherein each of the re-ranging term and the scaling term are learned. 8. The neural inference chip of claim 1 , wherein the configurable activation function is selected from a list consisting of: Boolean, trinary, linear, ReLU, shifted ReLU, ExpReLU, sigmoid, and tanh. 9. An integrated circuit, comprising: at least one neural inference core, adapted to apply a plurality of synaptic weights to a plurality of input data tensors to produce a plurality of intermediate output data tensors, the at least one neural inference core comprising a plurality of activation units configured to receive the plurality of intermediate output data tensors and produce a plurality of activations, each of the plurality of activation units being configured to apply an activation function to its input data tensor, the activation function being Boolean, trinary, linear, ReLU, shifted ReLU, ExpReLU, sigmoid, or tanh, the activation function having at least a re-ranging term and a scaling term, the re-ranging term applying a bias that determines a range of the plurality of activations and the scaling term applying a slope that determines a scale of the plurality of activations, each of the re-ranging term and the scaling term having an associated lookup table, each of the plurality of activations units obtaining the re-ranging term and the scaling term separately from the associated lookup tables. 10. A computer-implemented method, comprising: applying a plurality of synaptic weights to a plurality of input data tensors to produce a plurality of intermediate output data tensors; receiving the plurality of intermediate output data tensors and produce therefrom a plurality of activations, wherein producing the plurality of activations comprises: applying a configurable activation function to the plurality of intermediate output data tensors, the configurable activation function having at least a re-ranging term and a scaling term, the re-ranging term applying a bias that determines a range of the plurality of activations and the scaling term applying a slope that determines a scale of the plurality of activations, each of the re-ranging term and the scaling term having an associated lookup table, and obtaining the re-ranging term and the scaling term separately from the associated lookup tables. 11. The method of claim 10 , wherein the plurality of activations have flexible precision. 12. The method of claim 10 , wherein the plurality of activations have a floating point value. 13. The method of claim 10 , wherein the plurality of input data tensors have 16-bit precision. 14. The method of claim 10 , wherein the plurality of input data tensors have 32-bit precision. 15. The method of claim 10 , wherein each of the re-ranging term and the scaling term have one associated lookup table. 16. The method of claim 10 , wherein each of the re-ranging term and the scaling term are learned. 17. The method of claim 10 , wherein the configurable activation function is selected from a list consisting of: Boolean, trinary, linear, ReLU, shifted ReLU, ExpReLU, sigmoid, and tanh.

Assignees

Inventors

Classifications

  • Convolutional networks [CNN, ConvNet] · CPC title

  • G06N3/063Primary

    using electronic means · CPC title

  • G06N3/04Primary

    Architecture, e.g. interconnection topology · CPC title

  • Inference or reasoning models · CPC title

  • Activation functions · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12400112B2 cover?
A neural inference chip is provided, including at least one neural inference core. The at least one neural inference core is adapted to apply a plurality of synaptic weights to a plurality of input activations to produce a plurality of intermediate outputs. The at least one neural inference core comprises a plurality of activation units configured to receive the plurality of intermediate output…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06N3/063. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 26 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 9 related publications on this page (citations in our corpus or others sharing the same primary CPC).