Systems and methods for structured mixed-precision in a specialized processing block

US12554464B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12554464-B2
Application numberUS-202117559338-A
CountryUS
Kind codeB2
Filing dateDec 22, 2021
Priority dateDec 22, 2021
Publication dateFeb 17, 2026
Grant dateFeb 17, 2026

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

This disclosure is directed to a digital signal processing (DSP) block that includes multiple weight registers configurable to receive and store a first plurality of values having multiple precisions, and multiple multipliers that are each configurable to receive a respective value of the first plurality of values. The DSP block further includes one or more inputs configurable to receive a second plurality of values, and a multiplexer network configurable to receive the second plurality of values and route each respective value of the second plurality of values to a multiplier of the multipliers. The multipliers are configurable to simultaneously multiply each value of the first plurality of values by a respective value of the second plurality of values to generate a plurality of products. Additionally, the DSP block includes adder circuitry configurable to generate a first sum and a second sum based on the plurality of products.

First claim

Opening claim text (preview).

What is claimed is: 1 . A digital signal processing (DSP) block comprising: a plurality of weight registers configurable to receive and store a first plurality of values, wherein the first plurality of values comprises a first portion and a second portion, wherein: the first portion comprises one or more first values of the first plurality of values having a first precision; and the second portion comprises one or more second values of the first plurality of values having a second precision; a plurality of multipliers, wherein each respective multiplier of the plurality of multipliers is configurable to receive a respective value of the first plurality of values, wherein the plurality of multipliers comprises: one or more first multipliers configurable to perform multiplication involving values having the first precision; and one or more second multipliers configurable to perform multiplication involving values having the second precision; one or more inputs configurable to receive a second plurality of values; a multiplexer network configurable to receive the second plurality of values and route each respective value of the second plurality of values to a multiplier of the plurality of multipliers, wherein the plurality of multipliers is configurable to simultaneously multiply each value of the first plurality of values by a respective value of the second plurality of values to generate a plurality of products, wherein: in a first mode of operation, the multiplexer network is configurable to route the second plurality of values to the one or more first multipliers and the one or more second multipliers; and in a second mode of operation, the multiplexer network is configurable to route the second plurality of values only to the one or more first multipliers; and adder circuitry configurable to generate a first sum and a second sum based on the plurality of products. 2 . The DSP block of claim 1 , comprising a plurality of control registers configurable to store a third plurality of values, wherein: the multiplexer network is configurable to receive the third plurality of values from the plurality of control registers; and route the second plurality of values based on the third plurality of values. 3 . The DSP block of claim 2 , wherein the values of the third plurality of values are respectively indicative of whether a value of the second plurality of values is either to be multiplied by a value of the first portion of the first plurality of values or multiplied by a value of the second portion of the first plurality of values. 4 . The DSP block of claim 1 , wherein: the first precision is more precise than the second precision; and the multiplexer network is configurable to: route a first portion of the second plurality of values to the one or more first multipliers; and route a second portion of the second plurality values to the one or more second multipliers; the plurality of multipliers is configurable to generate the plurality of products by: generating one or more first products by multiplying each of the one or more first values by a value of the first portion of the second plurality of values; and generating one or more second products by multiplying each of the one or more second values by a value of the second portion of the second plurality of values. 5 . The DSP block of claim 4 , wherein the one or more second multipliers are not configurable to perform multiplication between values having the first precision. 6 . The DSP block of claim 4 , wherein the adder circuitry is configurable to: generate the first sum by adding the one or more first products; and generate the second sum by adding the one or more second products. 7 . The DSP block of claim 6 , wherein the adder circuitry comprises: a first adder configurable to generate a third sum by adding the first sum and the second sum; a second adder configurable to generate a fourth sum by adding the second sum and a first value received from a second DSP block; and a third adder configurable to generate a fifth sum by: adding the first sum and a second value received from the second DSP block; or adding the third sum and the second value received from the second DSP block. 8 . The DSP block of claim 1 , wherein: the first precision and the second precision are of the same precision; and the plurality of multipliers is configurable to generate the plurality of products using only the one or more first multipliers. 9 . The DSP block of claim 1 , wherein the DSP block is part of a field programmable gate array (FPGA). 10 . An integrated circuit device comprising: a digital signal processing (DSP) block, the DSP block comprising: a plurality of weight registers configurable to receive and store a first plurality of values, wherein the first plurality of values comprises a first portion and a second portion, wherein: the first portion comprises one or more first values of the first plurality of values having a first precision; and the second portion comprises one or more second values of the first plurality of values having a second precision; a plurality of multipliers, wherein each respective multiplier of the plurality of multipliers is configurable to receive a respective value of the first plurality of values, wherein the plurality of multipliers comprises: one or more first multipliers configurable to perform multiplication involving values having the first precision; and one or more second multipliers configurable to perform multiplication involving values having the second precision; one or more inputs configurable to receive a second plurality of values; a multiplexer network configurable to receive the second plurality of values and route each respective value of the second plurality of values to a multiplier of the plurality of multipliers, wherein the plurality of multipliers is configurable to simultaneously multiply each value of the first plurality of values by a respective value of the second plurality of values to generate a plurality of products, wherein: in a first mode of operation, the multiplexer network is configurable to route the second plurality of values to the one or more first multipliers and the one or more second multipliers; and in a second mode of operation, the multiplexer network is configurable to route the second plurality of values only to the one or more first multipliers; and adder circuitry configurable to generate a first sum and a second sum based on the plurality of products. 11 . The integrated circuit device of claim 10 , comprising a plurality of control registers configurable to store a third plurality of values, wherein: the values of the third plurality of values are respectively indicative of whether a value of the second plurality of values is either to be multiplied by a value of the first portion of the first plurality of values or multiplied by a value of the second portion of the first plurality of values; the multiplexer network is configurable to receive the third plurality of values from the plurality of control registers; and route the second plurality of values based on the third plurality of values. 12 . The integrated circuit device of claim 11 , wherein the multiplexer network comprises a plurality of multiplexers each configurable to: receive at least two values of the second plurality of values; receive a respective value of the third plurality of values; and route one of the at least two values of the second plurality of values to a multiplier of the plurality of multipliers based on the respective value of the third plurality of values.

Assignees

Inventors

Classifications

  • Adding; Subtracting (G06F7/483 - G06F7/491, G06F7/544 - G06F7/556 take precedence) · CPC title

  • Interfaces, programming languages or software development kits, e.g. for simulating neural networks · CPC title

  • using electronic means · CPC title

  • Convolutional networks [CNN, ConvNet] · CPC title

  • for reconfigurable circuits, e.g. field programmable gate arrays [FPGA] or programmable logic devices [PLD] · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12554464B2 cover?
This disclosure is directed to a digital signal processing (DSP) block that includes multiple weight registers configurable to receive and store a first plurality of values having multiple precisions, and multiple multipliers that are each configurable to receive a respective value of the first plurality of values. The DSP block further includes one or more inputs configurable to receive a seco…
Who is the assignee on this patent?
Langhammer Martin, Wu Michael, Tunali Nihat Engin, and 1 more
What technology area does this patent fall under?
Primary CPC classification G06F7/523. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 17 2026 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 9 related publications on this page (citations in our corpus or others sharing the same primary CPC).