Reconfigurable digital signal processing (DSP) vector engine

US11983530B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11983530-B2
Application numberUS-202016833164-A
CountryUS
Kind codeB2
Filing dateMar 27, 2020
Priority dateMar 27, 2020
Publication dateMay 14, 2024
Grant dateMay 14, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods described herein may relate to providing a dynamically configurable circuitry able to process data associated with a variety of matrix dimensions using one or more complex number operations, one or more real number operations, or both. Configurations may be applied to the configurable circuitry to program the configurable circuitry for a next operation. The configurable circuitry may process data according to a variety of operations based at least in part on operation of a repeated processing element coupled in a compute network of processing elements.

First claim

Opening claim text (preview).

What is claimed is: 1. An integrated circuit, comprising: a first circuit; interface circuitry; and an integrated accumulator coupled to the first circuit through the interface circuitry, wherein the integrated accumulator is configurable to perform a complex number operation at a first time and to perform a real number operation at a second time, wherein the integrated accumulator performs the real number operation using first real type data as an input and generates second real type data as an output, wherein the integrated accumulator comprises a plurality of multipliers and a plurality of adders, wherein the integrated accumulator has a reprogrammable data width, and wherein a data width of data processed using the complex number operation or the real number operation is different from a data width of the interface circuitry. 2. The integrated circuit of claim 1 , wherein a first portion of the processing circuit comprises a first compute network comprising a first subset of the plurality of multipliers and the plurality of adders arranged in a first pattern, and wherein a second portion of the processing circuit comprises a second compute network comprising a second subset of the plurality of multipliers and the plurality of adders also arranged in the first pattern. 3. The integrated circuit of claim 1 , wherein the data width of data processed using the complex number operation or the real number operation is different from a data width of the first circuit. 4. The integrated circuit of claim 1 , wherein the integrated accumulator comprises multiplexing circuitry configured to route data through a multiplier of the plurality of multipliers to perform the complex number operation on the data. 5. The integrated circuit of claim 1 , wherein the integrated accumulator comprises multiplexing circuitry configured to route data through a multiplier of the plurality of multipliers to perform the real number operation on the data. 6. The integrated circuit of claim 1 , wherein the integrated accumulator is configured to perform the complex number operation and to perform a different complex number operation in parallel. 7. The integrated circuit of claim 1 , wherein the integrated accumulator is configured to perform the complex number operation at a first time and to perform a Radix-2 number operation at a second time using a same circuit. 8. The integrated circuit of claim 1 , wherein the integrated accumulator is configured to perform the complex number operation on a first matrix of a first size at a first time and to perform the complex number operation of a second matrix of a second size different from the first size at a second time without reconfiguring between the first time and the second time. 9. The integrated circuit of claim 1 , wherein the integrated accumulator is configured to perform the complex number operation on a first matrix at a first time, wherein the integrated accumulator is configured to perform the real number operation on a second matrix at the first time, wherein the integrated accumulator is configured to perform the complex number operation on a third matrix at a second time, wherein the integrated accumulator is configured to perform the real number operation on a fourth matrix at the second time, wherein the third matrix is larger than the first matrix, and wherein the fourth matrix is smaller than the second matrix. 10. The integrated circuit of claim 1 , comprising a controller configured to operate the integrated accumulator, wherein the integrated accumulator comprises an instruction decoder configured to generate and apply a configuration for the integrated accumulator in response to an instruction generated by the controller to operate the integrated accumulator. 11. The integrated circuit of claim 1 , wherein the integrated accumulator comprises a compute network coupled to an instruction decoder, wherein the compute network comprises one or more multipliers of the plurality of multipliers and one or more adders of the plurality of adders and is configured with control signals generated by the instruction decoder to adjust a relative weighting associated with the one or more multipliers or the one or more adders. 12. A method of operating a compute network of a first processing circuit to perform a complex number operation, comprising: receiving, via the first processing circuit, an instruction from a controller; receiving, via the first processing circuit, input data; generating, via the first processing circuit, a configuration for the compute network in response to the instruction; programming, via the first processing circuit, an integrated accumulator of the compute network based on the configuration, wherein the integrated accumulator is configurable to perform the complex number operation at a first time and to perform a real number operation at a second time based on the configuration, wherein the real number operation is performed on first real type data as an input and generates second real type data as an output, wherein the integrated accumulator is configurable to perform the complex number operation on a first matrix of a first size at a first time and to perform the complex number operation of a second matrix of a second size different from the first size at a second time without reconfiguring between the first time and the second time; transmitting, via the first processing circuit, the input data through the first processing circuit to perform the complex number operation on the input data; and outputting, via the first processing circuit, the input data to a first circuit of an integrated circuit. 13. The method of claim 12 , further comprising applying the configuration to the compute network by: generating, via the first processing circuit, an operational mask; and applying, via the first processing circuit, the operational mask to the compute network to deactivate a subset of inputs of the compute network from applying the input data to the complex number operation. 14. The method of claim 12 , further comprising applying the configuration to the compute network by: configuring, via the first processing circuit, multiplexing circuitry of the compute network, wherein the multiplexing circuitry is configured to route the input data through the compute network to perform the complex number operation on the input data after application of the configuration. 15. The method of claim 12 , wherein the integrated accumulator has a reprogrammable data width, and wherein a data width of data processed using the complex number operation or the real number operation is different from a data width of the interface circuitry. 16. A system, comprising: a first circuit; and an integrated accumulator coupled to the first circuit, wherein the integrated accumulator is configurable to perform one or more real number operations and one or more complex number operations, wherein the real number operation is performed on real type data as an input and generates real type data as an output, wherein the integrated accumulator comprises a compute network coupled to an instruction decoder, wherein the compute network comprises one or more multipliers and one or more adders and is configured with control signals generated by the instruction decoder to adjust a relative weighting associated with the one or more multipliers or the one or more adders, wherein the integrated accumulator is configurable to: receive an instruction from a controller; receive input data from the first circuit; generate a configuration for the compute network in response to the instruc

Assignees

Inventors

Classifications

  • G06F9/3001Primary

    Arithmetic instructions · CPC title

  • Computations with complex numbers · CPC title

  • Computations with a radix, other than binary, 8, 16 or decimal, e.g. ternary, negative or imaginary radices, mixed radix {non-linear PCM (G06F7/4824 takes precedence)} · CPC title

  • Adding; Subtracting (G06F7/483 - G06F7/491, G06F7/544 - G06F7/556 take precedence) · CPC title

  • Multiplying only · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11983530B2 cover?
Systems and methods described herein may relate to providing a dynamically configurable circuitry able to process data associated with a variety of matrix dimensions using one or more complex number operations, one or more real number operations, or both. Configurations may be applied to the configurable circuitry to program the configurable circuitry for a next operation. The configurable circ…
Who is the assignee on this patent?
Intel Corp
What technology area does this patent fall under?
Primary CPC classification G06F9/3001. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 14 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).