Load store circuit with dedicated single or dual bit shift circuit and opcodes for low power accelerator processor
US-2017060586-A1 · Mar 2, 2017 · US
US9817791B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9817791-B2 |
| Application number | US-201514678939-A |
| Country | US |
| Kind code | B2 |
| Filing date | Apr 4, 2015 |
| Priority date | Apr 4, 2015 |
| Publication date | Nov 14, 2017 |
| Grant date | Nov 14, 2017 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Methods and apparatus for a low energy accelerator processor architecture with short parallel instruction word. An integrated circuit includes a system bus having a data width N, where N is a positive integer; a central processor unit coupled to the system bus and configured to execute instructions retrieved from a memory coupled to the system bus; and a low energy accelerator processor coupled to the system bus and configured to execute instruction words retrieved from a low energy accelerator code memory, the low energy accelerator processor having a plurality of execution units including a load store unit, a load coefficient unit, a multiply unit, and a butterfly/adder ALU unit, each of the execution units configured to perform operations responsive to op-codes decoded from the retrieved instruction words, wherein the width of the instruction words is equal to the data width N. Additional methods and apparatus are disclosed.
Opening claim text (preview).
What is claimed is: 1. An integrated circuit, comprising: a system bus for transferring data between memory devices, processors, and peripheral devices having a data width N, where N is a positive integer; a central processor unit coupled to the system bus and configured to execute instructions retrieved from a memory coupled to the system bus; and a low energy accelerator processor coupled to the system bus and configured to execute instruction words retrieved from a low energy accelerator code memory coupled to the system bus, the low energy accelerator processor having a plurality of execution units including a load store unit, a load coefficient unit, a multiply unit, and a butterfly/adder unit, each of the execution units configured to perform operations responsive to op-codes decoded from the retrieved instruction words, wherein the width of the instruction words is equal to a selected one of the data width N of the system bus and twice the data width N of the system bus; wherein the instruction words for the low energy accelerator processor have a field of 11 bits containing opcodes for the load store unit. 2. The integrated circuit of claim 1 , wherein the low energy accelerator processor further comprises the multiply unit and the butterfly/adder ALU unit configured to execute fixed point instructions. 3. The integrated circuit of claim 1 , and further comprising a peripheral bridge unit coupled to the system bus and configured to communicate data to and from additional circuitry. 4. The integrated circuit of claim 1 , and further comprising additional circuitry comprising analog circuitry. 5. The integrated circuit of claim 4 , wherein the analog circuitry further comprises an analog to digital converter circuit. 6. The integrated circuit of claim 1 , wherein the instruction words stored for the low energy accelerator processor include instruction words configured to cause the low energy accelerator to perform vector computations. 7. The integrated circuit of claim 1 , wherein the low energy accelerator further comprises the multiply unit and the butterfly/adder ALU unit configured to perform vector operations in floating point and fixed point computations. 8. The integrated circuit of claim 7 , and further comprising an operand overload register containing a flag indicating a floating point computation. 9. The integrated circuit of claim 1 , wherein the instruction words for the low energy accelerator processor have a field of 1 bits containing opcodes for the load coefficient unit. 10. The integrated circuit of claim 1 , wherein the instruction words for the low energy accelerator processor have a field of 6 bits for the multiply unit. 11. The integrated circuit of claim 1 , wherein the instruction words for the low energy accelerator processor have a field of 13 bits for the butterfly/adder ALU unit.
with variable precision · CPC title
Parallel decoding, e.g. parallel decode units · CPC title
of compound instructions · CPC title
for loops, e.g. loop detection or loop counter · CPC title
comprising an array of processing units with common control, e.g. single instruction multiple data processors (G06F15/82 takes precedence {; for correlation function computation G06F17/15}) · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.