Processor-based apparatus and method for processing bit streams using bit-oriented instructions through byte-oriented storage
US-9740484-B2 · Aug 22, 2017 · US
US10705840B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-10705840-B2 |
| Application number | US-201916452868-A |
| Country | US |
| Kind code | B2 |
| Filing date | Jun 26, 2019 |
| Priority date | Jun 27, 2017 |
| Publication date | Jul 7, 2020 |
| Grant date | Jul 7, 2020 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
An apparatus integrates arithmetic with logic operations. The apparatus includes a calculation device that calculates source data to generate and output first destination data. The apparatus further includes a normalization unit, coupled to the calculation device, that normalizes the first destination data to generate second destination data of a first type when receiving a signal indicating an output of first-type data, and normalizing the first destination data to generate the second destination data of a second type when receiving the signal indicating an output of second-type data.
Opening claim text (preview).
What is claimed is: 1. An apparatus for integrating arithmetic with logic operations, comprising: a calculation device, calculating source data to generate and output first destination data; and a normalization unit, coupled to the calculation device, normalizing the first destination data to generate second destination data of a first type when receiving a signal indicating an output of first-type data, and normalizing the first destination data to generate the second destination data of a second type when receiving the signal indicating an output of second-type data; wherein the normalization unit comprises: a shifter; an adder; a comparator, coupled to the shifter and the adder, wherein the comparator, the shifter and the adder form a loop for performing a normalization of a floating-point value; and a merger, coupled to the shifter and the adder, combining a sign bit, a mantissa output from the shifter and an exponent output from the adder to generate the second destination data. 2. The apparatus of claim 1 , wherein the normalization unit is coupled to the calculation device via a delay circuit, wherein the calculation device generates and outputs the first destination data at one cycle and the normalization unit normalizes the first destination data to generate the second destination data of the first type or the second type at the next cycle. 3. The apparatus of claim 1 , wherein the calculation device performs a calculation: dest=Src 0 ×Src 1 +Src 2 , Src 0 , Src 1 and Src 2 represent the source data of three source memories, and dest represents the first destination data. 4. The apparatus of claim 1 , wherein the second destination data of the first type is 24-bit floating-point value, and the second destination data of the second type is 32-bit floating-point value. 5. The apparatus of claim 1 , wherein the shifter drops 7 bits from a mantissa of 34 bits of the first destination data and the adder drops 1 bit from an exponent of 10 bits of the first destination data when receiving the signal indicating an output of first-type data. 6. The apparatus of claim 5 , wherein the comparator repeatedly operates until the MSB (Most Significant Bit) of a output from the shifter is 1, wherein, in each iteration, when the MSB of the output of the shifter is not 1, the comparator directs the shifter to left-shift the mantissa of the first destination data by one bit and directs the adder to add −1 to the exponent of the first destination data.
using a mask · CPC title
controlled by a single instruction for multiple threads [SIMT] in parallel · CPC title
controlled by a single instruction for multiple data lanes [SIMD] · CPC title
Instructions to perform operations on packed data, e.g. vector, tile or matrix operations · CPC title
Processor architectures; Processor configuration, e.g. pipelining · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.