Tunable hardware sort engine for performing composite sorting algorithms
US-9710503-B2 · Jul 18, 2017 · US
US9448794B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9448794-B2 |
| Application number | US-201113977735-A |
| Country | US |
| Kind code | B2 |
| Filing date | Nov 30, 2011 |
| Priority date | Nov 30, 2011 |
| Publication date | Sep 20, 2016 |
| Grant date | Sep 20, 2016 |
Official abstract text for this publication.
Instructions and logic provide vector horizontal majority voting functionality. Some embodiments, responsive to an instruction specifying: a destination operand, a size of the vector elements, a source operand, and a mask corresponding to a portion of the vector element data fields in the source operand; read a number of values from data fields of the specified size in the source operand, corresponding to the mask specified by the instruction and store a result value to that number of corresponding data fields in the destination operand, the result value computed from the majority of values read from the number of data fields of the source operand.
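The abstract's read, vote, and store sequence can be illustrated with a minimal software sketch. This is not the patented hardware implementation. The function name `horizontal_majority`, the list-based vector representation, the convention that a mask bit of 1 selects an element, and the choice to leave the destination unchanged when no value holds a strict majority are all assumptions made for illustration.

```python
from collections import Counter


def horizontal_majority(src, mask, dest):
    """Return a copy of dest with the majority value written to selected fields.

    src, mask, and dest are equal-length lists modelling vector data fields.
    Assumption: mask[i] == 1 means element i takes part in the vote.
    """
    # Read only the values whose data fields are selected by the mask.
    selected = [value for value, bit in zip(src, mask) if bit]
    if not selected:
        return list(dest)

    # Find the most common value among the selected elements.
    value, count = Counter(selected).most_common(1)[0]

    # Assumption: without a strict majority the destination is left as-is.
    if 2 * count <= len(selected):
        return list(dest)

    # Store the result to the destination fields that correspond to the mask.
    return [value if bit else old for old, bit in zip(dest, mask)]


result = horizontal_majority(
    src=[7, 7, 3, 7, 9, 7, 1, 7],
    mask=[1, 1, 1, 1, 0, 0, 0, 0],
    dest=[0] * 8,
)
print(result)
```

Here three of the four selected elements hold 7, so 7 is written to the four selected destination fields and the remaining fields keep their previous contents.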
Opening claim text (preview).
What is claimed is:

1. A processor comprising: a vector register comprising a plurality of data fields to store values of vector elements; a decode stage to decode a first instruction specifying: a destination operand, a size of the vector elements, a portion of the plurality of data fields, and a source operand; and an execution unit, responsive to the decoded first instruction, to: read a number of values from data fields of the size of the vector elements in the source operand; and store a result value in the destination operand specified by the first instruction, wherein the result value is computed from most common values read from the number of the values from the data fields of the source operand, and wherein the result value is computed as a bitwise majority value from the values read from the data fields of the source operand and the bitwise majority value has a bias for one or a bias for zero in case of ties.

2. The processor of claim 1, wherein the execution unit, responsive to the decoded first instruction, is to store the result value to a number of corresponding data fields in the destination operand specified by the first instruction.

3. The processor of claim 1, wherein the first instruction specifies a mask identifying the portion of the plurality of data fields, and wherein the number of the values read from the data fields in the source operand corresponds to vector elements in the source operand unmasked by the mask specified by the first instruction.

4. The processor of claim 3, wherein the result value is computed as the bitwise majority value from the unmasked vector elements of the values read from the data fields of the source operand.

5. The processor of claim 3, wherein the result value is computed as the bitwise majority value from at least k ones of the unmasked vector elements of the values read from the data fields of the source operand, and wherein k is specified by the first instruction.

6. The processor of claim 1, wherein the result value is computed as a value of a majority of matching ones of the values read from the data fields of vector elements of the source operand.

7. The processor of claim 6, wherein the result value is computed as a value of a majority from at least k matching ones of the values read from unmasked ones of the vector elements of the source operand, and wherein k is specified by the first instruction.

8. A non-transitory machine-readable medium having a first executable instruction stored therein that, when executed by a machine, causes the machine to: read a number of values from a portion of a plurality of data fields in a source operand specified by the first executable instruction, wherein the data fields store values of masked and unmasked vector elements, and wherein the number corresponds to a same number of the unmasked vector elements according to a mask specified by the first executable instruction; compute a result value to represent a majority of the number of the values read; and store the result value in a destination operand specified by the first executable instruction, wherein the result value is computed as a bitwise majority value from the unmasked vector elements of the values read from the portion of the plurality of data fields of the source operand and the bitwise majority value has a bias for one or a bias for zero in case of ties.

9. The non-transitory machine-readable medium of claim 8, wherein to store the result value in the destination operand comprises storage of the result value to corresponding data fields in the destination operand specified by the first executable instruction.

10. The non-transitory machine-readable medium of claim 8, wherein the result value is computed as the bitwise majority value from the unmasked vector elements of the values read from the data fields of the source operand.

11. The non-transitory machine-readable medium of claim 8, wherein the result value is computed as the bitwise majority value from at least k ones of the unmasked vector elements of the values read from the data fields of the source operand, and wherein k is specified by the first executable instruction.

12. The non-transitory machine-readable medium of claim 8, wherein the result value is computed as a value of a majority of matching ones of the values read from the data fields of the unmasked vector elements of the source operand.

13. A processing system comprising: a memory; and a plurality of processors, wherein each of the plurality of processors comprises: a vector register comprising a plurality of data fields to store values of vector elements; a decode stage to decode a first instruction specifying: a destination operand, a size of the vector elements, a portion of the plurality of data fields, and a source operand; and an execution unit, responsive to the decoded first instruction, to: read a number of values from data fields of the size of the vector elements in the source operand; and store a result value in the destination operand specified by the first instruction, wherein the result value is computed from most common values read from the number of the values from the data fields of the source operand, and wherein the result value is computed as a bitwise majority value from the values read from the data fields of the source operand and the bitwise majority value has a bias for one or a bias for zero in case of ties.

14. The processing system of claim 13, wherein the execution unit, responsive to the decoded first instruction, is to store the result value to a number of corresponding data fields in the destination operand specified by the first instruction.

15. The processing system of claim 13, wherein the first instruction specifies a mask identifying the portion of the plurality of data fields, and wherein the number of the values read from the data fields in the source operand corresponds to vector elements in the source operand unmasked by the mask specified by the first instruction.

16. The processing system of claim 13, wherein the result value is computed as the bitwise majority value from at least k ones of the unmasked vector elements of the values read from the data fields of the source operand, and wherein k is specified by the first instruction.

17. The processing system of claim 13, wherein the result value is computed as a value of a majority of matching ones of the values read from the data fields of vector elements of the source operand.

18. The processing system of claim 17, wherein the result value is computed as a value of a majority from at least k matching ones of the values read from unmasked ones of the vector elements of the source operand, and wherein k is specified by the first instruction.

19. A computer-implemented method comprising: read a number of values from a portion of a plurality of data fields in a source operand specified by a first executable instruction, wherein the data fields store values of masked and unmasked vector elements, and wherein the number corresponds to a same number of the unmasked vector elements according to a mask specified by the first executable instruction; compute a result value to represent a majority of the number of the values read; and store the result value in a destination operand specified by the first executable instruction, wherein the result value is computed as a bitwise majority value from the unmasked vector elements of the values read from the portion of the plurality of data fields of the source operand and the bitwise majority value has a
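The claims recite a bitwise majority value with a bias for one or for zero in case of ties, and some dependent claims add a threshold k. The sketch below shows one possible software reading of that computation. It is not the claimed circuit. The function name `bitwise_majority`, the `width` and `tie_bias` parameters, and the interpretation of k as "set a result bit when at least k elements have that bit set" are assumptions made for illustration.

```python
def bitwise_majority(values, width, tie_bias=1, k=None):
    """Compute a per-bit majority across a list of unsigned integers.

    values   : the (already unmasked) vector element values
    width    : element size in bits
    tie_bias : bit value used when ones and zeros are equally common
    k        : optional threshold (assumed reading: a result bit is set
               when at least k elements have that bit set)
    """
    n = len(values)
    result = 0
    for bit in range(width):
        # Count how many elements have a one in this bit position.
        ones = sum((value >> bit) & 1 for value in values)
        if k is not None:
            set_bit = ones >= k
        elif 2 * ones == n:
            set_bit = bool(tie_bias)  # tie: fall back to the chosen bias
        else:
            set_bit = 2 * ones > n
        if set_bit:
            result |= 1 << bit
    return result


elements = [0b1100, 0b1010, 0b0110, 0b0001]
print(format(bitwise_majority(elements, width=4, tie_bias=1), "04b"))
print(format(bitwise_majority(elements, width=4, tie_bias=0), "04b"))
```

With these four elements, bit positions 1 through 3 are tied at two ones each, so the two calls differ only in how the tie bias resolves those positions.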
Generic software techniques for error detection or fault masking · CPC title
Compare instructions, e.g. Greater-Than, Equal-To, MINMAX · CPC title
Arrangements for sorting or merging computer data on continuous record carriers, e.g. tape, drum, disc · CPC title
Instruction analysis, e.g. decoding, instruction word fields · CPC title
using arrangements adapted for a specific error detection or correction feature · CPC title