Systems, apparatuses, and methods for setting an output mask in a destination writemask register from a source write mask register using an input writemask and immediate

US9703558B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9703558-B2
Application numberUS-201113991877-A
CountryUS
Kind codeB2
Filing dateDec 23, 2011
Priority dateDec 23, 2011
Publication dateJul 11, 2017
Grant dateJul 11, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Embodiments of systems, apparatuses, and methods for performing in a computer processor generation of a predicate mask based on vector comparison in response to a single instruction are described.

First claim

Opening claim text (preview).

What is claimed is: 1. A method of performing in a computer processor generation of a predicate mask based on vector comparison in response to a single instruction that includes a single source writemask register operand, a destination writemask register operand, a control writemask operand, and an opcode, the method comprising of: decoding the instruction; executing the decoded instruction to determine a least significant bit position of the source writemask register that is a true value by determining a least significant bit position of the source writemask register that is a true value further comprises determining that a value in the control writemask register at a corresponding bit position is also true, and store a true value in each bit position of the destination writemask register that is of lesser significance than the determined corresponding bit position. 2. The method of claim 1 , further comprising: storing a false value in each bit position of the destination writemask register that is of equal or greater significance than the determined corresponding bit position. 3. The method of claim 1 , further comprising: storing a true value in a bit position of the destination writemask register that is of equal significance to the determined corresponding bit position. 4. The method of claim 1 , wherein the registers are 16-bit registers. 5. The method of claim 1 , wherein the registers are 64-bit registers. 6. An article of manufacture comprising: a non-transitory tangible machine-readable storage medium having stored thereon an occurrence of an instruction; wherein the instruction's format specifies as its source operand a single source writemask register, as its destination a single destination writemask register, and a control writemask operand; and wherein the instruction format includes an opcode which instructs a machine, responsive to the single occurrence of the single instruction, to cause a determination of a least significant bit position of the source writemask register that is a true value and storage of a true value in each bit position of the destination writemask register that is of lesser significance than a determined corresponding bit position, wherein the determination is to determine a least significant bit position of the source writemask register that is a true value further comprises determining that a value in the control writemask register at a corresponding bit position is also true. 7. The article of manufacture of claim 6 , further comprising: storage of a false value in each bit position of the destination writemask register that is of equal or greater significance than the determined corresponding bit position. 8. The article of manufacture of claim 6 , further comprising: storage of a true value in a bit position of the destination writemask register that is of equal significance to the determined corresponding bit position. 9. The article of manufacture of claim 6 , wherein the registers are 16-bit registers. 10. The article of manufacture of claim 6 , wherein the registers are 64-bit registers. 11. An apparatus comprising: a hardware decoder to decode an instruction that includes a single source writemask register operand, a destination writemask register operand, a control writemask operand, and an opcode; and execution circuitry to execute the decoded instruction to determine a least significant bit position of the source writemask register that is a true value and store a true value in each bit position of the destination writemask register that is lesser significance than a determined corresponding bit position by determining that a value in the control writemask register at a corresponding bit position is also true. 12. The apparatus of claim 11 , further comprising: the execution circuitry to store a false value in each bit position of the destination writemask register that is of equal or greater significance than the determined corresponding bit position. 13. The apparatus of claim 11 , further comprising: the execution circuitry to store a true value in a bit position of the destination writemask register that is of equal significance to the determined corresponding bit position. 14. The apparatus of claim 11 , wherein the registers are 16-bit registers.

Assignees

Inventors

Classifications

  • single instruction multiple data [SIMD] multiprocessors · CPC title

  • Special arrangements thereof, e.g. mask or switch · CPC title

  • Array of vector units · CPC title

  • to perform conditional operations, e.g. using predicates or guards · CPC title

  • Instructions to perform operations on packed data, e.g. vector, tile or matrix operations · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9703558B2 cover?
Embodiments of systems, apparatuses, and methods for performing in a computer processor generation of a predicate mask based on vector comparison in response to a single instruction are described.
Who is the assignee on this patent?
Lee Victor W, Kim Daehyun, Ngai Tin-Fook, and 5 more
What technology area does this patent fall under?
Primary CPC classification G06F9/30036. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 11 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).