Systems, methods, and apparatuses for tile store

US11714642B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11714642-B2
Application numberUS-202217706428-A
CountryUS
Kind codeB2
Filing dateMar 28, 2022
Priority dateMar 20, 2017
Publication dateAug 1, 2023
Grant dateAug 1, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Embodiments detailed herein relate to matrix operations. In particular, the loading of a matrix (tile) from memory. For example, support for a loading instruction is described in at least a form of decode circuitry to decode an instruction having fields for an opcode, a source matrix operand identifier, and destination memory information, and execution circuitry to execute the decoded instruction to store each data element of configured rows of the identified source matrix operand to memory based on the destination memory information.

First claim

Opening claim text (preview).

We claim: 1. An apparatus comprising: decode circuitry to decode a single instruction having fields for an opcode to indicate execution circuitry is to save a context state to memory, wherein the context state is to include multidimensional matrix data of tiles according to two configuration bits, a first configuration bit to correspond to configuration data loaded in a tile configuration and the second configuration bit to correspond to matrix data; and execution circuitry to execute the decoded single instruction to store the context state to memory. 2. The apparatus of claim 1 , wherein the two configuration bits are located in a control register. 3. The apparatus of claim 1 , wherein the execution circuitry is a part of an accelerator. 4. The apparatus of claim 1 , wherein the execution circuitry is a part of a processor. 5. The apparatus of claim 1 , wherein the execution circuitry is further to write zeros beyond a specified number of rows of the matrix data of the tiles. 6. The apparatus of claim 1 , wherein the matrix data of the tiles is to include garbage data in areas that are not configured for use in tile operations. 7. The apparatus of claim 1 , wherein the tiles are a plurality of registers configured to represent a matrix. 8. A method comprising: decoding a single instruction having fields for an opcode to indicate execution circuitry is to save a context state to memory, wherein the context state is to include multidimensional matrix data of tiles according to two configuration bits, a first configuration bit to correspond to configuration data loaded in a tile configuration and the second configuration bit to correspond to matrix data; and executing the decoded single instruction to store the context state to memory. 9. The method of claim 8 , wherein the two configuration bits are located in a control register. 10. The method of claim 8 , wherein a size of each data element of the matrix data is a doubleword. 11. The method of claim 8 , wherein a size of each data element of the matrix data is a word. 12. The method of claim 8 , wherein the executing is further to write zeros beyond a specified number of rows of the matrix data of the tiles. 13. The method of claim 8 , wherein the matrix data of the tiles is to include garbage data in areas that are not configured for use in tile operations. 14. The method of claim 8 , wherein the tiles are a plurality of registers configured to represent a matrix. 15. A non-transitory machine-readable medium storing an instruction which causes an apparatus to perform a method, the method comprising: decoding a single instruction having fields for an opcode to indicate execution circuitry is to save a context state to memory, wherein the context state is to include multidimensional matrix data of tiles according to two configuration bits, a first configuration bit to correspond to configuration data loaded in a tile configuration and the second configuration bit to correspond to matrix data; and executing the decoded single instruction to store the context state to memory. 16. The non-transitory machine-readable medium of claim 15 , wherein the two configuration bits are located in a control register. 17. The non-transitory machine-readable medium of claim 15 , wherein the executing is further to write zeros beyond a specified number of rows of the matrix data of the tiles. 18. The non-transitory machine-readable medium of claim 15 , wherein the matrix data of the tiles is to include garbage data in areas that are not configured for use in tile operations. 19. The non-transitory machine-readable medium of claim 15 , wherein the tiles are a plurality of registers configured to represent a matrix.

Assignees

Inventors

Classifications

  • Image or video data · CPC title

  • Vector or matrix data · CPC title

  • Sum of products (for applications thereof, see the relevant places, e.g. G06F17/10, H03H17/00) · CPC title

  • with multidimensional access, e.g. row/column, matrix · CPC title

  • Recovery, e.g. branch miss-prediction, exception handling (error detection or correction G06F11/00) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11714642B2 cover?
Embodiments detailed herein relate to matrix operations. In particular, the loading of a matrix (tile) from memory. For example, support for a loading instruction is described in at least a form of decode circuitry to decode an instruction having fields for an opcode, a source matrix operand identifier, and destination memory information, and execution circuitry to execute the decoded instructi…
Who is the assignee on this patent?
Intel Corp
What technology area does this patent fall under?
Primary CPC classification G06F9/30036. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 01 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).