Multi-tile graphics processor rendering

US11145105B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11145105-B2
Application numberUS-201916355364-A
CountryUS
Kind codeB2
Filing dateMar 15, 2019
Priority dateMar 15, 2019
Publication dateOct 12, 2021
Grant dateOct 12, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Embodiments are generally directed to multi-tile graphics processor rendering. An embodiment of an apparatus includes a memory for storage of data; and one or more processors including a graphics processing unit (GPU) to process data, wherein the GPU includes a plurality of GPU tiles, wherein, upon geometric data being assigned to each of a plurality of screen tiles, the apparatus is to transfer the geometric data to the plurality of GPU tiles.

First claim

Opening claim text (preview).

What is claimed is: 1. An apparatus comprising: a memory for storage of data, the data including geometric data for graphics processing, the memory including a distributed data structure; and one or more processors including a graphics processing unit (GPU) to process data, wherein the GPU includes a plurality of GPU tiles on a substrate, each GPU tile being a separate chiplet, and each GPU tile having a respective tile-based storage; wherein, upon a respective set of geometric data being assigned to each of a plurality of screen tiles, the apparatus is to transfer the set of geometric data assigned to each of the plurality of screen tiles from the distributed data structure to a tile-based storage of a respective GPU tile of the plurality of GPU tiles, including the apparatus to transfer a first set of geometric data assigned to a first screen tile from the distributed data structure to a tile-based storage of a first GPU tile, and wherein each set of geometric data is to be processed locally in the respective GPU tile of the plurality of GPU tiles to which the set of geometric data was transferred, including processing the set of first geometric data locally in the first GPU tile. 2. The apparatus of claim 1 , further comprising a display, wherein the apparatus is further to pull geometric data from any GPU tile of the plurality of GPU tiles for the display without consolidating the geometric data of the plurality of GPU tiles. 3. The apparatus of claim 1 , wherein the apparatus is to collect all triangles mapping to one or more screen tiles in a single render pass. 4. The apparatus of claim 1 , further comprising a mesh shader to operate with the plurality of GPU tiles, wherein the mesh shader is to process data from any GPU tile of the plurality of GPU tiles without transfer of data across GPU tiles. 5. The apparatus of claim 4 , wherein the apparatus is to provide tile-based immediate mode rendering (TBIMR) with the mesh shader. 6. The apparatus of claim 4 , further comprising a stream out circuit to read out mesh data from the mesh shader and write the data to memory in a structure of arrays for compression of the mesh data. 7. One or more non-transitory computer-readable storage mediums having stored thereon executable computer program instructions that, when executed by one or more processors, cause the one or more processors to perform operations comprising: assigning a set of geometric data for graphics processing by a graphics processing unit (GPU) of a computing system to each of a plurality of screen tiles, the GPU including a plurality of GPU tiles on a substrate, each GPU tile being a separate chiplet, and each GPU tile having a respective tile-based storage; transferring each set of geometric data that is assigned to each screen tile of the plurality of screen tiles from a distributed data structure of a memory of the computing system to a tile-based storage of a respective GPU tile of the plurality of GPU tiles of the GPU, including transferring a first set of geometric data assigned to a first screen tile from the distributed data structure to a tile-based storage of a first GPU tile; and processing each set of geometric data locally in the respective GPU tile of the plurality of GPU tiles to which the set of geometric data was transferred, including processing the set of first geometric data locally in the first GPU tile. 8. The one or more computer-readable storage mediums of claim 7 , further comprising instructions for pulling geometric data from any GPU tile of the plurality of GPU tiles for a display without consolidating the data of the plurality of GPU tiles. 9. The one or more computer-readable storage mediums of claim 7 , further comprising instructions for collecting all triangles mapping to one or more screen tiles in a single render pass. 10. The one or more computer-readable storage mediums of claim 7 , further comprising instructions for processing data from any GPU tile of the plurality of GPU tiles using a mesh shader without transfer of data across GPU tiles. 11. The one or more computer-readable storage mediums of claim 10 , further comprising instructions for providing tile-based immediate mode rendering (TBIMR) with the mesh shader. 12. The one or more computer-readable storage mediums of claim 10 , further comprising instructions for reading out mesh data from the mesh shader and writing the data to memory in a structure of arrays for compression of the mesh data. 13. A method comprising: assigning a set of geometric data for graphics processing by a graphics processing unit (GPU) of a computing system to each of a plurality of screen tiles, the GPU including a plurality of GPU tiles on a substrate, each GPU tile being a separate chiplet, and each GPU tile having a respective tile-based storage; transferring each set of geometric data from the that is assigned to each screen tile of the plurality of screen tiles from a distributed data structure of a memory of the computing system to a tile-based storage of a respective GPU tile of the plurality of GPU tiles of the GPU including transferring a first set of geometric data assigned to a first screen tile from the distributed data structure to a tile-based storage of a first GPU tile; and processing each set of geometric data locally in the respective GPU tile of the plurality of GPU tiles to which the set of geometric data was transferred, including processing the set of first geometric data locally in the first GPU tile. 14. The method of claim 13 , further comprising instructions for pulling geometric data from any GPU tile of the plurality of GPU tiles for a display without consolidating the data of the plurality of GPU tiles.

Assignees

Inventors

Classifications

  • Finite element generation, e.g. wire-frame surface description, {tesselation} · CPC title

  • Memory management · CPC title

  • G06T15/005Primary

    General purpose rendering architectures · CPC title

  • Processor architectures; Processor configuration, e.g. pipelining · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11145105B2 cover?
Embodiments are generally directed to multi-tile graphics processor rendering. An embodiment of an apparatus includes a memory for storage of data; and one or more processors including a graphics processing unit (GPU) to process data, wherein the GPU includes a plurality of GPU tiles, wherein, upon geometric data being assigned to each of a plurality of screen tiles, the apparatus is to transfe…
Who is the assignee on this patent?
Intel Corp
What technology area does this patent fall under?
Primary CPC classification G06T15/005. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Oct 12 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 10 related publications on this page (citations in our corpus or others sharing the same primary CPC).