Optimizing shading process for mixed order-sensitive and order-insensitive shader operations

US2016307365A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2016307365-A1
Application numberUS-201514687832-A
CountryUS
Kind codeA1
Filing dateApr 15, 2015
Priority dateApr 15, 2015
Publication dateOct 20, 2016
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A graphics processing unit (GPU) includes programmable shader hardware and grouping hardware. The grouping hardware receives pixels collected from a set of primitives, wherein pixel locations of each primitive have been obtained through rasterization of a set of vertices of the primitive. The grouping hardware also groups the pixels into a set of groups having a sequential order. None of the pixels in each group overlapped with each other in a display and overlapped pixels belong to different groups. The programmable shader hardware performs order-insensitive shader operations on the groups according to a first subset of an instruction set defined for a programmable shader, with two or more of the groups processed in parallel. The programmable shader hardware also performs order-sensitive shader operations on each of the groups in the sequential order according to a second subset of the instruction set defined for the programmable shader.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method of a Graphics Processing Unit (GPU) comprising: receiving pixels collected from a set of primitives, wherein pixel locations of each primitive have been obtained through rasterization of a set of vertices of the primitive; grouping the pixels into a set of groups having a sequential order, wherein none of the pixels in each group overlapped with each other in a display and overlapped pixels belong to different groups; performing order-insensitive shader operations on the groups according to a first subset of an instruction set defined for a programmable shader, with two or more of the groups processed in parallel; and performing order-sensitive shader operations on each of the groups in the sequential order according to a second subset of the instruction set defined for the programmable shader. 2 . The method of claim 1 , wherein the overlapped pixels in the different groups follow the sequential order defined by a graphics application program interface (API) requirement that specifies an API order of sequencing the primitives for rendering. 3 . The method of claim 2 , wherein a primitive sequenced after a second group according to the API order is placed in a first group before the second group if the primitive does not overlap with any other primitives in the first group and the second group. 4 . The method of claim 1 , wherein the order-insensitive operations include fragment shader operations, and the order-sensitive operations include alpha blending. 5 . The method of claim 4 , further comprising: executing an intergroup barrier on each group after the fragment shader operations to order the groups in the sequential order for the alpha blending. 6 . The method of claim 5 , wherein the group includes a first group and a second group, the method further comprising: performing the fragment shader operations on the first group and the second group; and stalling the second group with the intergroup barrier when performing the blending operations on the first group. 7 . The method of claim 4 , further comprising: executing an instruction that specifies a blending mode register to select a shader code segment for a pre-defined blending mode. 8 . The method of claim 1 , wherein performing the order-sensitive operations further comprises: performing the order-sensitive operations in an arbitrary order on the pixels within each of the groups. 9 . The method of claim 1 , wherein the order-sensitive operations include depth testing to resolve visibility of the pixels across different groups. 10 . The method of claim 1 , wherein performing the order-sensitive operations further comprises: performing alpha blending on a set of pixels that include at least one of: transparent pixels, opaque pixels, and a combination of transparent pixels and opaque pixels. 11 . A Graphics Processing Unit (GPU) comprising: grouping hardware to receive pixels collected from a set of primitives, wherein pixel locations of each primitive have been obtained through rasterization of a set of vertices of the primitive, and to group the pixels into a set of groups having a sequential order, wherein none of the pixels in each group overlapped with each other in a display and overlapped pixels belong to different groups, programmable shader hardware coupled to the grouping hardware and including an array of computing units, the programmable shader hardware operative to: perform order-insensitive shader operations on the groups according to a first subset of an instruction set defined for a programmable shader, with two or more of the groups processed in parallel; and perform order-sensitive shader operations on each of the groups in the sequential order according to a second subset of the instruction set defined for the programmable shader. 12 . The GPU of claim 11 , wherein the overlapped pixels in the different groups follow the sequential order defined by a graphics application program interface (API) requirement that specifies an API order of sequencing the primitives for rendering. 13 . The GPU of claim 12 , wherein a primitive sequenced after a second group according to the API order is placed in a first group before the second group if the primitive does not overlap with any other primitives in the first group and the second group. 14 . The GPU of claim 11 , wherein the order-insensitive operations include fragment shader operations, and the order-sensitive operations include alpha blending. 15 . The GPU of claim 14 , wherein the programmable shader hardware is further operative to execute an intergroup barrier on each group after the fragment shader operations to order the groups in the sequential order for the alpha blending. 16 . The GPU of claim 15 , wherein the group includes a first group and a second group, and wherein the programmable shader hardware is further operative to: perform the fragment shader operations on the first group and the second group; and stall the second group with the intergroup barrier when performing the blending operations on the first group. 17 . The GPU of claim 14 , wherein the programmable shader hardware is further operative to execute an instruction that specifies a blending mode register to select a shader code segment for a pre-defined blending mode. 18 . The GPU of claim 11 , wherein the programmable shader hardware is further operative to perform the order-sensitive operations in an arbitrary order on the pixels within each of the groups. 19 . The GPU of claim 11 , wherein the order-sensitive operations include depth testing to resolve visibility of the pixels across different groups. 20 . The GPU of claim 11 , wherein the programmable shader hardware is further operative to perform alpha blending on a set of pixels that include at least one of: transparent pixels, opaque pixels, and a combination of transparent pixels and opaque pixels.

Assignees

Inventors

Classifications

  • G06T15/83Primary

    Phong shading · CPC title

  • Perspective computation · CPC title

  • Blending, e.g. for anti-aliasing · CPC title

  • G06T1/60Primary

    Memory management · CPC title

  • G06T15/005Primary

    General purpose rendering architectures · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2016307365A1 cover?
A graphics processing unit (GPU) includes programmable shader hardware and grouping hardware. The grouping hardware receives pixels collected from a set of primitives, wherein pixel locations of each primitive have been obtained through rasterization of a set of vertices of the primitive. The grouping hardware also groups the pixels into a set of groups having a sequential order. None of the pi…
Who is the assignee on this patent?
Mediatek Singapore Pte Ltd
What technology area does this patent fall under?
Primary CPC classification G06T15/83. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Oct 20 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).