What technology area does this patent fall under?

Primary CPC classification G06T15/04. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Nov 26 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 11 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Efficient convolution operations with a kernel shader

US12154209B2 · US · B2

Patent metadata
Field	Value
Publication number	US-12154209-B2
Application number	US-202217849539-A
Country	US
Kind code	B2
Filing date	Jun 24, 2022
Priority date	Jun 25, 2021
Publication date	Nov 26, 2024
Grant date	Nov 26, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method of improving texture fetching by a texturing/shading unit in a GPU pipeline by performing efficient convolution operations, includes receiving a shader and determining whether the shader is a kernel shader. In response to determining that the shader is a kernel shader, the shader is modified to perform a collective fetch of all texels used in convolution operations for a group of output pixels instead of performing independent fetches of texels for each output pixel in the group of output pixels.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer system comprising one or more processors and a memory, the memory comprising computer readable instructions that, when executed by the one or more processors, cause the computer system to: receive a shader; determine whether the shader is a kernel shader; and in response to determining that the shader is a kernel shader, modifying the shader to perform a collective fetch of all texels used in convolution operations for a group of output pixels instead of performing independent fetches of texels for each output pixel in the group of output pixels. 2. A non-transitory computer readable storage medium having stored thereon computer readable instructions that, when executed at a computer system, cause the computer system to: receive a shader; determine whether the shader is a kernel shader; and in response to determining that the shader is a kernel shader, modifying the shader to perform a collective fetch of all texels used in convolution operations for a group of output pixels instead of performing independent fetches of texels for each output pixel in the group of output pixels. 3. A method comprising: receiving a shader; determining whether the shader is a kernel shader; and in response to determining that the shader is a kernel shader, modifying the shader to perform a collective fetch of all texels used in convolution operations for a group of output pixels instead of performing independent fetches of texels for each output pixel in the group of output pixels. 4. The method according to claim 3 , wherein modifying the shader to perform a collective fetch of all texels used in convolution operations for a group of output pixels instead of performing independent fetches of texels for each output pixel in the group of output pixels comprises: modifying the shader to include a plurality of fetch instructions for different sample positions, each sample position being offset from a texel centre and defining a different, non-overlapping patch of adjacent texels to be fetched. 5. The method according to claim 3 , wherein modifying the shader to perform a collective fetch of all texels used in convolution operations for a group of output pixels instead of performing independent fetches of texels for each output pixel in the group of output pixels comprises: modifying the shader to include a plurality of gather operations for different sample positions, each sample position being offset by integer values from a centre of one of the output pixels and defining a different, non-overlapping patch of adjacent texels to be fetched. 6. The method according to claim 3 , wherein determining whether the shader is a kernel shader comprises: determining if there is a 1:1 correspondence between texels and sampling points in the received shader, wherein if there is no correspondence the shader is not a kernel shader. 7. The method according to claim 3 , further comprising: in response to determining that the shader is not a kernel shader, leaving the shader unmodified. 8. The method according to claim 3 , further comprising: determining whether the shader is an optimized kernel shader that uses bilinear interpolation; and in response to determining that the shader is an optimized kernel shader: adapting the shader to reverse the bilinear interpolation; modifying the adapted shader to perform a collective fetch of all texels used in convolution operations for a group of output pixels instead of performing independent fetches of texels for each output pixel in the group of output pixels. 9. The method according to claim 8 , wherein adapting the shader to reverse the bilinear interpolation comprises: modifying each offset sample position in the shader to explicitly fetch two adjacent texels; and defining separate weights for each of the two adjacent texels. 10. The method according to claim 9 , wherein defining separate weights for each of the two adjacent texels comprises: allocating a weight associated with the offset sample position to each of the adjacent texels; and for each of the adjacent texels, modifying the allocated weight based on an offset of the offset sample position. 11. The method according to claim 9 , wherein adapting the shader to reverse the bilinear interpolation further comprises: for any non-offset sample position in the shader, leaving the sample position and associated weight unchanged. 12. The method according to claim 8 , wherein determining whether the shader is an optimized kernel shader that uses bilinear interpolation comprises: determining whether sample positions in the shader are spread around a common coordinate with offsets, wherein if the sample positions are spread around a common coordinate, the shader is an optimized kernel shader. 13. The method according to claim 8 , wherein determining whether the shader is an optimized kernel shader that uses bilinear interpolation comprises: determining whether convolution weights in the shader are all consistently distributed except for one convolution weight, wherein if all except for one of the convolution weights in the shader are consistently distributed, the shader is an optimized kernel shader. 14. The method according to claim 8 , further comprising: in response to determining that the shader is not a kernel shader or an optimized kernel shader, leaving the shader unmodified. 15. The method according to claim 3 , wherein determining whether the shader is an optimized kernel shader that uses bilinear interpolation comprises: determining whether the shader fetches an even number of texels, wherein if the shader fetches an even number of texels, the shader is an optimized kernel shader. 16. The method according to claim 3 , further comprising: validating the modified shader; and in response to the validation failing, reverting to the received, unmodified, shader. 17. The method according to claim 16 , wherein validating the secondary shader comprises: checking that all the texels used in convolution operations for the group of output pixels fall within a predefined maximum patch size. 18. The method according to claim 3 , further comprising: generating a secondary shader configured to validate the modified shader when executed, in response to the validation passing, to trigger use of the modified shader and in response to the validation failing, to trigger use of the received, unmodified, shader.

Assignees

Imagination Tech Ltd

Inventors

Classifications

G06T15/80
Shading · CPC title
G06T15/10
Geometric effects · CPC title
G06T15/20
Perspective computation · CPC title
G06T15/04Primary
Texture mapping · CPC title
G06T2200/28
involving image processing hardware · CPC title

Patent family

Related publications grouped by family.

View patent family 77179728

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12154209B2 cover?: A method of improving texture fetching by a texturing/shading unit in a GPU pipeline by performing efficient convolution operations, includes receiving a shader and determining whether the shader is a kernel shader. In response to determining that the shader is a kernel shader, the shader is modified to perform a collective fetch of all texels used in convolution operations for a group of outpu…
Who is the assignee on this patent?: Imagination Tech Ltd
What technology area does this patent fall under?: Primary CPC classification G06T15/04. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Nov 26 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 11 related publications on this page (citations in our corpus or others sharing the same primary CPC).