System and method for creating aliased mappings to minimize impact of cache invalidation

US2016321773A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2016321773-A1
Application numberUS-201514698024-A
CountryUS
Kind codeA1
Filing dateApr 28, 2015
Priority dateApr 28, 2015
Publication dateNov 3, 2016
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A parallel processor and a method of reducing texture cache invalidation are disclosed. In one embodiment, the parallel processor includes a cache configured to receive lines of data; and a parallel execution unit associated with the cache and configured to execute parallel counterparts of an operation. The parallel counterparts, when executed, are configured to create, in the cache, corresponding aliases of a line of data pertaining to the operation such that the parallel counterparts are operable to invalidate only the corresponding aliases.

First claim

Opening claim text (preview).

1 . A parallel processor, comprising: a cache configured to receive lines of data; and a parallel execution unit associated with said cache and configured to execute parallel counterparts of an operation, said parallel counterparts, when executed, configured to create, in said cache, corresponding aliases of a line of data pertaining to said operation such that said parallel counterparts are operable to invalidate only said corresponding aliases. 2 . The parallel processor as recited in claim 1 wherein said aliases are created in said cache by executing a texture load. 3 . The parallel processor as recited in claim 1 wherein said parallel processor is a graphics processing unit. 4 . The parallel processor as recited in claim 1 wherein said operation includes blending color values to accomplish transparency. 5 . The parallel processor as recited in claim 1 wherein said parallel counterparts are configured to store said aliases by executing a surface store. 6 . The parallel processor as recited in claim 1 wherein said aliases include a copy of said line of data. 7 . The parallel processor as recited in claim 1 wherein said aliases correspond to said parallel counterparts based on a number of said parallel counterparts using said aliases. 8 . A method of reducing an impact of cache invalidation, comprising: executing parallel counterparts of a surface operation that is to operate on a line of data; causing, with said parallel counterparts, corresponding aliases of said line of data to be loaded into a cache; and allowing said parallel counterparts of said operation to invalidate only said corresponding aliases. 9 . The method as recited in claim 8 wherein said cache is associated with a parallel processor and said aliases are loaded by executing a texture load. 10 . The method as recited in claim 8 wherein said parallel processor is a graphics processing unit. 11 . The method as recited in claim 8 wherein said cache is a texture cache. 12 . The method as recited in claim 11 wherein said aliases are stored by executing a surface store. 13 . The method as recited in claim 8 wherein said aliases include a copy of said line of data. 14 . The method as recited in claim 8 wherein said aliases correspond to said parallel counterparts based on a number of said parallel counterparts using said aliases. 15 . A surface operation in a library of graphics operations, said surface operation comprising: executing parallel counterparts of said surface operation that is to operate on a line of data; causing, with said parallel counterparts, corresponding aliases of said line of data to be loaded into a cache; and allowing said parallel counterparts of said operation to invalidate only said corresponding aliases. 16 . The surface operation as recited in claim 15 wherein said aliases are loaded by executing a texture load. 17 . The surface operation as recited in claim 15 wherein said cache is a texture cache. 18 . The surface operation as recited in claim 15 wherein said aliases are stored by executing a surface store. 19 . The surface operation as recited in claim 15 wherein said aliases include a copy of said line of data. 20 . The surface operation as recited in claim 15 wherein said aliases correspond to said parallel counterparts based on a number of said parallel counterparts using said aliases.

Assignees

Inventors

Classifications

  • G06T1/20Primary

    Processor architectures; Processor configuration, e.g. pipelining · CPC title

  • in combination with broadcast means (e.g. for invalidation or updating) · CPC title

  • Coherency control relating to peripheral accessing, e.g. from DMA or I/O device · CPC title

  • Memory management · CPC title

  • Arrangements specially adapted for transferring the contents of the bit-mapped memory to the screen (G09G5/399 takes precedence) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2016321773A1 cover?
A parallel processor and a method of reducing texture cache invalidation are disclosed. In one embodiment, the parallel processor includes a cache configured to receive lines of data; and a parallel execution unit associated with the cache and configured to execute parallel counterparts of an operation. The parallel counterparts, when executed, are configured to create, in the cache, correspond…
Who is the assignee on this patent?
Nvidia Corp
What technology area does this patent fall under?
Primary CPC classification G06T1/20. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Nov 03 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).