Prefetch status notification for memory prefetching

US12373912B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12373912-B2
Application numberUS-202318511074-A
CountryUS
Kind codeB2
Filing dateNov 16, 2023
Priority dateMar 15, 2019
Publication dateJul 29, 2025
Grant dateJul 29, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Embodiments are generally directed to memory prefetching in multiple GPU environment. An embodiment of an apparatus includes multiple processors including a host processor and multiple graphics processing units (GPUs) to process data, each of the GPUs including a prefetcher and a cache; and a memory for storage of data, the memory including a plurality of memory elements, wherein the prefetcher of each of the GPUs is to prefetch data from the memory to the cache of the GPU; and wherein the prefetcher of a GPU is prohibited from prefetching from a page that is not owned by the GPU or by the host processor.

First claim

Opening claim text (preview).

What is claimed is: 1. An apparatus comprising: one or more processors including a graphics processing unit (GPU) to process data, the GPU including: one or more processing cores, including a first core, a prefetcher, and one or more caches; and a memory for storage of data; wherein the prefetcher is to prefetch data from the memory to a cache of the one or more caches; wherein, upon completion of a prefetch operation by the GPU to prefetch data for a first thread running on the first core, the prefetcher is to issue a prefetch status notification to the first thread. 2. The apparatus of claim 1 , wherein issuance of the prefetch status notification indicates that the data prefetched for the first thread has been loaded into the cache. 3. The apparatus of claim 1 , wherein the GPU is to synchronize execution of the first thread with one or more other threads based at least in part on the prefetch status notification. 4. The apparatus of claim 1 , wherein the GPU is to throttle one or more prefetches for the first thread based at least in part on the prefetch status notification. 5. The apparatus of claim 1 , wherein the prefetch status notification is a one-bit flag. 6. The apparatus of claim 1 , wherein the first core is a shader core. 7. One or more non-transitory computer-readable storage mediums having stored thereon executable computer program instructions that, when executed by one or more processors, cause the one or more processors to perform operations comprising: performing, by a prefetcher of a graphics processing unit (GPU) in a computing system, a prefetch operation to prefetch data for a first thread running on a first core of the GPU, the data being prefetched from a computer memory to a cache of the GPU; and upon completion of the prefetch operation, issuing, by the prefetcher, a prefetch status notification to the first thread. 8. The one or more computer-readable storage mediums of claim 7 , wherein issuance of the prefetch status notification indicates that the data prefetched for the first thread has been loaded into the cache. 9. The one or more computer-readable storage mediums of claim 7 , wherein the instructions further include instructions for: synchronizing execution of the first thread with one or more other threads based at least in part on the prefetch status notification. 10. The one or more computer-readable storage mediums of claim 7 , wherein the instructions further include instructions for: throttling one or more prefetches for the first thread based at least in part on the prefetch status notification. 11. The one or more computer-readable storage mediums of claim 7 , wherein the prefetch status notification is a one-bit flag. 12. The one or more computer-readable storage mediums of claim 7 , wherein the first core is a shader core. 13. A method comprising: performing, by a prefetcher of a graphics processing unit (GPU) in a computing system, a prefetch operation to prefetch data for a first thread running on a first core of the GPU, the data being prefetched from a computer memory to a cache of the GPU; and upon completion of the prefetch operation, issuing, by the prefetcher, a prefetch status notification to the first thread. 14. The method of claim 13 , wherein issuance of the prefetch status notification indicates that the data prefetched for the first thread has been loaded into the cache. 15. The method of claim 13 , further comprising: synchronizing execution of the first thread with one or more other threads based at least in part on the prefetch status notification. 16. The method of claim 13 , further comprising: throttling one or more prefetches for the first thread based at least in part on the prefetch status notification. 17. The method of claim 13 , wherein the prefetch status notification is a one-bit flag.

Assignees

Inventors

Classifications

  • General purpose rendering architectures · CPC title

  • Memory management · CPC title

  • using a secondary processor, e.g. coprocessor (peripheral processor G06F13/12) · CPC title

  • Instruction prefetching · CPC title

  • using page tables, e.g. page table structures · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12373912B2 cover?
Embodiments are generally directed to memory prefetching in multiple GPU environment. An embodiment of an apparatus includes multiple processors including a host processor and multiple graphics processing units (GPUs) to process data, each of the GPUs including a prefetcher and a cache; and a memory for storage of data, the memory including a plurality of memory elements, wherein the prefetcher…
Who is the assignee on this patent?
Intel Corp
What technology area does this patent fall under?
Primary CPC classification G06T1/20. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 29 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).