Systems and methods for updating memory side caches in a multi-GPU configuration

US12561276B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12561276-B2
Application numberUS-202017428534-A
CountryUS
Kind codeB2
Filing dateMar 14, 2020
Priority dateMar 15, 2019
Publication dateFeb 24, 2026
Grant dateFeb 24, 2026

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods for updating remote memory side caches in a multi-GPU configuration are disclosed herein. In one embodiment, a graphics processor for a multi-tile architecture includes a first graphics processing unit (GPU) having a first memory, a first memory side cache memory, a first communication fabric, and a first memory management unit (MMU). The graphics processor includes a second graphics processing unit (GPU) having a second memory, a second memory side cache memory, a second memory management unit (MMU), and a second communication fabric that is communicatively coupled to the first communication fabric. The first MMU is configured to control memory requests for the first memory, to update content in the first memory, to update content in the first memory side cache memory, and to determine whether to update the content in the second memory side cache memory.

First claim

Opening claim text (preview).

What is claimed is: 1 . A graphics processing unit (GPU) in a multi-GPU architecture, the GPU comprising: processing resources to perform graphics operations; a first memory side cache that is adjacent to a memory associated with a first side of a cross-GPU communication fabric; and memory management unit (MMU) circuitry associated with a second side of the cross-GPU communication fabric, wherein the MMU circuitry is configured to control memory requests for the memory, to request content from an address in the memory, to update the content in the first memory side cache, and to determine whether to update the content in a second memory side cache associated with another GPU, wherein the MMU circuitry is further to determine ownership of physical addresses based on a distributed ownership scheme across multiple GPUs, wherein the cross-GPU communication fabric includes a cross-GPU switch for facilitating communications between the GPU and another GPU, wherein the cross-GPU communication fabric further includes dedicated GPU-to-GPU links that bypass system memory for inter-GPU cache coherency operations, wherein the content is updated in the first memory side cache when the MMU circuitry determines the GPU owns a requested physical address that is associated with the content. 2 . The graphics processing unit (GPU) of claim 1 , wherein the update associated with the first memory side cache or the second memory side cache is tracked using a fence. 3 . The graphics processing unit (GPU) of claim 1 , wherein the MMU circuitry is further configured to determine when to synchronize the second memory side cache with the memory based on an ordering hint. 4 . The graphics processing unit (GPU) of claim 1 , wherein the content is updated in the second memory side cache when the MMU circuitry determines that the another GPU owns the requested physical address that is associated with the content. 5 . A system having multiple graphics processing units (GPUs), the system comprising: a first GPU in communication with a second GPU, the first GPU comprising: processing resources to perform graphics operations; a first memory side cache that is adjacent to a memory associated with a first side of a cross-GPU communication fabric; and memory management unit (MMU) circuitry associated with a second side of the cross-GPU communication fabric, wherein the MMU circuitry is configured to control memory requests for the memory, to request content from an address in the memory, to update the content in the first memory side cache, and to determine whether to update the content in a second memory side cache associated with the second GPU, wherein the MMU circuitry is further to determine ownership of physical addresses based on a distributed ownership scheme across multiple GPUs, wherein the cross-GPU communication fabric includes a cross-GPU switch for facilitating communications between the first GPU and the second GPU, wherein the cross-GPU communication fabric further includes dedicated GPU-to-GPU links that bypass system memory for inter-GPU cache coherency operations, wherein the content is updated in the first memory side cache when the MMU circuitry determines the first GPU owns a requested physical address that is associated with the content. 6 . The system of claim 5 , wherein the update associated with the first memory side cache or a second memory side cache is tracked using a fence. 7 . The system of claim 5 , wherein the MMU circuitry is further configured to determine when to synchronize the second memory side cache with the memory based on an ordering hint. 8 . The system of claim 5 , wherein the content is updated in the second memory side cache when the MMU circuitry determines the second GPU owns the requested physical address that is associated with the content. 9 . A method comprising: a first memory side cache that is adjacent to a memory associated with a first side of a cross-graphics processing unit (GPU) communication fabric associated with a first GPU; and controlling memory requests for a memory, and requesting content from an address in the memory, and updating the content in a first memory side cache, and determining whether to update the content in a second memory side cache associated with a second GPU, wherein the MMU circuitry is further to determine ownership of physical addresses based on a distributed ownership scheme across multiple GPUs, wherein the cross-GPU communication fabric includes a cross-GPU switch for facilitating communications between the first GPU and the second GPU, wherein the cross-GPU communication fabric further includes dedicated GPU-to-GPU links that bypass system memory for inter-GPU cache coherency operations, wherein the content is updated in the first memory side cache when the MMU circuitry determines the GPU owns a requested physical address associated with the content. 10 . The method of claim 9 , wherein the update associated with the first memory side cache or the second memory side cache is tracked using a fence. 11 . The method of claim 9 , further comprising determining when to synchronize the second memory side cache with the memory based on an ordering hint.

Assignees

Inventors

Classifications

  • by reordering requests · CPC title

  • Ray-tracing · CPC title

  • Loop control instructions; iterative instructions, e.g. LOOP, REPEAT · CPC title

  • controlled by a single instruction for multiple threads [SIMT] in parallel · CPC title

  • LOAD or STORE instructions; Clear instruction · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12561276B2 cover?
Systems and methods for updating remote memory side caches in a multi-GPU configuration are disclosed herein. In one embodiment, a graphics processor for a multi-tile architecture includes a first graphics processing unit (GPU) having a first memory, a first memory side cache memory, a first communication fabric, and a first memory management unit (MMU). The graphics processor includes a second…
Who is the assignee on this patent?
Intel Corp
What technology area does this patent fall under?
Primary CPC classification G06F12/0862. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 24 2026 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).