Tracking stores and loads by bypassing load store units

US11048506B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11048506-B2
Application numberUS-201916450897-A
CountryUS
Kind codeB2
Filing dateJun 24, 2019
Priority dateAug 19, 2016
Publication dateJun 29, 2021
Grant dateJun 29, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system and method for tracking stores and loads to reduce load latency when forming the same memory address by bypassing a load store unit within an execution unit is disclosed. Store-load pairs which have a strong history of store-to-load forwarding are identified. Once identified, the load is memory renamed to the register stored by the store. The memory dependency predictor may also be used to detect loads that are dependent on a store but cannot be renamed. In such a configuration, the dependence is signaled to the load store unit and the load store unit uses the information to issue the load after the identified store has its physical address.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for predicting dependent store-load pairs to reduce load latency when storing and loading from a same memory address by bypassing a load store (LS) unit within an execution unit, the method comprising: inputting load and store instruction information from a micro operation queue; comparing load instruction information to values in a prediction table and outputting the comparison to a predictor logic; comparing store instruction information to values in the prediction table; determining if the store is valid based on values in the prediction table and outputting a store validity determination and the store comparison to the predictor logic; determining a confidence value and providing an indication of the confidence value to the predictor logic; and providing memory dependency predictor information from the predictor logic to a memory renaming unit. 2. The method of claim 1 , wherein the input load and store instruction information is used to identify the respective load and store instructions. 3. The method of claim 1 , wherein the input load and store instruction information comprises a fetch address of the store. 4. The method of claim 1 , wherein the input load and store instruction information comprises a fetch address of the load. 5. The method of claim 1 , wherein the prediction table has been trained through the LS unit. 6. The method of claim 1 , further comprising linking the load to the store in the memory renaming unit. 7. A system for predicting dependent store-load pairs to reduce load latency when storing and loading from a same memory address by bypassing a load store (LS) unit within an execution unit, the system comprising: a micro operation queue configured to dispatch load or store operations; one or more predictive units configured to identify conditions where a load/store unit is bypassed and memory renaming is utilized for at least one of the load or store operations; and a memory renaming unit configured to operate to cause the at least one of the load or store operations to be renamed. 8. The system of claim 7 , further comprising a floating point renamer that renames the at least one of the load or store operations. 9. The system of claim 7 , further comprising an integer renamer that renames the at least one of the load or store operations. 10. The system of claim 7 , wherein the dispatched load or store operation is used to identify the respective load and store. 11. The system of claim 7 , wherein a dispatched load or store information comprises a fetch address of the store. 12. The system of claim 7 , wherein a dispatched load or store information comprises a fetch address of the load. 13. The system of claim 7 , wherein the predictive unit includes at least one prediction table. 14. The system of claim 7 , wherein the predictive unit has been trained through the LS unit. 15. The system of claim 7 , further comprising linking the load to the store in the memory renaming unit. 16. A method for performing training of a prediction table used for predicting dependency of stores and loads to reduce load latency when storing and loading from a same memory address by bypassing a load store (LS) unit within an execution unit, the method comprising: inputting load and store instruction information from the load store unit; providing an indication of an acceptance of store to load forwarding to a confidence counter update logic; comparing load instruction information to values in the prediction table and outputting the comparison to the confidence counter update logic; comparing store instruction information to values in the prediction table and outputting the comparison to the confidence counter update logic; determining a confidence value and providing an indication of the confidence value to the confidence counter update logic; and updating entry in a confidence field of the prediction table based on feedback from the confidence counter update logic. 17. The method of claim 16 , wherein the input load and store instruction information is used to identify the respective load and store instruction. 18. The method of claim 16 , wherein the input load and store instruction information comprises a fetch address of the store. 19. The method of claim 16 , wherein the input load and store instruction information comprises a fetch address of the load. 20. The method of claim 16 , wherein the prediction table has been trained through the LS unit. 21. A method of memory renaming to reduce load latency when storing and loading from a same memory address by bypassing a load store (LS) unit within an execution unit, the method comprising: implementing a load store forwarding prediction mechanism in a decode unit of a processor; maintaining data stored by the store in the execution unit using a register file based in the implemented prediction mechanism; directing a dependent load to utilize the maintained data to bypass the LS unit; and providing correction if the prediction mechanism is incorrect in the prediction.

Assignees

Inventors

Classifications

  • G06F9/3838Primary

    Dependency mechanisms, e.g. register scoreboarding · CPC title

  • Maintaining memory consistency · CPC title

  • G06F9/28Primary

    Enhancement of operational speed, e.g. by using several microcontrol devices operating in parallel · CPC title

  • LOAD or STORE instructions; Clear instruction · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11048506B2 cover?
A system and method for tracking stores and loads to reduce load latency when forming the same memory address by bypassing a load store unit within an execution unit is disclosed. Store-load pairs which have a strong history of store-to-load forwarding are identified. Once identified, the load is memory renamed to the register stored by the store. The memory dependency predictor may also be use…
Who is the assignee on this patent?
Advanced Micro Devices Inc
What technology area does this patent fall under?
Primary CPC classification G06F9/3838. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jun 29 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).