Technologies for dividing work across accelerator devices
US-2024143410-A1 · May 2, 2024 · US
US9507640B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9507640-B2 |
| Application number | US-33569608-A |
| Country | US |
| Kind code | B2 |
| Filing date | Dec 16, 2008 |
| Priority date | Dec 16, 2008 |
| Publication date | Nov 29, 2016 |
| Grant date | Nov 29, 2016 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A multiprocessor system having plural heterogeneous processing units schedules instruction sets for execution on a selected of the processing units by matching workload processing characteristics of processing units and the instruction sets. To establish an instruction set's processing characteristics, the homogeneous instruction set is executed on each of the plural processing units with one or more performance metrics tracked at each of the processing units to determine which processing unit most efficiently executes the instruction set. Instruction set workload processing characteristics are stored for reference in scheduling subsequent execution of the instruction set.
Opening claim text (preview).
What is claimed is: 1. A multiprocessor system comprising: plural heterogeneous processing units; a performance sensor associated with each processing unit and operable to determine one or more performance metrics for the associated processing unit; a workload scheduler operable to simultaneously initiate a homogeneous instruction set on the plural heterogeneous processing units; and a performance analyzer interfaced with the performance sensors, the performance analyzer operable to compare the performance metrics to select one of the plural heterogeneous processing units to execute the instruction set. 2. The multiprocessor system of claim 1 wherein each heterogeneous processing unit comprises plural cores, at least one processing unit having a proportionately greater number of integer cores relative to at least one other processing unit. 3. The multiprocessor system of claim 1 wherein each heterogeneous processing unit comprises plural cores, at least one processing unit having a proportionately greater number of floating point cores relative to at least one other processing unit. 4. The multiprocessor system of claim 1 wherein each heterogeneous processing unit comprises one or more prefetch engines, at least one processing unit having a proportionately greater number of prefetch engines relative to at least one other processing unit. 5. The multiprocessor system of claim 1 wherein the performance metric comprises cycles per instruction. 6. The multiprocessor system of claim 1 wherein the performance metric comprises cache miss rates. 7. The multiprocessor system of claim 1 wherein the performance metric comprises the number of prefetches. 8. The multiprocessor system of claim 1 wherein the workload scheduler is further operable to: store performance metrics associated with the instruction set; subsequently detect pending execution of the instruction set; and schedule the instruction set for execution based on the stored performance metrics.
Instruction prefetching · CPC title
where the computing system component is a central processing unit [CPU] · CPC title
Performance criteria · CPC title
considering hardware capabilities · CPC title
Compare instructions, e.g. Greater-Than, Equal-To, MINMAX · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.