Autonomous warehouse-scale computers

US11960936B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11960936-B2
Application numberUS-202117150285-A
CountryUS
Kind codeB2
Filing dateJan 15, 2021
Priority dateJan 15, 2021
Publication dateApr 16, 2024
Grant dateApr 16, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The subject matter described herein provides systems and techniques to address the challenges of growing hardware and workload heterogeneity using a Warehouse-Scale Computer (WSC) design that improves the efficiency and utilization of WSCs. The WSC design may include an abstraction layer and an efficiency layer in the software stack of the WSC. The abstraction layer and the efficiency layer may be designed to improve job scheduling, simplify resource management, and drive hardware-software co-optimization using machine learning techniques and automation in order to customize the WSC for applications at scale. The abstraction layer may embrace platform/hardware and workload diversity through greater coordination between hardware and higher layers of the WSC software stack in the WSC design. The efficiency layer may employ machine learning techniques at scale to realize hardware/software co-optimizations as a part of the autonomous WSC design.

First claim

Opening claim text (preview).

The invention claimed is: 1. A method of scheduling a workload to be processed on one or more computer servers, the method comprising: generating a performance profile for the workload using a first software layer; classifying the workload using a machine learning model based on the performance profile for the workload using the first software layer; generating a record of selected platforms for the workload using the first software layer; and scheduling the workload for processing on a hardware layer based on the record of preferred platforms for the workload using a second software layer, wherein the second software layer is disposed between the first software layer and the hardware layer in a software stack of the one or more computer servers. 2. The method of claim 1 , wherein the one or more computer servers are associated with a warehouse-scale computer (WSC). 3. The method of claim 1 , wherein the first software layer is a warehouse-scale computer (WSC) efficiency layer, and wherein the WSC efficiency layer includes at least one software module. 4. The method of claim 3 , wherein the machine learning model includes a hierarchical agglomerative clustering technique, and wherein the at least one software module includes a platform ranking algorithm module that performs the hierarchical agglomerative clustering technique. 5. The method of claim 4 , wherein the platform ranking algorithm module optimizes an objective function that includes at least one of a cost value, an efficiency value, and a performance value associated with scheduling the workload to be processed. 6. The method of claim 1 , wherein the second software layer is a software-defined server (SDS) abstraction layer, and wherein the SDS abstraction layer comprises application programming interfaces (APIs) to directly access the hardware layer. 7. The method of claim 1 , wherein the record of selected platforms indicates an order of preference for each one of a set of hardware platforms of the hardware layer. 8. A system for scheduling a workload to be processed on one or more computer servers, the system comprising one or more processors configured to: generate a performance profile for the workload using a first software layer; classify the workload using a machine learning model based on the performance profile for the workload using the first software layer; generate a record of selected platforms for the workload using the first software layer; and schedule the workload for processing on a hardware layer based on the record of preferred platforms for the workload using a second software layer, wherein the second software layer is disposed between the first software layer and the hardware layer in a software stack of the one or more computer servers. 9. The system of claim 8 , wherein the one or more computer servers are associated with a warehouse-scale computer (WSC). 10. The system of claim 8 , wherein the first software layer is a warehouse-scale computer (WSC) efficiency layer, and wherein the WSC efficiency layer includes at least one software module. 11. The system of claim 10 , wherein the machine learning model includes a hierarchical agglomerative clustering technique, and wherein the at least one software module includes a platform ranking algorithm module that performs the hierarchical agglomerative clustering technique. 12. The system of claim 11 , wherein the platform ranking algorithm module optimizes an objective function that includes at least one of a cost value, an efficiency value, and a performance value associated with scheduling the workload to be processed. 13. The system of claim 8 , wherein the second software layer is a software-defined server (SDS) abstraction layer, and wherein the SDS abstraction layer comprises application programming interfaces (APIs) to directly access the hardware layer. 14. The system of claim 8 , wherein the record of selected platforms indicates an order of preference for each one of a set of hardware platforms of the hardware layer. 15. A non-transitory computer-readable medium storing instructions for scheduling a workload to be processed on one or more computer servers, that when executed by one or more processors, cause the one or more processors to: generate a performance profile for the workload using a first software layer; classify the workload using a machine learning model based on the performance profile for the workload using the first software layer; generate a record of selected platforms for the workload using the first software layer; and schedule the workload for processing on a hardware layer based on the record of preferred platforms for the workload using a second software layer, wherein the second software layer is disposed between the first software layer and the hardware layer in a software stack of the one or more computer servers. 16. The non-transitory computer-readable medium of claim 15 , wherein the one or more computer servers are associated with a warehouse-scale computer (WSC). 17. The non-transitory computer-readable medium of claim 15 , wherein the first software layer is a warehouse-scale computer (WSC) efficiency layer, and wherein the WSC efficiency layer includes at least one software module. 18. The non-transitory computer-readable medium of claim 17 , wherein the machine learning model includes a hierarchical agglomerative clustering technique, and wherein the at least one software module includes a platform ranking algorithm module that performs the hierarchical agglomerative clustering technique. 19. The non-transitory computer-readable medium of claim 18 , wherein the platform ranking algorithm module optimizes an objective function that includes at least one of a cost value, an efficiency value, and a performance value associated with scheduling the workload to be processed. 20. The non-transitory computer-readable medium of claim 15 , wherein the second software layer is a software-defined server (SDS) abstraction layer, and wherein the SDS abstraction layer comprises application programming interfaces (APIs) to directly access the hardware layer.

Assignees

Inventors

Classifications

  • G06F9/5027Primary

    the resource being a machine, e.g. CPUs, Servers, Terminals · CPC title

  • Machine learning · CPC title

  • Scheduling strategies for dispatcher, e.g. round robin, multi-level priority queues · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11960936B2 cover?
The subject matter described herein provides systems and techniques to address the challenges of growing hardware and workload heterogeneity using a Warehouse-Scale Computer (WSC) design that improves the efficiency and utilization of WSCs. The WSC design may include an abstraction layer and an efficiency layer in the software stack of the WSC. The abstraction layer and the efficiency layer may…
Who is the assignee on this patent?
Google Llc
What technology area does this patent fall under?
Primary CPC classification G06F9/5027. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 16 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).