Nonhomogeneous server arrangement

US10503225B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10503225-B2
Application numberUS-201816172511-A
CountryUS
Kind codeB2
Filing dateOct 26, 2018
Priority dateDec 31, 2013
Publication dateDec 10, 2019
Grant dateDec 10, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Aspects of the present invention describe a nonhomogeneous server deployment in which different classes of servers are placed within a data center unit, such as a rack or chassis. In one aspect, the cooling capacity for the unit is intentionally sized to be incapable of providing enough cooling to maintain an acceptable operational temperature for the servers, if all servers in the rack are simultaneously in an active processing mode. The control fabric maintains an acceptable operating temperature within the unit by assigning workloads to only a portion of the servers within the unit at a given point in time. In one aspect, servers are arranged within a data center unit according to anticipated peak periods of usage. Units can be arranged to be adjacent to servers optimized for a specialized workload having a peak period of usage that differs from each other.

First claim

Opening claim text (preview).

The invention claimed is: 1. A data center having nonhomogeneous servers interleaved within a rack comprising: a first quantity of graphics optimized servers having a first hardware configuration and a second quantity of general processing optimized servers having a second hardware configuration that is different from the first hardware configuration, the first quantity of graphics optimized servers and the second quantity of general processing optimized servers being power balanced to use a substantially equal amount of power at peak power, and an amount of cooling available being insufficient to cool both the first quantity of graphics optimized servers and the second quantity of general processing optimized servers in active processing mode, wherein the first hardware configuration comprises a graphics processing unit (“GPU”) and the second hardware configuration does not include GPU, and wherein interleaved within a rack means that the first quantity of graphics optimized servers and the second quantity of general processing optimized servers are arranged within the rack in an alternating arrangement. 2. The data center of claim 1 , wherein the graphics optimized server outputs a rendered video game image over a wide area network to a remotely located gaming device. 3. The data center of claim 1 , wherein the graphics optimized server has a central processing unit (“CPU”), and a video encoder, and wherein a maximum power usage of the GPU comprises more than 40% of the graphics optimized server's maximum power usage. 4. The data center of claim 1 , wherein the first class of server and the second class of server generate a substantially equal amount of heat when in use, and the amount of available cooling for the data center unit is less than 60% of an amount needed to adequately cool all servers in the first class of server and the second class of server running in the active processing mode. 5. The data center of claim 1 , wherein the first quantity and the second quantity are substantially equal. 6. The data center of claim 1 , wherein the first quantity of graphics optimized servers is designed for a workload with a peak usage during a first time period that does not overlap with a second time period for which the second quantity of general processing optimized servers is designed. 7. The data center of claim 1 , wherein the rack uses vertical cooling provided by one or more fans located on top of the rack or underneath the rack, and wherein the amount of cooling available does not exceed 70% of what is adequate to facilitate simultaneous operation of the first quantity of graphics optimized servers and the second quantity of general processing optimized servers in the active processing mode. 8. A method for managing workloads within a data center, the method comprising: during a first time period, setting substantially all of a first class of server within a data center rack to a low power mode, the data center rack having a nonhomogeneous deployment of servers comprising at least the first class of server and a second class of server, an amount of available cooling for the data center rack being insufficient to cool all servers in the first class of server and the second class of server running in an active processing mode, wherein the first class of server has a first hardware configuration and the second class of server has a second hardware configuration, wherein the first hardware configuration comprises a graphics processing unit (“GPU”) and the second hardware configuration does not include GPU, wherein the nonhomogeneous deployment is in a repeating pattern of a unit of the first class of server adjacent to a unit of the second class of server; and during a second time period in the data center, setting a majority of the second class of server within the data center rack to the low power mode, the second time period not substantially overlapping with the first time period. 9. The method of claim 8 , wherein the rack uses vertical cooling provided by one or more fans located on top of the rack or underneath the rack, and wherein the amount of cooling available does not exceed 70% of what is adequate to facilitate simultaneous operation of the first quantity of graphics optimized servers and the second quantity of general processing optimized servers in the active processing mode. 10. The method of claim 8 , wherein the rack include the same amount of the first class of server and the second class of server. 11. The method of claim 8 , wherein the first class of server is a game optimized server and the second class of server is a general purpose server. 12. The method of claim 8 , wherein the first class of server and the second class of server generate a substantially equal amount of heat when in use, and the amount of available cooling for the data center unit is less than 60% of an amount needed to adequately cool all servers in the first class of server and the second class of server running in the active processing mode. 13. The method of claim 8 , wherein the first class of servers is designed for a workload with a peak usage during a first time period that does not overlap with a second time period during which the second class of servers is designed. 14. The method of claim 8 , wherein the first class of server outputs a rendered video game image over a wide area network to a remotely located graphics device. 15. A data center system comprising: a data center rack having a total quantity of servers comprising at least graphics optimized servers and general purpose servers, wherein the general purpose servers and the graphics optimized servers are arranged within the rack in an alternating pattern of a graphics optimized server adjacent to a general purpose server; and a data center controller to control an operation of the graphics optimized servers and the general purpose servers within the rack to ensure that at least 40% of the total quantity of servers in the data center rack are in a low power mode and less than 60% of the total quantity of servers are in an active processing mode, an amount of cooling available for the graphics optimized servers and the general purpose servers being insufficient to cool the total quantity of servers in an active processing mode wherein the graphics optimized server outputs a rendered video game image over a wide area network to a remotely located gaming device. 16. The data center system of claim 15 , wherein the general purpose servers and the graphics optimized servers output a substantially equal amount of heat when running in the active processing mode. 17. The data center system of claim 15 , further comprising a cooling system for the total quantity of servers, the cooling system having a cooling capacity that is not adequate to maintain operational temperatures within the total quantity of servers when more than 70% of the total quantity of servers is in the active processing mode. 18. The data center system of claim 17 , wherein the cooling system is a vertical cooling system provided by one or more fans located on top of the rack or underneath the rack. 19. The data center system of claim 15 , wherein the graphics optimized servers are designed for a workload with a peak usage during a first time period that does not overlap with a second time period for which the general processing optimized servers are designed. 20. The data center system of claim 15 , wherein the graphics optimized servers have a graphics processing unit (“GPU”), a central processing unit (“CP

Assignees

Inventors

Classifications

  • where the allocation takes into account power or heat criteria (power management in computers in general G06F1/3203; thermal management in computers in general G06F1/206) · CPC title

  • G06F1/206Primary

    comprising thermal management · CPC title

  • Monitoring of events, devices or parameters that trigger a change in power modality · CPC title

  • Cross-Sectional Technologies · mapped topic

  • Cross-Sectional Technologies · mapped topic

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10503225B2 cover?
Aspects of the present invention describe a nonhomogeneous server deployment in which different classes of servers are placed within a data center unit, such as a rack or chassis. In one aspect, the cooling capacity for the unit is intentionally sized to be incapable of providing enough cooling to maintain an acceptable operational temperature for the servers, if all servers in the rack are sim…
Who is the assignee on this patent?
Microsoft Technology Licensing Llc
What technology area does this patent fall under?
Primary CPC classification G06F1/206. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 10 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).