Techniques for managing a distributed computing environment using event digests

US2018129503A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2018129503-A1
Application numberUS-201715635270-A
CountryUS
Kind codeA1
Filing dateJun 28, 2017
Priority dateNov 8, 2016
Publication dateMay 10, 2018
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Techniques and apparatus for managing a distributed computing environment using event digests are described. In one embodiment, for example, an apparatus may include at least one memory, and logic for a system manager, at least a portion of the logic comprised in hardware coupled to the at least one memory, the logic to determine a workload to schedule, access an event digest associated with a plurality of compute hosts, the event digest comprising event digest values determined using out-of-band information, determine metrics from the event digest, generate at least one host weight for at least a portion of the plurality of compute hosts based on the metrics, identify at least one candidate host from the portion of the plurality of compute hosts based on the at least one host weight, and schedule the workload on the at least one candidate host. Other embodiments are described and claimed.

First claim

Opening claim text (preview).

What is claimed is: 1 . An apparatus, comprising: at least one memory; and logic for a system manager, at least a portion of the logic comprised in hardware coupled to the at least one memory, the logic to: determine a workload to schedule, access an event digest associated with a plurality of compute hosts, the event digest comprising event digest values determined using out-of-band information, determine metrics from the event digest, generate at least one host weight for at least a portion of the plurality of compute hosts based on the metrics, identify at least one candidate host from the portion of the plurality of compute hosts based on the at least one host weight, and schedule the workload on the at least one candidate host. 2 . The apparatus of claim 1 , the system manager comprising one of a cloud controller or a cloud orchestrator. 3 . The apparatus of claim 1 , the metrics comprising at least one of processor power, C0 residency, efficient performance indicator (EPI), uncore clocks, memory bandwidth, or core frequency. 4 . The apparatus of claim 1 , the logic to: obtain in-band information responsive to identifying a plurality of candidate hosts, and determine a single candidate host based on the in-band information. 5 . The apparatus of claim 1 , the logic to generate the at least one host weight based on an aggregation of a plurality of fields of the event digest. 6 . The apparatus of claim 1 , the logic to generate the at least one weight to indicate a resource availability of the plurality of compute hosts. 7 . The apparatus of claim 1 , the logic to generate the at least one weight to indicate a compatibility of the at least one compute hosts to execute the workload. 8 . The apparatus of claim 1 , the logic to access the event digest via an out-of-band communication channel comprising one of an intelligent platform management interface (IPMI) communication channel or a platform environment control interface (PECI). 9 . The apparatus of claim 1 , the logic to: monitor for an error condition based on the metrics, and obtain in-band metrics responsive to a detection of the error condition to determine a cause of the error condition. 10 . The apparatus of claim 1 , the logic to: monitor for an error condition based on the metrics, and obtain in-band metrics responsive to a detection of the error condition to determine a remedial action for the error condition. 11 . A method, comprising: determining a workload to schedule; accessing an event digest associated with a plurality of compute hosts, the event digest comprising event digest values determined using out-of-band information; determining metrics from the event digest; generating at least one host weight for at least a portion of the plurality of compute hosts based on the metrics; identifying at least one candidate host from the portion of the plurality of compute hosts based on the at least one host weight; and scheduling the workload on the at least one candidate host. 12 . The method of claim 11 , the event digest accessed via a system manager operating on one of a cloud controller or a cloud orchestrator. 13 . The method of claim 11 , the metrics comprising at least one of processor power, C0 residency, efficient performance indicator (EPI), uncore clocks, memory bandwidth, or core frequency. 14 . The method of claim 11 , comprising: obtaining in-band information responsive to identifying a plurality of candidate hosts, and determining a single candidate host based on the in-band information. 15 . The method of claim 11 , comprising generating the at least one host weight based on an aggregation of a plurality of fields of the event digest. 16 . The method of claim 11 , comprising generating the at least one weight to indicate a resource availability of the plurality of compute hosts. 17 . The method of claim 11 , comprising generating the at least one weight to indicate a compatibility of the at least one compute hosts to execute the workload. 18 . The method of claim 11 , comprising accessing the event digest via an out-of-band communication channel comprising one of an intelligent platform management interface (IPMI) communication channel or a platform environment control interface (PECI). 19 . The method of claim 11 , comprising: monitoring for an error condition based on the metrics, and obtaining in-band metrics responsive to a detection of the error condition to determine a cause of the error condition. 20 . The method of claim 11 , comprising: monitoring for an error condition based on the metrics, and obtaining in-band metrics responsive to a detection of the error condition to determine a remedial action for the error condition. 21 . A computer-readable storage medium that stores instructions for execution by processing circuitry of a system manager, the instructions to cause the system manager to: determine a workload to schedule; access an event digest associated with a plurality of compute hosts, the event digest comprising event digest values determined using out-of-band information; determine metrics from the event digest; generate at least one host weight for at least a portion of the plurality of compute hosts based on the metrics; identify at least one candidate host from the portion of the plurality of compute hosts based on the at least one host weight; and schedule the workload on the at least one candidate host. 22 . The computer-readable storage medium of claim 21 , the metrics comprising at least one of processor power, C0 residency, efficient performance indicator (EPI), uncore clocks, memory bandwidth, or core frequency. 23 . The computer-readable storage medium of claim 21 , the instructions to cause the system manager to: obtain in-band information responsive to identifying a plurality of candidate hosts, and determine a single candidate host based on the in-band information. 24 . The computer-readable storage medium of claim 21 , the instructions to cause the system manager to: monitor for an error condition based on the metrics, and obtain in-band metrics responsive to a detection of the error condition to determine a cause of the error condition. 25 . The computer-readable storage medium of claim 21 , the instructions to cause the system manager to: monitor for an error condition based on the metrics, and obtain in-band metrics responsive to a detection of the error condition to determine a remedial action for the error condition.

Assignees

Inventors

Classifications

  • Data logging (G06F11/14, G06F11/2205 take precedence) · CPC title

  • G06F9/3844Primary

    using dynamic branch prediction, e.g. using branch history tables · CPC title

  • Scheduling strategies for dispatcher, e.g. round robin, multi-level priority queues · CPC title

  • considering the load · CPC title

  • Arrangements in telecontrol or telemetry systems for selectively calling a substation from a main station, in which substation desired apparatus is selected for applying a control signal thereto or for obtaining measured values therefrom · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2018129503A1 cover?
Techniques and apparatus for managing a distributed computing environment using event digests are described. In one embodiment, for example, an apparatus may include at least one memory, and logic for a system manager, at least a portion of the logic comprised in hardware coupled to the at least one memory, the logic to determine a workload to schedule, access an event digest associated with a …
Who is the assignee on this patent?
Intel Corp
What technology area does this patent fall under?
Primary CPC classification G06F11/3476. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu May 10 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).