Support action based self learning and analytics for datacenter device hardware/firmare fault management

US2017039105A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2017039105-A1
Application numberUS-201415305292-A
CountryUS
Kind codeA1
Filing dateApr 30, 2014
Priority dateApr 30, 2014
Publication dateFeb 9, 2017
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Techniques for support activity based self learning and analytics for datacenter device hardware/firmware fault management are described. In one example, a service event including a unique service event ID, a set of prioritized cause and support engineer actions/recommendations and associated unique support engineer action codes are generated for the support engineer upon detecting a hardware/firmware failure event. Support engineer actions taken by the support engineer upon completing the service event are then received. The support engineer actions are then analyzed using the set of prioritized cause and support engineer actions/recommendations and FRU configuration information before and after servicing the hardware/firmware failure. Any potential errors resulting from the support engineer actions are then determined and notified at real-time to the support engineer based on the outcome of the analysis. Any needed updates to the set of prioritized cause and support engineer actions/recommendations are then recommended.

First claim

Opening claim text (preview).

What is claimed is: 1 . A system, comprising: a support engineer's computing device; a centralized event and action code repository (CEACR); and a managed device coupled to the support engineer's computing device and the CEASR, wherein the managed device, comprising: a self learning manager; an event and action code repository snap shot (EACRS); an error analysis engine; and a hardware/firmware layer coupled to the error analysis engine, wherein the self learning manager and the error analysis engine are to: generate a service event including a unique service event ID, and associated set of prioritized cause and support engineer's actions/recommendations and unique support engineer action codes for a support engineer; receive support engineer actions taken by the support engineer upon completing the service event; analyze support engineer actions taken by the support engineer; determine any potential errors resulting from the support engineer actions and notifying the support engineer based on the outcome of the analysis; and dynamically recommend any needed updates to the set of prioritized cause and support engineer actions/recommendations based on the outcome of the analysis. 2 . The system of claim 1 , wherein the self learning manager and the error analysis engine are further to: store the support engineer actions taken by the support engineer in a service action history repository (SAHR). 3 . The system of claim 1 , wherein the self learning manager along with the error analysis engine are further configured to: send the unique service event ID and the support engineer actions taken by the support engineer along with a confirmation of completion of the service event to a remote datacenter to enable automatic closure of a service event. 4 . The system of claim 1 , wherein the self learning manager and the error analysis engine are further to: define a unique support engineer's action code for each possible support engineer's actions associated with hardware/firmware failure events; and store the mapping table including each of support engineer's actions and associated unique support engineer's action codes in EACRS and/or a central event and action code repository (CEACR) residing in a service provider cloud. 5 . The system of claim 1 , wherein the self learning manager and the error analysis engine are to analyze support engineer actions taken by the support engineer with the set of prioritized cause and support engineer actions/recommendations, comprising the step of: analyzing support engineer actions taken by the support engineer with the set of prioritized cause and support engineer actions/recommendations using statistical techniques. 6 . The system of claim 1 , wherein the set of prioritized cause and support engineer actions/recommendations, comprises: a combination of field replaceable unit (FRU) replacements and/or non FRU replacement procedures, and further the order of FRU replacement cause/recommendations are based on an annual failure rate of associated FRUs. 7 . The system of claim 6 , wherein the self learning manager and the error analysis engine are to generate the service event including a unique service event ID, a set of prioritized cause/support engineer actions/recommendations, associated unique support engineer action codes, location identifier as to where the hardware/firmware failure event occurred or detected, FRU vital information for the support engineer using the mapping table, wherein the vital information includes FRU serial number, FRU port number and/or FRU time of failure. 8 . A non-transitory computer readable storage medium comprising a set of instructions executable by at least one processor resource to: generating a service event including a unique service event ID, a set of prioritized cause and support engineer actions/recommendations and associated unique support engineer action codes for a support engineer; receiving support engineer actions taken by the support engineer upon completing the service event; analyzing support engineer actions taken by the support engineer with the set of prioritized cause and support engineer actions/recommendations; determining any potential errors resulting from the support engineer actions taken by the support engineer and notifying the support engineer based on the outcome of the analysis by the self learning manager; and dynamically recommending any needed updates to the set of prioritized cause and support engineer actions/recommendations based on the outcome of the analysis by the self learning manager. 9 . The non-transitory computer readable storage medium of claim 8 , comprising: storing the support engineer actions taken by the support engineer in a non-volatile memory for off-line analytics. 10 . The non-transitory computer readable storage medium of claim 8 , comprising: sending the service event ID and the support engineer actions taken by the support engineer along with a confirmation of completion of the service event to a remote datacenter. 11 . A method for support activity based self learning and analytics for datacenter device hardware/firmware fault management, comprising: generating a service event including a unique service event ID, a set of prioritized cause and support engineer actions/recommendations and associated unique support engineer action codes for a support engineer using a mapping table including each of support engineers actions and associated unique support engineer action codes by an error analysis engine upon detecting a hardware/firmware failure event via a hardware/firmware layer associated with a managed device in the datacenter; receiving support engineer actions taken by the support engineer upon completing the service event via a support engineer's user interface (UI) by a self learning manager; analyzing support engineer actions taken by the support engineer with the set of prioritized cause and support engineer actions/recommendations and field replaceable unit (FRU) configuration information before and after servicing the hardware/firmware failure by the self learning manager; determining any potential errors resulting from the support engineer actions taken by the support engineer and notifying the support engineer based on the outcome of the analysis by the self learning manager; and dynamically recommending any needed updates to the set of prioritized cause and support engineer actions/recommendations based on the outcome of the analysis by the self learning manager. 12 . The method of claim 11 , further comprising: storing the support engineer actions taken by the support engineer in a non-volatile memory for off-line analytics. 13 . The method of claim 11 , further comprising: sending the service event ID and the support engineer actions taken by the support engineer along with a confirmation of completion of the service event to a remote datacenter. 14 . The method of claim 11 , further comprising: defining a unique support engineer's action code for each possible support engineers actions/recommendations associated with hardware/firmware failure events; maintaining the mapping table including each of support engineer's actions and associated unique support engineer's action codes in the EACSR and/or in the central event and action code repository (CEACR) residing in a service provider cloud. 15 . The method of claim 11 , wherein analyzing support engineer actions taken by the support engineer with the set of prioritized cause and support engineer actions/recommendations, comprises: analyzing support eng

Assignees

Inventors

Classifications

  • G06Q50/00Primary

    Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism (healthcare informatics G16H) · CPC title

  • G06F11/142Primary

    Reconfiguring to eliminate the error (group management mechanisms in a peer-to-peer network H04L67/1044) · CPC title

  • Real-time · CPC title

  • Solving problems relating to consistency · CPC title

  • Administration of product repair or maintenance · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2017039105A1 cover?
Techniques for support activity based self learning and analytics for datacenter device hardware/firmware fault management are described. In one example, a service event including a unique service event ID, a set of prioritized cause and support engineer actions/recommendations and associated unique support engineer action codes are generated for the support engineer upon detecting a hardware/f…
Who is the assignee on this patent?
Shivanna Suhas, Polackal Bino Joseph, Ahmed Afreen, and 1 more
What technology area does this patent fall under?
Primary CPC classification G06Q50/00. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Feb 09 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).