Method, apparatus, and storage medium for diagnosing failure based on a service monitoring indicator of a server by clustering servers with similar degrees of abnormal fluctuation

US10805151B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10805151-B2
Application numberUS-201916354074-A
CountryUS
Kind codeB2
Filing dateMar 14, 2019
Priority dateMay 31, 2018
Publication dateOct 13, 2020
Grant dateOct 13, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method, an apparatus and a storage medium for diagnosing failure based on a service monitoring indicator are provided. Service monitoring indicator of a server is analyzed to obtain a degree of abnormal fluctuation in the service monitoring indicator. Servers with similar degrees of abnormal fluctuation are clustered according to the degree of abnormal fluctuation of the service monitoring indicator, to obtain clustered results that include the servers and the service monitoring indicator. A location where the service fails is determined according to the clustered results.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for diagnosing failure based on a service monitoring indicator, wherein the method comprises: analyzing, according to an abnormality fluctuation detection algorithm, a service monitoring indicator of a server, in which a service is located, to obtain a degree of abnormal fluctuation in the service monitoring indicator; clustering servers with similar degrees of abnormal fluctuation according to the degree of abnormal fluctuation of the service monitoring indicator, wherein clustered results comprise the servers and the service monitoring indicator; and determining, according to the clustered results, a location where the service fails; wherein the analyzing, according to an abnormality fluctuation detection algorithm, a service monitoring indicator of a server, in which a service is located, to obtain a degree of abnormal fluctuation in the service monitoring indicator, comprises: calculating a degree of first fluctuation and a degree of second fluctuation of the service monitoring indicator using kernel density estimation (KDE), wherein the degree of first fluctuation is a degree of fluctuation in the service monitoring indicator on the same day that the failure occurs, and the degree of second fluctuation is a degree of fluctuation in the service monitoring indicator within a preset time period before the failure occurs; and comparing, according to an extreme value theory, the degree of first fluctuation and the degree of second fluctuation to determine a probability that the degree of first fluctuation is an abnormal fluctuation, so as to obtain the degree of abnormal fluctuation of the service monitoring indicator. 2. The method according to claim 1 , wherein the clustering servers with similar degrees of abnormal fluctuation according to the degree of abnormal fluctuation of the service monitoring indicator comprises: clustering, according to the degree of abnormal fluctuation in the service monitoring indicator, servers with similar degrees of abnormal fluctuation using a preset clustering algorithm, wherein the preset clustering algorithm is a hierarchical clustering algorithm or a Density-Based Spatial Clustering of Applications with Noise (DBSCAN) algorithm. 3. The method according to claim 2 , wherein, before the clustering servers with similar degrees of abnormal fluctuation using the preset clustering algorithm, the method further comprises: for each server, sorting the degree of abnormal fluctuation in the service monitoring indicator corresponding to the each server to obtain a sorted result corresponding to the each server; calculating a correlation coefficient between sorted results corresponding to two servers; and correspondingly, the clustering servers with similar degrees of abnormal fluctuation using the preset clustering algorithm comprises: clustering servers with similar degrees of abnormal fluctuation using the preset clustering algorithm based on the correlation coefficient. 4. The method according to claim 2 , wherein each computer room comprises a plurality of the servers, the service comprises at least one module, and the clustering servers with similar degrees of abnormal fluctuation using the preset clustering algorithm comprises: clustering the servers comprised in the each computer room according to the module comprised in the service. 5. The method according to claim 1 , wherein the determining, according to the clustered results, the location where the service fails comprises: sorting the clustered results according to a preset sorting strategy; and determining, according to the sorted results, a location where the service fails. 6. The method according to claim 1 , wherein, before the analyzing, according to an abnormality fluctuation detection algorithm, a service monitoring indicator of a server, in which a service is located, to obtain a degree of abnormal fluctuation in the service monitoring indicator, the method further comprises: obtaining historical data of the service monitoring indicator of a server in which the service is located; and correspondingly, the analyzing, according to an abnormality fluctuation detection algorithm, a service monitoring indicator of a server, in which a service is located, to obtain a degree of abnormal fluctuation in the service monitoring indicator comprises: analyzing, according to an abnormality fluctuation detection algorithm, the historical data to obtain the degree of abnormal fluctuation in the service monitoring indicator. 7. An apparatus for diagnosing failure based on a service monitoring indicator, the apparatus comprising a processor and a computer-readable medium for storing program codes, which, when executed by the processor, cause the processor to: analyze, according to an abnormality fluctuation detection algorithm, a service monitoring indicator of a server, in which a service is located, to obtain a degree of abnormal fluctuation in the service monitoring indicator; cluster servers with similar degrees of abnormal fluctuation according to the degree of abnormal fluctuation of the service monitoring indicator, wherein clustered results comprise the servers and the service monitoring indicator; and determine, according to the clustered results, a location where the service fails; wherein the program codes further cause the processor to calculate a degree of first fluctuation and a degree of second fluctuation of the service monitoring indicator using kernel density estimation (KDE), wherein the degree of first fluctuation is a degree of fluctuation in the service monitoring indicator on the same day that the failure occurs, and the degree of second fluctuation is a degree of fluctuation in the service monitoring indicator within a preset time period before the failure occurs; and compare, according to an extreme value theory, the degree of first fluctuation and the degree of second fluctuation to determine a probability that the degree of first fluctuation is an abnormal fluctuation, so as to obtain the degree of abnormal fluctuation of the service monitoring indicator. 8. The apparatus according to claim 7 , wherein the program codes further cause the processor to: cluster, according to the degree of abnormal fluctuation in the service monitoring indicator, servers with similar degrees of abnormal fluctuation using a preset clustering algorithm, wherein the preset clustering algorithm is a hierarchical clustering algorithm or a Density-Based Spatial Clustering of Applications with Noise (DBSCAN) algorithm. 9. The apparatus according to claim 8 , wherein the program codes further cause the processor to: before the clustering servers with similar degrees of abnormal fluctuation using the preset clustering algorithm, for each server, sort the degree of abnormal fluctuation in the service monitoring indicator corresponding to the each server to obtain a sorted result corresponding to the each server; calculate a correlation coefficient between sorted results corresponding to two servers; and correspondingly, when clustering servers with similar degrees of abnormal fluctuation using the preset clustering algorithm, the program codes further cause the processor to: cluster servers with similar degrees of abnormal fluctuation using the preset clustering algorithm based on the correlation coefficient. 10. The apparatus according to claim 8 , wherein each computer room comprises a plurality of the servers, the service comprises at least one module; when clustering servers with similar degrees of abnormal fluctuation using the preset clustering algorithm, the program codes further cause the processor to: cluster the servers comprised in the each computer room according to the

Assignees

Inventors

Classifications

  • the faulty arrangement being the maintenance, administration or management system · CPC title

  • H04L41/142Primary

    using statistical or mathematical methods · CPC title

  • by actively collecting configuration information or by backing up configuration information · CPC title

  • Localisation of faults · CPC title

  • involving time analysis · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10805151B2 cover?
A method, an apparatus and a storage medium for diagnosing failure based on a service monitoring indicator are provided. Service monitoring indicator of a server is analyzed to obtain a degree of abnormal fluctuation in the service monitoring indicator. Servers with similar degrees of abnormal fluctuation are clustered according to the degree of abnormal fluctuation of the service monitoring in…
Who is the assignee on this patent?
Beijing Baidu Netcom Sci & Tech Co Ltd
What technology area does this patent fall under?
Primary CPC classification H04L41/0695. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Oct 13 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).