Detection of outage in cloud based service using usage data based error signals

US10073726B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10073726-B2
Application numberUS-201414475543-A
CountryUS
Kind codeB2
Filing dateSep 2, 2014
Priority dateSep 2, 2014
Publication dateSep 11, 2018
Grant dateSep 11, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Outage detection in a cloud based service is provided using usage data based error signals. Usage data is collected from component of the cloud based service or client devices of the cloud based service based on customer actions on the cloud based service. The usage data is aggregated and normalized to generate an error signal from errors generated from a component of the cloud based service. An outage is detected from the error signal. An alert that includes information associated with the outage and one or more customers impacted by the outage is generated.

First claim

Opening claim text (preview).

What is claimed is: 1. A method executed on a computing device to provide detection of an outage in a cloud based service using usage data based error signals, the method comprising: monitoring one or more customer actions with a client interface of the cloud based service, wherein the cloud based service includes one of: a document sharing service, an e-mail service and a storage service; collecting the usage data from a first set of log files associated with one or more components of the cloud based service and a second set of log files associated with one or more client devices of the cloud based service; aggregating and normalizing the collected usage data to generate an error signal by: processing the collected usage data by comparing the collected usage data against an error threshold value within a range of values, and detecting an error for each data point within the collected usage data that has a measurement value exceeding the error threshold value; detecting an outage from the error signal, wherein the outage includes one or more regressions of one or more components of the cloud based service; retrieving information associated with the outage, wherein the information associated with the outage includes a number of errors that cause the outage, identifying information associated with the errors and the one or more components generating the errors, and a time frame of the outage; identifying one or more customers impacted by the outage from a subset of the one or more customer actions associated with the errors that cause the outage, wherein the one or more customer actions include one of: authentication actions, edit actions and save actions associated with a document; generating an alert that includes the retrieved information associated with the outage; and transmit the alert to the one or more customers impacted by the outage. 2. The method of claimed 1 , further comprising: retrieving the collected usage data from the one or more components of the cloud based service that gather the collected usage data as measurements of the one or more customer actions on the cloud based service, wherein the measurements are stored in log files. 3. The method of claim 1 , further comprising: receiving the collected usage data from one or more client devices associated with the cloud based service that gather the collected usage data as measurements of the one or more customer actions on the cloud based service, wherein the measurements are stored in log files. 4. The method of claim 1 , further comprising: aggregating the collected usage data into one or more error signals based on one or more criteria that include an identification and a type of an associated component of the cloud based service. 5. The method of claim 1 , further comprising: anonymizing the collected usage data to remove one or more private information that include an identification and a location of one or more customer of the cloud based service, while aggregating the collected usage data. 6. The method of claim 1 , further comprising: normalizing the collected usage data by multiplying data points of the collected usage data with an adjustable factor to adjust the data points to be within the range of values. 7. The method of claim 6 , further comprising: including the error in the error signal. 8. The method of claim 1 , further comprising: allowing a stakeholder to configure the adjustable factor, the range of values, and the error threshold value, wherein the stakeholder includes an administrator of the cloud based service and one or more customers who are associated with the collected usage data. 9. The method of claim 1 , further comprising: analyzing the error signal to compute an error count from a number of errors associated with a component of the cloud based service, in the error signal; and comparing the error count to an outage threshold value. 10. The method of claim 9 , further comprising: weighing the outage with a high value in response to a detection that the error count exceeds the outage threshold value. 11. The method of claim 9 , further comprising: weighing the outage with a medium value in response to a detection that the error count falls below the outage threshold within a small variance of the outage threshold value. 12. A computing device to provide detection of an outage in a cloud based service using usage data based error signals, the computing device comprising: a memory; a processor coupled to the memory, the processor executing a management application in conjunction with instructions stored in the memory, wherein the management application is configured to: monitor one or more customer actions with a client interface of the cloud based service, wherein the cloud based service includes one of: a document sharing service, an e-mail service and a storage service; collect the usage data from a first set of log files associated with one or more components of the cloud based service and a second set of log files associated with one or more client devices of the cloud based service; aggregate and normalize the collected usage data to generate an error signal by: process the collected usage data by comparing the collected usage data against an error threshold value within a range of values, and detect an error for each data point within the collected usage data that has a measurement value exceeding the error threshold value; detect an outage from the error signal, wherein the outage includes one or more regressions of one or more components of the cloud based service; retrieve information associated with the outage, wherein the information associated with the outage includes a number of errors that cause the outage, identifying information associated with the errors and the one or more components generating the errors, and a time frame of the outage; identify one or more customers impacted by the outage from a subset of the one or more customer actions associated with the errors that cause the outage, wherein the one or more customer actions include one of: authentication actions, edit actions and save actions associated with a document; generate an alert that includes the retrieved information associated with the outage; and transmit the alert to the one or more customers impacted by the outage. 13. The computing device of claim 12 , wherein the management application is further configured to: identify the one or more customers from a subset of errors in the error signal that cause the outage, wherein the one or customers are associated with the subset. 14. The computing device of claim 13 , wherein the management application is further configured to: generate a customized alert for each one of the one or more customers, wherein the customized aim includes information associated with errors that impact the customer and the outage; and transmit the customized alert to the customer. 15. The computing device of claim 12 , wherein the management application is further configured to: identify a tenant that includes a subset of the one or more customers from an attribute of the one or more customers that associates the subset with the tenant. 16. The computing device of claim 15 , wherein the management application is further configured to: generate a customized alert for the tenant, wherein the customized alert includes information associated with errors that impact the subset and the outage; and transmit the customized alert to the tenant. 17. The computing device of claim 12 , wherein the management application is

Assignees

Inventors

Classifications

  • Threshold monitoring · CPC title

  • Means for error signaling, e.g. using interrupts, exception flags, dedicated error registers · CPC title

  • by anonymising data, e.g. decorrelating personal data from the owner's identification · CPC title

  • Generating service level reports · CPC title

  • Processing captured monitoring data, e.g. for logfile generation · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10073726B2 cover?
Outage detection in a cloud based service is provided using usage data based error signals. Usage data is collected from component of the cloud based service or client devices of the cloud based service based on customer actions on the cloud based service. The usage data is aggregated and normalized to generate an error signal from errors generated from a component of the cloud based service. A…
Who is the assignee on this patent?
Microsoft Technology Licensing Llc
What technology area does this patent fall under?
Primary CPC classification G06F11/0772. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Sep 11 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).