Mining for statistical enumerated type

US10073894B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10073894-B2
Application numberUS-201514939855-A
CountryUS
Kind codeB2
Filing dateNov 12, 2015
Priority dateDec 14, 2012
Publication dateSep 11, 2018
Grant dateSep 11, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Event stream attributes are analyzed to determine whether the attributes are of a statistical enumerated type, or, in other words, whether the attributes comprise statistically fixed sets of unique values, for instance. The analysis can involve determining a magnitude of change to a set of unique attribute values. In one instance, such a determination can be performed as a function of a number, or count, of unique values. Further, event stream processing can be performed dynamically, for instance, by partitioning data into time intervals and processing the intervals incrementally.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method, comprising: employing at least one processor configured to execute computer-executable instructions stored in memory to perform the following acts: receiving a diagnostic event stream associated with a software application, wherein each event in the diagnostic event stream comprises an attribute and value of the attribute; identifying one or more unique values of the attribute from the diagnostic event stream, wherein the one or more unique values are non-duplicative; determining a count of the unique values; assigning the attribute to one of a plurality of categories based on variance in the count and one or more past counts of unique values; and presenting, on a display, the attribute based on the one of the plurality of categories assigned and generating at least one query suggestion based on the assignment of the attribute to the one of a plurality of categories for presentation on the display, the at least on query suggestion being associated with performance of diagnostics of the software application. 2. The method of claim 1 , assigning the attribute to one of a first category corresponding to a fixed set of unique values or a second category corresponding to an unfixed set of unique values. 3. The method of claim 2 further comprising producing an alert, on the display, after the attribute is reassigned from the first category corresponding to the fixed set of unique values to the second category corresponding to the unfixed set of unique values. 4. The method of claim 2 further comprises presenting, on the display, an attribute assigned to the first category corresponding to the fixed set of unique values. 5. The method of claim 4 further comprises presenting, on the display, the attribute assigned to the first category as a query suggestion. 6. The method of claim 1 , assigning the attribute to the one of the plurality of categories based on absolute and relative variance. 7. The method of claim 1 further comprises partitioning the diagnostic event stream based on a predetermined time interval. 8. The method of claim 7 further comprises identifying unique values for the attribute within a partition. 9. The method of claim 8 further comprises recording historical data and attribute category. 10. The method of claim 9 further comprises resetting the historical data upon a change in category assignment. 11. A system, comprising: a processor coupled to a memory, the processor configured to execute the following computer-executable components stored in the memory: a pre-process component configured to identify a count of unique values from a diagnostic event stream attribute associated with a software application, wherein the unique values are non-duplicative; an analysis component configured to determine if the attribute comprises a statistically fixed set of values based on variance in the count and one or more past counts of unique values; and a post-process component configured to present, on a display, an attribute that comprises the statistically fixed set of values and to generate at least one query suggestion based on the statistically fixed set of values, the at least one query being associated with performance of diagnostics of the software application. 12. The system of claim 11 , the analysis component is further configured to determine if the attribute comprises the statistically fixed set of values as a function of a mean standard deviation of the past numbers of unique values. 13. The system of claim 11 , the analysis component is further configured to determine if the attribute comprises the statistically fixed set of values as a function of a percent computed as mean standard deviation of the past numbers of unique values divided by mean of the past numbers of unique values. 14. The system of claim 11 further comprises a partition component configured to partition a diagnostic event stream associated with the diagnostic event stream attribute into segments that correspond to a predetermined time interval. 15. The system of claim 11 further comprises an extraction component configured to extract one or more unique values to extract one or more unique values of the attribute. 16. The system of claim 11 further comprises a model management component configured to manage a historical data model comprising unique attribute values, numbers of unique attribute values, and attribute state. 17. The system of claim 16 further comprises a state determination component configured to determine current attribute state based in part on the attribute state and the numbers of unique attribute values. 18. A computer-readable storage medium having instructions stored thereon that enable at least one processor to perform a method upon execution of the instructions, the method comprising: receiving a diagnostic event stream associated with a software application; identifying unique values for an attribute of the diagnostic event stream; determining a state of the attribute based on a past state, past counts of unique values per interval, and a number of unique values in a current interval, wherein state indicates whether the attribute is an enumerated type or a non-enumerated type; presenting, on a display, the attribute based on the type; generating at least one query suggestion based on whether the attribute is an enumerated type or a non-enumerated type, the at least one query suggestion being associated with performance of diagnostics of the software application. 19. The computer-readable storage medium of claim 18 , the method further comprises clearing historical data including the past counts after transitioning from a first state indicating the attribute is the enumerated type to a second state indicating the attribute is the non-enumerated type. 20. The computer-readable storage medium of claim 18 , the method further comprises producing an alert, on the display, after transitioning from a first state indicating the attribute is the enumerated type to a second state indicating the attribute is the non-enumerated type.

Assignees

Inventors

Classifications

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10073894B2 cover?
Event stream attributes are analyzed to determine whether the attributes are of a statistical enumerated type, or, in other words, whether the attributes comprise statistically fixed sets of unique values, for instance. The analysis can involve determining a magnitude of change to a set of unique attribute values. In one instance, such a determination can be performed as a function of a number,…
Who is the assignee on this patent?
Microsoft Technology Licensing Llc
What technology area does this patent fall under?
Primary CPC classification G06F17/30539. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Sep 11 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).