Dynamic classification of time-series categorical data
US-12111851-B1 · Oct 8, 2024 · US
US2021174009A1 · US · A1
| Field | Value |
|---|---|
| Publication number | US-2021174009-A1 |
| Application number | US-202117169254-A |
| Country | US |
| Kind code | A1 |
| Filing date | Feb 5, 2021 |
| Priority date | Sep 7, 2012 |
| Publication date | Jun 10, 2021 |
| Grant date | — |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
The technology disclosed relates to formulating and refining field extraction rules that are used at query time on raw data with a late-binding schema. The field extraction rules identify portions of the raw data, as well as their data types and hierarchical relationships. These extraction rules are executed against very large data sets not organized into relational structures that have not been processed by standard extraction or transformation methods. By using sample events, a focus on primary and secondary example events help formulate either a single extraction rule spanning multiple data formats, or multiple rules directed to distinct formats. Selection tools mark up the example events to indicate positive examples for the extraction rules, and to identify negative examples to avoid mistaken value selection. The extraction rules can be saved for query-time use, and can be incorporated into a data model for sets and subsets of event data.
Opening claim text (preview).
1 . A computer-implemented method comprising: identifying a field associated with a set of events; identifying a set of unique values corresponding with the field in the set of events; determining a count of a number of times each unique value corresponds with the field in the set of events; causing concurrent display of the set of unique values corresponding with the field in the set of events and the counts of the number of times the unique value corresponds with the field in the set of events. 2 . The method of claim 1 , wherein the field is a field for which values are extracted from the set of events. 3 . The method of claim 1 , wherein each event in the set of events includes a portion of raw data. 4 . The method of claim 1 , wherein the set of events comprise a sample set of events selected by a user. 5 . The method of claim 1 , wherein the count comprises the number of times each unique value was extracted from the field in the set of events, the set of events having a plurality of fields. 6 . The method of claim 1 further comprising: determining a percent of the number of times each unique value corresponds with the field in the set of events relative to a total number of events in the set of events; and causing concurrent display of the percent with the set of unique values and the counts. 7 . The method of claim 1 further comprising: determining a percent of the number of times each unique value corresponds with the field in the set of events relative to a total number of events in the set of events; and causing concurrent display of a graphical illustration representing the percent along with the set of unique values and the counts. 8 . The method of claim 1 , wherein the displayed set of unique values are presented in rows in a first column and the counts are displayed in a second column in a corresponding row. 9 . The method of claim 1 , wherein the concurrent display includes row controls to populate a filter control with a selected value. 10 . The method of claim 1 , wherein each unique value in the concurrent display includes a link to populate a filter control with the corresponding unique value. 11 . The method of claim 1 further comprising: identifying a selection of a link of a displayed unique value; and causing presentation of the set of events with a key-value filter set in a filter control that corresponds with the selected displayed unique value. 12 . The method of claim 1 further comprising determining a coverage within a source type, the coverage comprising a percent of tokens within events of the source type that have extracted values. 13 . The method of claim 1 further comprising determining a confidence rating within a source type, the confidence rating indicating insight into an estimated success of existing extractions. 14 . A system comprising: one or more data processors; and one or more computer-readable storage media containing instructions which when executed on the one or more data processors, cause the one or more processors to perform operations including: identifying a field associated with a set of events; identifying a set of unique values corresponding with the field in the set of events; determining a count of a number of times each unique value corresponds with the field in the set of events; causing concurrent display of the set of unique values corresponding with the field in the set of events and the counts of the number of times the unique value corresponds with the field in the set of events. 15 . The system of claim 14 , wherein the count comprises the number of times each unique value was extracted from the field in the set of events, the set of events having a plurality of fields. 16 . The system of claim 14 further comprising: determining a percent of the number of times each unique value corresponds with the field in the set of events relative to a total number of events in the set of events; and causing concurrent display of the percent with the set of unique values and the counts. 17 . The system of claim 14 further comprising: determining a percent of the number of times each unique value corresponds with the field in the set of events relative to a total number of events in the set of events; and causing concurrent display of a graphical illustration representing the percent along with the set of unique values and the counts. 18 . One or more computer-storage media storing computer-executable instructions that, when executed by a computing device, perform a method for generating an extraction rule, the method comprising: identifying a field associated with a set of events; identifying a set of unique values corresponding with the field in the set of events; determining a count of a number of times each unique value corresponds with the field in the set of events; causing concurrent display of the set of unique values corresponding with the field in the set of events and the counts of the number of times the unique value corresponds with the field in the set of events. 19 . The one or more computer-storage media of claim 18 further comprising: identifying a selection of a link of a displayed unique value; and causing presentation of the set of events with a key-value filter set in a filter control that corresponds with the selected displayed unique value. 20 . The one or more computer-storage media of claim 18 further comprising: determining a coverage within a source type, the coverage comprising a percent of tokens within events of the source type that have extracted values.
Temporal data queries · CPC title
Recognition of textual entities · CPC title
Form filling; Merging · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.