Dynamic field extraction of data

US10042834B2 · US · B2

Patent metadata
Field               Value
Publication number  US-10042834-B2
Application number  US-201514790189-A
Country             US
Kind code           B2
Filing date         Jul 2, 2015
Priority date       Mar 14, 2013
Publication date    Aug 7, 2018
Grant date          Aug 7, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A log analytics graphical user interface enables a user to dynamically extract and define a field from unstructured log data. The log analytics module automatically determines a definition for a field based on log text selected by the user. A portion of each log message is highlighted to show what the extracted field may be, helping users judge whether the input parameters select the intended log data. Changes to the definition of the field by the user may cause further highlighting that indicates an incomplete or erroneous field definition.
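As a rough illustration of the idea in the abstract, the sketch below derives a field definition from a user's text selection and then locates the parts of another log message that would be highlighted. It follows the outline given in the claims on this page (a value-type pattern chosen from a predetermined list of regular expressions, plus before and after context tokens), but every name in it (`VALUE_TYPES`, `FieldDefinition`, `define_field`, `find_spans`), the specific expressions, and the whitespace-based tokenization are assumptions made for illustration, not details taken from the patent.

```python
import re
from dataclasses import dataclass

# Hypothetical value types, tried in order. The source does not list
# the actual regular expressions; these are placeholders.
VALUE_TYPES = [
    ("ipv4", r"\d{1,3}(?:\.\d{1,3}){3}"),
    ("integer", r"\d+"),
    ("word", r"\w+"),
]


@dataclass
class FieldDefinition:
    value_type: str  # name of the matched value type
    pattern: str     # regular expression for the field value
    before: str      # literal token expected before the value
    after: str       # literal token expected after the value


def define_field(message, start, end):
    """Build a definition from the selection message[start:end]."""
    selected = message[start:end]
    for value_type, pattern in VALUE_TYPES:
        if re.fullmatch(pattern, selected):
            break
    else:
        # No value type fits, so fall back to the literal selection.
        value_type, pattern = "literal", re.escape(selected)
    # Assumption: context is the whitespace-delimited token on each side.
    before_tokens = message[:start].split()
    after_tokens = message[end:].split()
    before = before_tokens[-1] if before_tokens else ""
    after = after_tokens[0] if after_tokens else ""
    return FieldDefinition(value_type, pattern, before, after)


def find_spans(definition, message):
    """Return {name: (start, end)} for the value and context, or None."""
    parts = []
    if definition.before:
        parts.append("(?P<before>" + re.escape(definition.before) + r")\s*")
    parts.append("(?P<value>" + definition.pattern + ")")
    if definition.after:
        parts.append(r"\s*(?P<after>" + re.escape(definition.after) + ")")
    match = re.search("".join(parts), message)
    if match is None:
        return None
    return {name: match.span(name) for name in match.groupdict()}


# Example: the user selects "42" in the first message.
log = "GET /index.html took 42 ms"
start = log.index("42")
definition = define_field(log, start, start + 2)
spans = find_spans(definition, "POST /login took 1375 ms")
```

A user interface could render the `value` span in one color and the `before` and `after` spans in a lighter one, so the user sees at a glance which messages the definition picks up.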

First claim

Opening claim text (preview).

What is claimed is:

1. A method for displaying a graphical user interface for analyzing unstructured data, the method comprising: displaying a plurality of items of unstructured data, including a first item of unstructured data comprised of text; receiving an indication to extract a field based on a specified portion of text of the first item; generating, by operation of one or more processing units, a definition of the extracted field having (1) a pattern that matches the specified portion of the text, and (2) a context for the extracted field, wherein the context is determined based on the specified portion of the first item; annotating a first portion of the text of the first item which matches the pattern; and annotating a second portion of the text of the first item which matches the context.

2. The method of claim 1, wherein receiving the indication to extract the field based on the specified portion of text of the first item further comprises receiving a text selection, from a user via the graphical user interface, which indicates the specified portion of text.

3. The method of claim 1, wherein the pattern associated with the definition of the extracted field is a value type determined based on a match from a pre-determined list of regular expressions.

4. The method of claim 1, wherein the context associated with the definition of the extracted field comprises a before pattern that matches a token of text before an instance of the extracted field and an after pattern that matches a token of text after the instance of the extracted field.

5. The method of claim 1, wherein annotating of the first and second portions of the text of the first item comprises: highlighting the first portion of the text using a first color; and highlighting the second portion of the text using a second color, wherein the first color has different color intensity than the second color.

6. The method of claim 1, further comprising: annotating the plurality of items of unstructured data in the graphical user interface, such that for each of the plurality of items of unstructured data having an instance of the extracted field that satisfies the generated definition: annotating a first portion of the item to indicate a match with the pattern of the extracted field associated with the item; and annotating a second portion of the item, the second portion which matches with the context for the extracted field.

7. The method of claim 1, further comprising: receiving an indication that the context associated with the extracted field has been modified; annotating the second portion of the first item to indicate an incomplete match with the modified context.

8. The method of claim 7, wherein the annotating the second portion to indicate the incomplete match with the modified context further comprises: determining the modified context partially matches a token of text adjacent to an instance of the extracted field that matches the pattern; highlighting a portion of the token that matches the modified context with a first color; and highlighting a remainder of the token with a second color, wherein the first color has a different color intensity than the second color.

9. A non-transitory computer readable storage medium having stored thereon computer software executable by a processor, the computer software embodying a method for displaying a graphical user interface for analyzing unstructured data, the method comprising: displaying a plurality of items of unstructured data, including a first item of unstructured data comprised of text; receiving an indication to extract a field based on a specified portion of text of the first item; generating a definition of the extracted field having (1) a pattern that matches the specified portion of the text, and (2) a context for the extracted field, wherein the context is determined based on the specified portion of the first item; annotating a first portion of the text of the first item which matches the pattern; and annotating a second portion of the text of the first item which matches the context.

10. The non-transitory computer readable storage medium of claim 9, wherein receiving the indication to extract the field based on the specified portion of text further comprises: receiving a text selection, from a user via the graphical user interface, which indicates the specified portion of text.

11. The non-transitory computer readable storage medium of claim 9, wherein the pattern associated with the definition of the extracted field is a value type determined based on a match from a pre-determined list of regular expressions.

12. The non-transitory computer readable storage medium of claim 9, wherein the context associated with the definition of the extracted field comprises a before pattern that matches a token of text before an instance of the extracted field and an after pattern that matches a token of text after the instance of the extracted field.

13. The non-transitory computer readable storage medium of claim 9, wherein annotating display of the first and second portions of text of the first item comprises: annotating the first portion of the text using a first color; and annotating the second portion of the text using a second color, wherein the first color has different color intensity than the second color.

14. The non-transitory computer readable storage medium of claim 9, further comprising: annotating the plurality of items of unstructured data in the graphical user interface: for each of the plurality of items of unstructured data having an instance of the extracted field that satisfies the generated definition, annotating a first portion of the item to indicate a match with the pattern of the extracted field associated with the item and annotating a second portion of the item which matches the context for the extracted field.

15. The non-transitory computer readable storage medium of claim 9, further comprising: receiving an indication that the context associated with the extracted field has been modified; and annotating the second portion of the first item to indicate an incomplete match with the modified context.

16. The non-transitory computer readable storage medium of claim 15, wherein annotating the second portion to indicate the incomplete match with the modified context further comprises: determining the modified context partially matches a token of text adjacent to an instance of the extracted field that matches the pattern; annotating a portion of the token that matches the modified context with a first color; and annotating a remainder of the token with a second color, wherein the first color has a different color intensity than the second color.

17. A computer system for displaying a graphical user interface for analyzing unstructured data for a computing environment, the computer system comprising: a system memory; a storage device having a plurality of items of unstructured data including a first item of unstructured data comprised of text; and a processor programmed to carry out the steps of: displaying the plurality of items of unstructured data; receiving an indication to extract a field based on a specified portion of text of the first item; generating a definition of the extracted field having (1) a pattern that matches the specified portion of the text, and (2) a context for the extracted field, wherein the context is determined based on the specified portion of the first item; modifying display of a first portion of the text of the first item which matches the pa
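Claims 7 and 8 above describe what happens when the user edits the context so that it only partially matches the token next to a field value: the matching part of the token is highlighted in one color and the remainder in another. A minimal sketch of that split follows. It assumes that "partially matches" means the edited context is a prefix of the token; the claims do not say how the partial match is computed. The function names, style labels, and bracket markers are hypothetical stand-ins for GUI colors.

```python
from typing import List, Tuple


def annotate_context_token(token: str, context: str) -> List[Tuple[str, str]]:
    """Split a context token into (text, style) segments.

    Assumption: a partial match means `context` is a proper prefix of `token`.
    """
    if context == token:
        # Full match: the whole token gets the normal context style.
        return [(token, "match")]
    if context and token.startswith(context):
        # Partial match: matched prefix in one style, the rest in another.
        return [(context, "match"), (token[len(context):], "remainder")]
    # No match at all: leave the token unstyled.
    return [(token, "none")]


def render(segments: List[Tuple[str, str]]) -> str:
    """Render segments as plain text, with markers standing in for colors."""
    markers = {"match": ("[[", "]]"), "remainder": ("((", "))"), "none": ("", "")}
    return "".join(markers[style][0] + text + markers[style][1]
                   for text, style in segments)


# The token before the value is "took", and the user has typed "to" so far.
preview = render(annotate_context_token("took", "to"))
```

Here `preview` is `[[to]]((ok))`: the two marker styles play the role of the two color intensities, signaling that the edited context is still incomplete.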

Assignees

Inventors

Classifications

  • Data logging (G06F11/14, G06F11/2205 take precedence) · CPC title

  • where the reporting involves data filtering, e.g. pattern matching, time or event triggered, adaptive or policy-based reporting · CPC title

  • Data acquisition and logging (for input to computer G06F3/00) · CPC title

  • Performance evaluation by statistical analysis · CPC title

  • G06F40/169 · Primary

    Annotation, e.g. comment data or footnotes · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10042834B2 cover?
A log analytics graphical user interface enables a user to dynamically extract and define a field from unstructured log data. The log analytics module automatically determines a definition for a field based on log text selected by the user. A portion of each log message is highlighted to reflect what the extracted field may be to assist users with understanding if input parameters are selected …
Who is the assignee on this patent?
VMware, Inc.
What technology area does this patent fall under?
Primary CPC classification G06F40/169. Mapped technology areas include Physics.
When was this patent published?
Publication date Aug 7, 2018 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).