Producing insight information from tables using natural language processing

US9286290B2 · US · B2

Patent metadata

Field               Value
Publication number  US-9286290-B2
Application number  US-201414262288-A
Country             US
Kind code           B2
Filing date         Apr 25, 2014
Priority date       Apr 25, 2014
Publication date    Mar 15, 2016
Grant date          Mar 15, 2016

Abstract

Official abstract text for this publication.

Mechanisms for generating insight statements from table data are provided. A portion of content comprising a table data structure and text associated with the table data structure is received and at least one of key terms or semantic relationships in the table data structure and the associated text are identified. Fields of an insight statement template are populated with information obtained from the key terms and semantic relationships to generate an insight statement data structure. The insight statement data structure is then output. The insight statement data structure is a natural language statement describing an aspect of the table data structure.
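
The template-population step in the abstract can be illustrated with a minimal sketch. The template wording, the field names (`condition`, `field`, `value`, `location`), and the sample values are assumptions made for illustration only; the abstract does not specify them.

```python
from string import Template

# Hypothetical insight statement template; the wording and field names are
# illustrative assumptions, not taken from the patent.
INSIGHT_TEMPLATE = Template("The $condition $field is $value, found for $location.")


def populate_insight_statement(insight: dict) -> str:
    """Fill the template fields from an insight record and return a sentence."""
    return INSIGHT_TEMPLATE.substitute(insight)


# Made-up insight record standing in for information obtained from key terms
# and semantic relationships in a table and its surrounding text.
insight = {
    "condition": "maximum",
    "field": "revenue",
    "value": "120",
    "location": "2013",
}

statement = populate_insight_statement(insight)
print(statement)
```

`Template.substitute` raises `KeyError` when a field is missing, which makes an incompletely populated template fail loudly instead of producing a partial statement.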

First claim

Opening claim text (preview).

What is claimed is:

1. A method, in a data processing system, for generating insight statements from table data, the method comprising:
    receiving, by the data processing system, a portion of content comprising a table data structure and text associated with the table data structure;
    identifying, by the data processing system, at least one of key terms or semantic relationships in the table data structure and the associated text;
    identifying, by the data processing system, insight data points in the table data structure based on the identification of at least one of key terms or semantic relationships in the table data structure and the associated text, wherein the insight data points are data points in the table data structure meeting an insight condition set forth in a predefined insight data point rule;
    generating, by the data processing system, an insight data structure specifying a field in the table data structure with which the insight data point is associated, an insight condition met by the insight data point, a location in the table data structure corresponding to the insight data point, and a value corresponding to the insight data point;
    populating, by the data processing system, fields of an insight statement template with information obtained from the key terms and semantic relationships, based on the insight data structure, to generate an insight statement data structure; and
    outputting, by the data processing system, the insight statement data structure, wherein the insight statement data structure is a natural language statement describing an aspect of the table data structure,
    wherein identifying an insight data point in the table data structure comprises: performing a matching operation that matches key terms and semantic relationships in the associated text with key terms and semantic relationships in the table data structure to identify matching portions of the table data structure, wherein the identification of the insight data point and generation of the insight data structure is performed for the matching portions of the table data structure.

2. The method of claim 1, wherein identifying key terms and semantic relationships in the table data structure and the associated text comprises:
    performing natural language processing on the table data structure to identify key terms, and semantic relationships associated with key terms, in at least one of columns or rows of the table data structure; and
    performing natural language processing on the associated text to identify key terms and semantic relationships associated with key terms in the associated text.

3. The method of claim 1, wherein the aspect of the table data structure specified in the insight statement is an aspect of the table data not explicitly specified in the table data structure but able to be extracted from the table data through application of insight analysis logic on the table data of the table data structure.

4. The method of claim 1, wherein key terms and semantic relationships in the associated text are weighted according to a proximity of the key terms and semantic relationships to the table data structure in the content.

5. The method of claim 1, wherein the text is one of a table summary text associated with the table data structure or a predetermined amount of text in close proximity to the table data structure in the content.

6. The method of claim 1, wherein the insight statement is output to a natural language processing mechanism which processes the insight statement in association with the table data structure.

7. The method of claim 6, wherein the natural language processing mechanism is part of a question and answer system, and wherein the question and answer system performs natural language processing on the insight statement when generating candidate answers to an input question.

8. The method of claim 1, wherein the insight statement is output as one of an annotation of the table data structure, metadata associated with the content, or a separate file linked to the content.

9. The method of claim 1, wherein the insight data point rules comprise a rule for determining derived information from the information present in the table data structure, and wherein the derived information comprises at least one of a maximum value for a portion of the table data structure, a variance for values in the portion of the table data structure, a distribution of values in the portion of the table data structure, an equivalence for values in the portion of the table data structure, a difference for values in the portion of the table data structure, and a total of values in the portion of the table data structure.

10. The method of claim 1, wherein the at least one of key terms or semantic relationships in the table data structure and the associated text are key terms and semantic relationships specified in a knowledge base specifically configured for a pre-defined domain.

11. A computer program product comprising a non-transitory computer readable medium having a computer readable program stored therein, wherein the computer readable program, when executed on a computing device, causes the computing device to:
    receive a portion of content comprising a table data structure and text associated with the table data structure;
    identify at least one of key terms or semantic relationships in the table data structure and the associated text;
    identify, by the data processing system, insight data points in the table data structure based on the identification of at least one of key terms or semantic relationships in the table data structure and the associated text, wherein the insight data points are data points in the table data structure meeting an insight condition set forth in a predefined insight data point rule;
    generate, by the data processing system, an insight data structure specifying a field in the table data structure with which the insight data point is associated, an insight condition met by the insight data point, a location in the table data structure corresponding to the insight data point, and a value corresponding to the insight data point;
    populate fields of an insight statement template with information obtained from the key terms and semantic relationships, based on the insight data structure, to generate an insight statement data structure; and
    output the insight statement data structure, wherein the insight statement data structure is a natural language statement describing an aspect of the table data structure,
    wherein the computer readable program further causes the computing device to identify an insight data point in the table data structure at least by: performing a matching operation that matches key terms and semantic relationships in the associated text with key terms and semantic relationships in the table data structure to identify matching portions of the table data structure, wherein the identification of the insight data point and generation of the insight data structure is performed for the matching portions of the table data structure.

12. The computer program product of claim 11, wherein the computer readable program further causes the computing device to identify key terms and semantic relationships in the table data structure and the associated text at least by:
    performing natural language processing on the table data structure to identify key terms, and semantic relationships associated with key terms, in at least one of columns or rows of the table data structure; and
    performing natural language processing on the associated text to identify key terms and semantic relationships associated with key terms in the associated text.
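
As a rough illustration of the matching operation, the insight data point rules, and the insight data structure recited in claims 1 and 9, the sketch below may help. Everything in it is an assumption made for illustration: the table layout (a list of row dictionaries), the sample data, the `Insight` class, and in particular the whitespace tokenizer, which stands in for the natural language processing the claims refer to. Only two of the derived values listed in claim 9 (maximum and total) are shown.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class Insight:
    """Illustrative stand-in for the claimed insight data structure."""
    field: str                           # table field the data point belongs to
    condition: str                       # insight condition that was met
    location: Tuple[Optional[int], str]  # (row index, or None for a whole column; column name)
    value: float                         # value of the insight data point


def key_terms(text: str) -> set:
    """Naive key-term extraction: lowercase words with punctuation stripped."""
    return {word.strip(".,;:").lower() for word in text.split()}


def matching_columns(table: list, text: str) -> list:
    """Return the columns whose header also appears as a key term in the text."""
    terms = key_terms(text)
    return [column for column in table[0] if column.lower() in terms]


def maximum_rule(table: list, column: str) -> Insight:
    """Rule: the data point holding the largest value in the column."""
    row = max(range(len(table)), key=lambda i: table[i][column])
    return Insight(column, "maximum", (row, column), table[row][column])


def total_rule(table: list, column: str) -> Insight:
    """Rule: the sum of all values in the column (no single row location)."""
    return Insight(column, "total", (None, column), sum(r[column] for r in table))


RULES = [maximum_rule, total_rule]


def find_insights(table: list, text: str) -> list:
    """Apply every rule, but only to the columns matched by the associated text."""
    return [rule(table, column)
            for column in matching_columns(table, text)
            for rule in RULES]


# Made-up sample data for the sketch.
TABLE = [
    {"year": 2011, "revenue": 90.0, "costs": 70.0},
    {"year": 2012, "revenue": 120.0, "costs": 80.0},
    {"year": 2013, "revenue": 110.0, "costs": 95.0},
]
TEXT = "Revenue peaked in 2012 before dipping slightly."

insights = find_insights(TABLE, TEXT)
for insight in insights:
    print(insight)
```

Restricting the rules to matched columns mirrors the claim language that insight identification "is performed for the matching portions of the table data structure"; here only the `revenue` column matches, so `costs` produces no insights.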

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9286290B2 cover?
Mechanisms for generating insight statements from table data. Key terms and semantic relationships are identified in a table data structure and its associated text, and the fields of an insight statement template are populated from them to produce a natural language statement describing an aspect of the table.
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F16/245. Mapped technology areas include Physics.
When was this patent published?
Publication date Mar 15, 2016 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).