Generating Answer Variants Based on Tables of a Corpus

US2017351677A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2017351677-A1
Application numberUS-201615172216-A
CountryUS
Kind codeA1
Filing dateJun 3, 2016
Priority dateJun 3, 2016
Publication dateDec 7, 2017
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Mechanisms are provided for implementing a candidate answer variant engine. The mechanisms receive an indication of a structured portion of content in a corpus, generate a plurality of groupings of elements of the structured portion of content, and generate, for each grouping of elements in the plurality of groupings of elements, and for each element in the grouping of elements, a corresponding grouping vector representation, corresponding to the element. The mechanisms, for each grouping vector representation of each grouping of elements in the plurality of groupings of elements perform a similarity measure calculation between the grouping vector representation and a vector representation of an input question, and select an element corresponding to the grouping vector representation for inclusion as a candidate answer variant based on results of the similarity measure calculation. The mechanisms perform question answering operations based on an analysis of one or more candidate answer variants.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method, in a data processing system comprising a processor and a memory, the memory comprising instructions executed by the processor to cause the processor to implement a candidate answer variant engine, the method comprising: receiving, by the candidate answer variant engine, an indication of a structured portion of content in a corpus; generating, by the candidate answer variant engine, a plurality of groupings of elements of the structured portion of content; generating, by the candidate answer variant engine, for each grouping of elements in the plurality of groupings of elements, and for each element in the grouping of elements, a corresponding grouping vector representation, corresponding to the element; for each grouping vector representation of each grouping of elements in the plurality of groupings of elements: performing, by the candidate answer variant engine, a similarity measure calculation between the grouping vector representation and a vector representation of an input question; and selecting, by the candidate answer variant engine, an element corresponding to the grouping vector representation for inclusion as a candidate answer variant in a candidate answer variant data structure based on results of the similarity measure calculation; and performing question answering operations for answering the input question based on an analysis of one or more candidate answer variants in the candidate answer variant data structure. 2 . The method of claim 1 , wherein the structured portion of content comprises a table data structure present in a portion of content of the corpus, and wherein each grouping of elements in the plurality of grouping of elements comprises a triad of at least one row header, at least one column header, and a cell content. 3 . The method of claim 1 , wherein the structured portion of content is a structured portion of content corresponding to a candidate answer found in the corpus by a primary search operation of a question answering (QA) system. 4 . The method of claim 1 , wherein performing the similarity measure calculation comprises calculating a cosine similarity between the grouping vector representation and the vector representation of the input question. 5 . The method of claim 4 , wherein the similarity measure calculation comprises calculating the cosine similarity between other elements of the grouping vector representation than an element of the grouping vector representation being considered for selection as a candidate answer variant from the grouping of elements. 6 . The method of claim 5 , wherein selecting the element corresponding to the grouping vector representation for inclusion as a candidate answer variant in the candidate answer variant data structure based on results of the similarity measure calculation comprises: comparing a cosine similarity value calculated as the cosine similarity between the other elements of the grouping vector representation and the vector representation of the input question, with a threshold value; and selecting the element being considered for selection as a candidate answer variant to be a candidate answer variant in response to the cosine similarity value having a predefined relationship with the threshold value. 7 . The method of claim 1 , Wherein generating a plurality of groupings of elements of the structured portion of content comprises: analyzing at least one of metadata or computer code associated with the structured portion of content to identify a structure of the structured portion of content; identifying elements of the structure based on the analysis; and generating groupings of elements based on the identified elements and the identified structured of the structured portion of content. 8 . The method of claim 1 , wherein the vector representation of the input question comprises a vector representation of one or more features of the input question identified by a natural language processing of the input question. 9 . The method of claim 1 , wherein the similarity measure calculation comprises at least one of a textual syntactic similarity measure calculation or textual semantic similarity measure calculation. 10 . The method of claim 1 , wherein performing question answering operations for answering the input question based on an analysis of one or more candidate answer variants in the candidate answer variant data structure comprises: generating, for each candidate answer variant in the candidate answer variant data structure, a confidence score value indicating a confidence that a corresponding candidate answer variant is a correct answer for the input question; ranking the candidate answer variants relative to one another, and to other candidate answers found during a primary search operation of a question answering (QA) system, based on the generated confidence scores to generate a ranked listing data structure; selecting at least one final answer from the ranked listing data structure; and outputting the at least one final answer to a source of the input question. 11 . A computer program product comprising a non-transitory computer readable medium having a computer readable program stored therein, wherein the computer readable program, when executed on a computing device, causes the computing device to: receive an indication of a structured portion of content in a corpus; generate a plurality of groupings of elements of the structured portion of content; generate, for each grouping of elements in the plurality of groupings of elements, and for each element in the grouping of elements, a corresponding grouping vector representation, corresponding to the element, of each of the other elements in the grouping of elements; for each grouping vector representation of each grouping of elements in the plurality of groupings of elements: perform a similarity measure calculation between the grouping vector representation and a vector representation of an input question; and select an element corresponding to the grouping vector representation for inclusion as a candidate answer variant in a candidate answer variant data structure based on results of the similarity measure calculation; and perform question answering operations for answering the input question based on an analysis of one or more candidate answer variants in the candidate answer variant data structure. 12 . The computer program product of claim 11 , wherein the structured portion of content comprises a table data structure present in a portion of content of the corpus, and wherein each grouping of elements in the plurality of grouping of elements comprises a triad of at least one row header, at least one column header, and a cell content. 13 . The computer program product of claim 11 , wherein the structured portion of content is a structured portion of content corresponding to a candidate answer found in the corpus by a primary search operation. 14 . The computer program product of claim 11 , wherein performing the similarity measure calculation comprises calculating a cosine similarity between the grouping vector representation and the vector representation of the input question. 15 . The computer program product of claim 14 , wherein the similarity measure calculation comprises calculating the cosine similarity between other elements of the grouping vector representation than an element of the grouping vector representation being considered for selection as a candidate answer variant from the grouping of elements. 16 . The computer program product of claim 15

Assignees

Inventors

Classifications

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2017351677A1 cover?
Mechanisms are provided for implementing a candidate answer variant engine. The mechanisms receive an indication of a structured portion of content in a corpus, generate a plurality of groupings of elements of the structured portion of content, and generate, for each grouping of elements in the plurality of groupings of elements, and for each element in the grouping of elements, a corresponding…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F16/24578. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Dec 07 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).