Generating ground truth for questions based on data found in structured resources

US10482180B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10482180-B2
Application numberUS-201715816089-A
CountryUS
Kind codeB2
Filing dateNov 17, 2017
Priority dateNov 17, 2017
Publication dateNov 19, 2019
Grant dateNov 19, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Ground truth for a cognitive system is generated from a structured resource such as a table by identifying a subject of the structured resource and field headers. Linguistic analysis is performed on a given header to establish an interrogative context, and a question is generated relating to the subject based on the interrogative context, including an implementation of one or more mathematical operators. The question is generated using a question template, and has a question phrase based on the interrogative context, an operator phrase based on the selected operator, and a keyword phrase based on the subject. An answer to the question is determined by carrying out a computation that applies the selected operator(s) to one or more of the data values, to form a question-and-answer pair that is added to the ground truth. A filtering step is preferably used to ensure that the question-and-answer pair is valid.

First claim

Opening claim text (preview).

What is claimed is: 1. A method of providing ground truth for a cognitive system comprising: receiving a structured resource having a set of data values, by executing first instructions in a computer system; receiving a set of operators, by executing second instructions in the computer system; identifying a subject of the structured resource and at least one field header of the structured resource, by executing third instructions in the computer system; performing linguistic analysis on the field header to determine an interrogative context, by executing fourth instructions in the computer system; generating at least one question relating to the subject based on the interrogative context wherein the question includes an implementation of a selected one of the operators, by executing fifth instructions in the computer system; and determining an answer for the question to form a question-and-answer pair. 2. The method of claim 1 wherein the question includes a question phrase based on the interrogative context, an operator phrase based on the selected operator, and a keyword phrase based on the subject. 3. The method of claim 1 wherein the data values are numerical values and the operators are mathematical operators. 4. The method of claim 1 wherein the question is generated using a question template. 5. The method of claim 1 wherein said determining the answer includes carrying out a computation by applying the selected operator to one or more of the data values. 6. The method of claim 1 further comprising determining that the question-and-answer pair is valid. 7. The method of claim 1 further comprising: storing the question-and-answer pair as part of the ground truth for the cognitive system; and using the cognitive system to formulate a response to a natural language query. 8. A computer system comprising: one or more processors which process program instructions; a memory device connected to said one or more processors; and program instructions residing in said memory device for providing ground truth to a cognitive system by receiving a structured resource having a set of data values, receiving a set of operators, identifying a subject of the structured resource and at least one field header of the structured resource, performing linguistic analysis on the field header to determine an interrogative context, generating at least one question relating to the subject based on the interrogative context wherein the question includes an implementation of a selected one of the operators, and determining an answer for the question to form a question-and-answer pair. 9. The computer system of claim 8 wherein the question includes a question phrase based on the interrogative context, an operator phrase based on the selected operator, and a keyword phrase based on the subject. 10. The computer system of claim 8 wherein the data values are numerical values and the operators are mathematical operators. 11. The computer system of claim 8 wherein the question is generated using a question template. 12. The computer system of claim 8 wherein determining the answer includes carrying out a computation by applying the selected operator to one or more of the data values. 13. The computer system of claim 8 wherein said program instructions further determine that the question-and-answer pair is valid. 14. The computer system of claim 8 wherein said program instructions further store the question-and-answer pair as part of the ground truth for the cognitive system. 15. A computer program product comprising: a computer readable storage medium; and program instructions residing in said storage medium for providing ground truth to a cognitive system by receiving a structured resource having a set of data values, receiving a set of operators, identifying a subject of the structured resource and at least one field header of the structured resource, performing linguistic analysis on the field header to determine an interrogative context, generating at least one question relating to the subject based on the interrogative context wherein the question includes an implementation of a selected one of the operators, and determining an answer for the question to form a question-and-answer pair. 16. The computer program product of claim 15 wherein the question includes a question phrase based on the interrogative context, an operator phrase based on the selected operator, and a keyword phrase based on the subject. 17. The computer program product of claim 15 wherein the data values are numerical values and the operators are mathematical operators. 18. The computer program product of claim 15 wherein the question is generated using a question template. 19. The computer program product of claim 15 wherein determining the answer includes carrying out a computation by applying the selected operator to one or more of the data values. 20. The computer program product of claim 15 wherein said program instructions further determine that the question-and-answer pair is valid.

Assignees

Inventors

Classifications

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10482180B2 cover?
Ground truth for a cognitive system is generated from a structured resource such as a table by identifying a subject of the structured resource and field headers. Linguistic analysis is performed on a given header to establish an interrogative context, and a question is generated relating to the subject based on the interrogative context, including an implementation of one or more mathematical …
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F16/3329. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Nov 19 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).