Corpus augmentation system

US10031952B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10031952-B2
Application numberUS-201615052151-A
CountryUS
Kind codeB2
Filing dateFeb 24, 2016
Priority dateJan 2, 2015
Publication dateJul 24, 2018
Grant dateJul 24, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An approach is provided for automatically ingesting additional corpus based on an interaction history that is mined to identify a question that meets specified answer deficiency criteria, and then generate a second question which is correlated to the first question by requesting additional answer information for answering the first question, where the second question is posted to a forum using a selected persona so that forum responses can be monitored and ingested as additional content in the knowledge base.

First claim

Opening claim text (preview).

What is claimed is: 1. A method, in an information handling system comprising a processor and a memory, for ingesting additional content in a knowledge base, the method comprising: mining, by the system, an interaction history comprising a plurality of questions and answer results to identify a first question by performing a natural language processing (NLP) analysis of the plurality of questions and answer results to detect the first question that meets specified answer deficiency criteria; generating, by the system, a second question which is correlated to the first question by extracting a text sentence from one or more documents correlated to the first question and parsing the text sentence to populate a defined question template used to construct the second question requesting additional answer information for answering the first question; selecting, by the system, at least one persona to post the second question; posting, by the system, the second question to a forum using the at least one persona; monitoring, by the system, the forum for responses to the second question; and ingesting, by the system, any response to the second question as additional content in the knowledge base. 2. The method of claim 1 , wherein the NLP analysis identifies the first question by detecting an answer for the first question that has a confidence measure below a minimum confidence threshold, by detecting an answer for the first question that provides no response, by detecting an answer for the first question that has an associated negative sentiment, or by detecting an answer for the first question that has no supporting evidence. 3. The method of claim 1 , wherein generating the second question comprises: retrieving, by the system, one or more documents associated with the identified first question; extracting, by the system, a text sentence from the one or more documents which is correlated to the first question; and generating, by the system, the second question by parsing the text sentence to populate a defined question template used to construct the second question. 4. The method of claim 1 , wherein selecting the at least one persona comprises selecting a persona from a group consisting of a primed persona, a curated persona, an automated persona, and a selected persona. 5. The method of claim 1 , wherein the at least one persona is registered at the forum where the second question is posted. 6. The method of claim 1 , further comprising generating, by the system, one or more persona associated with a selected forum by extracting user profile information from users registered with the selected forum. 7. The method of claim 1 , further comprising generating, by the system, one or more question templates associated with a selected forum by using machine learning processing to identify one or more questions from users registered with the selected forum and extract therefrom one or more question templates.

Assignees

Inventors

Classifications

  • Parsing · CPC title

  • G06F40/30Primary

    Semantic analysis · CPC title

  • Query processing support for facilitating data mining operations in structured databases · CPC title

  • Updates performed during online database operations; commit processing · CPC title

  • using natural language analysis · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10031952B2 cover?
An approach is provided for automatically ingesting additional corpus based on an interaction history that is mined to identify a question that meets specified answer deficiency criteria, and then generate a second question which is correlated to the first question by requesting additional answer information for answering the first question, where the second question is posted to a forum using …
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F40/30. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 24 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).