Using cohorts to infer attributes for an input case in a question answering system

US9747552B2 · US · B2

Patent metadata
Field: Value
Publication number: US-9747552-B2
Application number: US-201514658973-A
Country: US
Kind code: B2
Filing date: Mar 16, 2015
Priority date: Feb 19, 2015
Publication date: Aug 29, 2017
Grant date: Aug 29, 2017

Abstract

Official abstract text for this publication.

A cohort analysis mechanism analyzes cohorts to infer one or more additional attributes for an input case to provide a refined input case to answer a question in a question answering system. The refined input case is then used to answer a question in the question answering system. The refined input case can be used in a traditional question answering flow or in a flow that again uses cohort analysis to extract relevant data to answer the question. The cohort analysis mechanism analyzes cohorts to find common attributes in the cohorts and then determines whether to infer the common attributes into the refined input case. The cohort analysis mechanism may determine to dialog with a user to confirm an inferred attribute.
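The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `common_attributes` and `refine_input_case`, the dictionary representation of a case, the `min_share` cutoff, and the sample patient records are all assumptions made for illustration.

```python
from collections import Counter


def common_attributes(cohorts, min_share=0.75):
    """Return {attribute: value} pairs held by at least min_share of the cohorts.

    Assumption: each cohort is a flat dict with hashable values. Keep
    min_share above 0.5 so at most one value per attribute can qualify.
    """
    counts = Counter()
    for cohort in cohorts:
        counts.update(cohort.items())  # count each (attribute, value) pair
    needed = min_share * len(cohorts)
    return {attr: value for (attr, value), n in counts.items() if n >= needed}


def refine_input_case(input_case, cohorts, min_share=0.75):
    """Copy the input case and add common cohort attributes that it lacks."""
    refined = dict(input_case)
    for attr, value in common_attributes(cohorts, min_share).items():
        refined.setdefault(attr, value)  # never overwrite a known attribute
    return refined


# Hypothetical data: one patient and four similar patients (the cohorts).
patient = {"age_band": "60-69", "diagnosis": "type 2 diabetes"}
cohorts = [
    {"age_band": "60-69", "diagnosis": "type 2 diabetes", "on_metformin": True, "smoker": True},
    {"age_band": "60-69", "diagnosis": "type 2 diabetes", "on_metformin": True, "smoker": False},
    {"age_band": "60-69", "diagnosis": "type 2 diabetes", "on_metformin": True, "smoker": True},
    {"age_band": "60-69", "diagnosis": "type 2 diabetes", "on_metformin": True, "smoker": False},
]

refined = refine_input_case(patient, cohorts)
print(refined)
```

In this sketch the attribute shared by every cohort (`on_metformin`) is added to the refined case, while the attribute on which the cohorts disagree (`smoker`) is left out. The refined case would then be passed to the question answering flow in place of the original input case.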

First claim

Opening claim text (preview).

The invention claimed is:

1. An apparatus comprising: at least one processor; a memory coupled to the at least one processor; a cohort analysis mechanism residing in the memory and executed by the at least one processor that analyzes an input case representing a question to determine whether cohorts can be used with the input case, wherein the cohorts are similar entities with similar characteristics to an entity of the input case and available in a corpus of data; wherein the cohort analysis mechanism uses cohort analysis to infer an attribute for the input case by: determining if the input case has an entity that has available cohorts; identifying cohorts in the corpus of data using fuzzy matching with a threshold percentage and where the cohorts are similar to the entity in the input case; analyzing the cohorts to find common attributes in the cohorts; finding missing attributes for the entity of the input case from the common attributes in the cohorts to infer a missing attribute for the input case; scoring the missing attributes according to consistency and sample size; applying confidence thresholds to the missing attributes to determine whether to infer the attribute for the refined input case; determining whether to dialog with a user to confirm the inferred attribute using a lower threshold compared to inferring the attribute without dialoging with the user; generating a refined input case that includes the inferred attribute applied to the entity of the input case; and using the refined input case with the inferred attribute to answer the question.

2. The apparatus of claim 1 wherein the input case is a medical question for a patient and the cohort analysis mechanism processes the medical history of the patient and cohorts, wherein the cohorts comprise a set of other patients with similar attributes, by processing medical histories of other patients to infer the attribute for the refined input case from the cohorts.

3. The apparatus of claim 1 wherein the cohort analysis mechanism is part of a question answering application that answers a natural language question.

4. An apparatus comprising: at least one processor; a memory coupled to the at least one processor; a cohort analysis mechanism residing in the memory and executed by the at least one processor that analyzes an input case representing a question, wherein the input case is a medical question about a patient, and wherein cohorts comprise a set of other patients with similar attributes; wherein the cohort analysis mechanism uses cohort analysis to infer an attribute for the input case by: identifying similar patients in the corpus of data where the similar patients are similar to the patient in the input case; analyzing the similar patients to find common attributes in the similar patients; scoring the missing attributes according to consistency and sample size; finding missing attributes for the patient of the input case from the common attributes in the similar patients to infer an attribute for the input case from the missing attributes; generating a refined input case that includes the inferred attribute from the common attributes of the similar patients; and using the refined input case with the inferred attribute to answer the question by submitting the refined input case with the inferred attribute to the question answering system.

5. A program product comprising: a cohort analysis mechanism that analyzes an input case representing a question to determine whether cohorts can be used with the input case, wherein the cohorts are similar entities with similar characteristics to an entity of the input case and available in a corpus of data; wherein the cohort analysis mechanism uses cohort analysis to infer an attribute for the input case by: determining if the input case has an entity that has available cohorts; identifying cohorts in the corpus of data using fuzzy matching with a threshold percentage and where the cohorts are similar to the entity in the input case; analyzing the cohorts to find common attributes in the cohorts; finding missing attributes for the entity of the input case from the common attributes in the cohorts to infer a missing attribute for the input case; scoring the missing attributes according to consistency and sample size; applying confidence thresholds to the missing attributes to determine whether to infer the attribute for the refined input case; determining whether to dialog with a user to confirm the inferred attribute using a lower threshold compared to inferring the attribute without dialoging with the user; generating a refined input case that includes the inferred attribute applied to the entity of the input case; and using the refined input case with the inferred attribute to answer the question; and a non-transitory computer-readable storage medium bearing the cohort analysis mechanism.

6. The program product of claim 5 wherein the input case is a medical question for a patient and the cohort analysis mechanism processes the medical history of the patient and cohorts, wherein the cohorts comprise a set of other patients with similar attributes, by processing medical histories of other patients to infer the attribute for the refined input case from the cohorts.

7. The program product of claim 5 wherein the cohort analysis mechanism is part of a question answering application that answers a natural language question.

8. The program product of claim 5 wherein the cohorts are people.

9. The program product of claim 5 wherein the cohorts are chosen from one of the following: patients, animals, and computer components.
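Several steps recited in claim 1 (fuzzy matching with a threshold percentage, scoring by consistency and sample size, and a lower confidence threshold for asking the user than for inferring silently) can be sketched in Python. The claims do not give formulas, so everything concrete here is an assumption: the similarity measure, the product of consistency and a capped sample-size factor, the `full_size` parameter, the threshold values, and all function names and sample records are hypothetical.

```python
from collections import Counter


def similarity(entity, candidate):
    """Share of the entity's known attributes that the candidate matches (0 to 1)."""
    if not entity:
        return 0.0
    hits = sum(1 for key, value in entity.items() if candidate.get(key) == value)
    return hits / len(entity)


def find_cohorts(entity, corpus, match_threshold=0.75):
    """Fuzzy match: keep corpus records at or above the threshold percentage."""
    return [rec for rec in corpus if similarity(entity, rec) >= match_threshold]


def score_missing(entity, cohorts, full_size=20):
    """Map each attribute the entity lacks to (most common value, score).

    Assumed scoring: consistency (share of recorded values that agree)
    multiplied by a sample-size factor that reaches 1.0 at full_size records.
    """
    scores = {}
    missing = {key for rec in cohorts for key in rec} - set(entity)
    for attr in missing:
        values = [rec[attr] for rec in cohorts if attr in rec]
        value, count = Counter(values).most_common(1)[0]
        consistency = count / len(values)
        size_factor = min(1.0, len(values) / full_size)
        scores[attr] = (value, consistency * size_factor)
    return scores


def decide(score, infer_at=0.9, ask_at=0.6):
    """Infer silently at the higher threshold; ask the user at the lower one."""
    if score >= infer_at:
        return "infer"
    if score >= ask_at:
        return "ask_user"
    return "skip"


# Hypothetical patient and corpus of other patients.
entity = {"age_band": "60-69", "diagnosis": "t2d", "sex": "f", "bmi_band": "30+"}
corpus = (
    [dict(entity, on_statin=True) for _ in range(20)]                 # exact matches
    + [dict(entity, sex="m", on_statin=False) for _ in range(5)]      # 3 of 4 match
    + [dict(entity, sex="m", bmi_band="25-29", on_statin=False) for _ in range(10)]
)

matched = find_cohorts(entity, corpus)
value, score = score_missing(entity, matched)["on_statin"]
print(value, score, decide(score))
```

With these assumed numbers, 25 records pass the fuzzy match and 20 of them agree on `on_statin`, so the score falls between the two thresholds and the sketch chooses to confirm the attribute with the user instead of inferring it outright.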

Assignees

IBM

Inventors

Classifications

  • G06N5/04 · Primary

    Inference or reasoning models · CPC title

  • Physics · mapped topic

  • Natural language query formulation or dialogue systems · CPC title

  • using system suggestions · CPC title

  • Translation of natural language queries to structured queries · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9747552B2 cover?
A cohort analysis mechanism analyzes cohorts to infer one or more additional attributes for an input case to provide a refined input case to answer a question in a question answering system. The refined input case is then used to answer a question in the question answering system. The refined input case can be used in a traditional question answering flow or in a flow that again uses cohort ana…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06N5/04. Mapped technology areas include Physics.
When was this patent published?
Publication date Aug 29, 2017 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).