Using cohorts in a question answering system

US9836693B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9836693-B2
Application numberUS-201414489124-A
CountryUS
Kind codeB2
Filing dateSep 17, 2014
Priority dateSep 17, 2014
Publication dateDec 5, 2017
Grant dateDec 5, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A cohort analysis mechanism analyzes cohorts with similar attributes to extrapolate additional knowledge and answer a question in a question answering system. The cohort analysis mechanism identifies cohorts for an entity of the question and extracts relevant data concerning the cohorts. The cohort analysis mechanism aggregates the relevant information for evidence scoring and answer scoring to answer a question posed to the question answering system. The aggregating of the data includes combining and ranking answers from the cohorts, gathering evidence and then answering the question with the gathered evidence.

First claim

Opening claim text (preview).

The invention claimed is: 1. An apparatus comprising: at least one processor; a memory coupled to the at least one processor; a cohort analysis mechanism residing in the memory and executed by the at least one processor that answers a question by identifying cohorts for a person in the question, wherein the cohorts are persons meeting a threshold of similar attributes to the person of the question and available in a corpus of data; and wherein the cohort analysis mechanism aggregates data of the cohorts, extracts answers and evidence from data of the cohorts and combines and ranks the answers and evidence from data of the cohorts to answer the question. 2. The apparatus of claim 1 wherein the attributes include age, race and symptoms. 3. The apparatus of claim 2 wherein the cohort analysis mechanism uses the ranked answers to gather statistically significant evidence of the cohorts to answer the question. 4. The apparatus of claim 3 wherein the cohort analysis mechanism uses the ranked answers to gather statistically significant evidence of the cohorts to answer the question and wherein the cohort analysis mechanism determines an answer to the question using evidence gathered from a set of cohorts where case attributes in the set of cohorts does not exactly match attributes of the question. 5. The apparatus of claim 4 wherein the person in the question is a patient, the question is a medical question for the patient and the cohort analysis mechanism processes the medical history of the patient and finds the cohorts from medical histories of other patients to predict an answer from the cohorts. 6. The apparatus of claim 3 wherein the cohort analysis mechanism combines the answer from the cohorts with answers using conventional question answering in an answer scoring block of a conventional question answering system to answer the question. 7. The apparatus of claim 1 wherein the cohort analysis mechanism is part of a question answering application that answers a natural language question. 8. The apparatus of claim 1 wherein the person in the question is a patient, the question is a medical question for the patient and the cohort analysis mechanism processes the medical history of the patient and finds the cohorts from medical histories of other patients to predict an answer from the cohorts. 9. The method of claim 1 wherein the threshold is a reference percentage allowing the cohorts to be divided into relative strengths for different percentage thresholds and wherein the relative strengths of the cohorts are used to score the answers and evidence. 10. An apparatus comprising: at least one processor; a memory coupled to the at least one processor; a cohort analysis mechanism residing in the memory and executed by the at least one processor that answers a question by identifying cohorts for a person in the question, wherein the cohorts are persons meeting a threshold of similar attributes to the person of the question and available in a corpus of data wherein the attributes include age, race and symptoms; wherein the cohort analysis mechanism aggregates data of the cohorts, extracts answers and evidence from data of the cohorts, combines and ranks the answers and evidence from data of the cohorts and uses the ranked answers and evidence to gather statistically significant evidence of the cohorts to answer the question, wherein the person in the question is a patient, the question is a medical question for the patient and the cohort analysis mechanism processes the medical history of the patient and finds the cohorts from medical histories of other patients to predict the answer from the cohorts; and wherein the cohort analysis mechanism is part of a question answering application that answers a natural language question. 11. The apparatus of claim 10 wherein the attributes include age, race and symptoms. 12. The apparatus of claim 10 wherein the threshold is a reference percentage allowing the cohorts to be divided into relative strengths for different percentage thresholds and wherein the relative strengths of the cohorts are used to score the answers and evidence. 13. A computer-readable article of manufacture comprising: a cohort analysis mechanism that answers a question by identifying cohorts for a person in the question, wherein the cohorts are persons meeting a threshold of similar attributes to the person of the question and available in a corpus of data wherein the attributes include age, race and symptoms; wherein the cohort analysis mechanism aggregates data of the cohorts, extracts answers and evidence from data of the cohorts, combines and ranks the answers and evidence from data of the cohorts and uses the ranked answers and evidence to gather statistically significant evidence of the cohorts to answer the question, wherein the person in the question is a patient, the question is a medical question for the patient and the cohort analysis mechanism processes the medical history of the patient and finds the cohorts from medical histories of other patients to predict the answer from the cohorts; and a non-transitory computer recordable medium bearing the cohort analysis mechanism. 14. The article of manufacture of claim 13 wherein the attributes include age, race and symptoms. 15. The article of manufacture of claim 13 wherein the threshold is a reference percentage allowing the cohorts to be divided into relative strengths for different percentage thresholds and wherein the relative strengths of the cohorts are used to score the answers and evidence.

Assignees

Inventors

Classifications

  • G06N5/041Primary

    Abduction · CPC title

  • Physics · mapped topic

  • G06N5/022Primary

    Knowledge engineering; Knowledge acquisition · CPC title

  • Forward inferencing; Production systems · CPC title

  • Distributed expert systems; Blackboards · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9836693B2 cover?
A cohort analysis mechanism analyzes cohorts with similar attributes to extrapolate additional knowledge and answer a question in a question answering system. The cohort analysis mechanism identifies cohorts for an entity of the question and extracts relevant data concerning the cohorts. The cohort analysis mechanism aggregates the relevant information for evidence scoring and answer scoring to…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06N5/041. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 05 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).