Identifying potential patient candidates for clinical trials

US10162866B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10162866-B2
Application numberUS-201715672717-A
CountryUS
Kind codeB2
Filing dateAug 9, 2017
Priority dateJun 3, 2016
Publication dateDec 25, 2018
Grant dateDec 25, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A computer system gleans data from patient records and clinical trial descriptions using NLP techniques. NLP annotation data is used to generate clinical trial feature vectors and patient feature vectors. Clinical trial feature vectors and patient feature vectors are compared to match appropriate patient candidates with clinical trial openings.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method for matching clinical trial openings with candidates from a patient population, the method comprising: identifying a first clinical trial description; generating annotations of the first clinical trial description, based on natural language processing techniques including Unstructured Information Management Architecture (UIMA), wherein the UIMA includes a language identification module using n-gram models for determining the language of the first clinical trial description; a linguistic analysis module for identifying parts of speech, syntactic position, and function; a dictionary module for defining terms and identifying synonyms; a named entity recognition module for identifying names, locations, and companies; a pattern recognition module for interpreting phrases and strings of numbers; a classification module for classifying the first clinical trial description according to content; and one or more custom annotators for identifying clinical features, disease states, diagnoses, one or more patient medications, one or more patient demographics, a patient-provider relationship, and a stage of disease progression; and wherein the annotations populate text indices, triple stores, and relational databases; generating a first trial feature vector based on the annotations of the first clinical trial description, the first trial feature vector comprising a first array of values in a matrix, wherein the first array of values represents disease states, diagnoses, one or more patient medications, one or more patient demographics, a patient-provider relationship, and a stage of disease progression; determining that a first value in the first array of values should be prioritized; transforming the first value into a weighted vector value, wherein the weighted vector value is used to generate a second trial feature vector; retrieving the first patient record and other patient records stored in a collection database, using a crawler, based on similarities between the annotations of the clinical trial description and contents of the patient records; applying natural language processing techniques to the first patient record, the natural language processing techniques including Unstructured Information Management Architecture (UIMA), wherein the UIMA includes a language identification module using n-gram models for determining the language of the first patient record; a linguistic analysis module for identifying parts of speech, syntactic position and function; a dictionary module for defining terms and identifying synonyms; a named entity recognition module for identifying names, locations, and companies; a pattern recognition module for interpreting phrases and strings of numbers; a classification module for classifying the first patient record according to content; and one or more custom annotators for identifying clinical features, disease states, diagnoses, one or more patient medications, one or more patient demographics, a patient-provider relationship, and a stage of disease progression; wherein the UIMA generates annotations of the first patient record, the annotations populating one or more of text indices, triple stores, and relational databases; generating a first patient feature vector based on the annotations of the first patient record, the first patient feature vector comprising the second array of values in a matrix, wherein the second array of values represents disease states, diagnoses, one or more patient medications, one or more patient demographics, a patient-provider relationship, and a stage of disease progression; generating a comparison value based on the second trial feature vector and the first patient feature vector, wherein the comparison value represents a logical distance between the clinical trial feature vector and the patient feature vector; displaying, via an interactive user interface, information about a patient associated with the patient record to a user based on the comparison value, wherein the information about the patient, including at least the comparison value, an identification of the patient, and a contact information for the patient, is displayed with information about other patients based on a plurality of comparison values generated from a plurality of patient records associated with the other patients, and wherein the interactive user interface allows the user to sort the information about the patient and the information about other patients based on sorting options comprising comparison values, patient name, and patient contact information; receiving, through the interactive user interface, a threshold comparison value from the user; and displaying, via the interactive user interface, information about a set of patients, the set of patients associated with comparison values that meet the threshold comparison value, and wherein the set of patients is filtered and sorted based on patient name, contact information, geographic region, and disease type.

Assignees

Inventors

Classifications

  • Named entity recognition · CPC title

  • G16H10/20Primary

    for electronic clinical trials or questionnaires · CPC title

  • for patient-specific data, e.g. for electronic patient records · CPC title

  • Morphological analysis · CPC title

  • for local operation · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10162866B2 cover?
A computer system gleans data from patient records and clinical trial descriptions using NLP techniques. NLP annotation data is used to generate clinical trial feature vectors and patient feature vectors. Clinical trial feature vectors and patient feature vectors are compared to match appropriate patient candidates with clinical trial openings.
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G16H10/20. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 25 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).