Clinically relevant medical concept clustering
US-2017193185-A1 · Jul 6, 2017 · US
US10832802B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-10832802-B2 |
| Application number | US-201715635750-A |
| Country | US |
| Kind code | B2 |
| Filing date | Jun 28, 2017 |
| Priority date | Jan 6, 2016 |
| Publication date | Nov 10, 2020 |
| Grant date | Nov 10, 2020 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
The present invention embodiments are directed to methods, systems, and computer programs for identifying relations, within at least one taxonomy, between taxonomy categories and concepts extracted from electronic content. The relations represent semantic similarities for the concepts. The concepts are clustered based on the identified relations within the at least one taxonomy.
Opening claim text (preview).
What is claimed is: 1. A computer implemented method comprising: identifying, within a plurality of different taxonomies, relations between taxonomy categories of the different taxonomies and concepts extracted from electronic content, wherein the electronic content is from a medical record, the concepts include medical concepts extracted from the medical record, and the plurality of different taxonomies includes medical taxonomies, and wherein the relations represent semantic similarities for the concepts and the identifying relations further includes: mapping the concepts to each of the plurality of different taxonomies, wherein mapping the concepts includes: determining a first concept extracted from the electronic content not found in a selected taxonomy of the plurality of different taxonomies; identifying one or more other taxonomies of the plurality of different taxonomies containing the first concept and determining a second concept that resides in the selected taxonomy and the identified one or more other taxonomies; and mapping the first concept to the second concept within the selected taxonomy when the second concept is closest to the first concept in the identified one or more other taxonomies and within a distance limit of the first concept, wherein the first concept remains unmapped to the selected taxonomy in response to the second concept not satisfying the distance limit; generating concept vectors relating each of the concepts to one or more corresponding taxonomy categories of the different taxonomies, wherein each concept vector is associated with a concept and includes a plurality of values with each value indicating a relationship between that associated concept and a corresponding taxonomy category, and wherein at least one concept has relations to taxonomy categories in two or more taxonomies; and determining a similarity measure between each of the concept vectors of the concepts based on distances between the concept vectors; clustering the concepts based on the determined similarity measure between the concept vectors; and generating a visualization of the electronic content with information arranged according to the clustered concepts to identify information within the electronic content relevant to a situation. 2. The method of claim 1 , further comprising: performing named-entity recognition and disambiguation on the concepts, which includes concept identification, named-entity detection, and linking of each identified concept to a meaning. 3. The method of claim 1 , further comprising: identifying the concepts from structured information within the electronic content. 4. The method of claim 1 , further comprising: identifying the concepts from unstructured information within the electronic content. 5. The method of claim 1 , further comprising: generating a similarity matrix relating the concepts based on the similarity measures; and clustering the concepts based on the similarity measures. 6. The method of claim 1 , wherein the taxonomy categories represent a feature space for clustering of the concepts, and wherein the method further comprising: performing dimensionality reduction to remove features from the feature space to reduce processing time. 7. The method of claim 1 , wherein the semantic similarities for the concepts represent relative relationships of the concepts to the taxonomy categories such that the concepts are clustered based on identified relevance.
for patient-specific data, e.g. for electronic patient records · CPC title
for mining of medical data, e.g. analysing previous cases of other patients · CPC title
Clustering or classification · CPC title
Query execution (filtering based on additional data G06F16/335) · CPC title
into predefined classes · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.