Unsupervised ontology-based graph extraction from texts
US-10169454-B2 · Jan 1, 2019 · US
US11977843B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11977843-B2 |
| Application number | US-202217648748-A |
| Country | US |
| Kind code | B2 |
| Filing date | Jan 24, 2022 |
| Priority date | Jan 24, 2022 |
| Publication date | May 7, 2024 |
| Grant date | May 7, 2024 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A method, apparatus, system, and computer program code for intelligent data discovery with dynamic ontology are provided. According to one illustrative embodiment, the method using a number of processors to perform the steps of: identifying a set of data items in unstructured content using a dynamic data schema populated from a dynamic ontology; and responsive to identifying a data item that is not recognized in the data schema: storing the data item with labels; generating a weight for the data item; and responsive to the weight exceeding a threshold, updating the schema to include the data item that was not recognized.
Opening claim text (preview).
What is claimed is: 1. A computer-implemented method for intelligent data discovery with dynamic ontology, the method comprising: using a number of processors to perform the steps of: identifying a set of data items in unstructured content using a dynamic data schema populated from a dynamic ontology; responsive to identifying a data item that is not recognized in the dynamic data schema: storing the data item with labels; generating a weight for the data item, wherein the weight is based on a proximity to other data elements in the dynamic ontology, and the weight is dynamically adjusting as additional documents are processed; and responsive to the weight exceeding a threshold, updating the dynamic data schema to include the data item that was not recognized. 2. The method of claim 1 , wherein the schema comprises: known attributes of data items; known aliases of data items; and specified relationships between data items. 3. The method of claim 1 , further comprising: identifying the data item using a named entity recognition; and identifying a set of relationships that relate the data item to the set of data items through a relation detection. 4. The method of claim 1 , further comprising: responsive to updating the dynamic data schema, updating the dynamic ontology according to the dynamic data schema. 5. The method of claim 1 , further comprising: classifying the unstructured content according to the set of data items and the dynamic ontology. 6. A computer system for intelligent data discovery with dynamic ontology, the computer system comprising: one or more processors; a set of one or more non-transitory computer-readable storage media; and program instructions, collectively stored in the set of one more non-transitory computer-readable storage media, for causing the one or more processors to perform the following computer operations: identify a set of data items in unstructured content using a dynamic data schema populated from a dynamic ontology; responsive to identifying a data item that is not recognized in the dynamic data schema: store the data item with labels; generate a weight for the data item, wherein the weight is based on a proximity to other data elements in the dynamic ontology, and the weight is dynamically adjusting as additional documents are processed; and responsive to the weight exceeding a threshold, update the dynamic data schema to include the data item that was not recognized. 7. The computer system of claim 6 , wherein the schema comprises: known attributes of data items; known aliases of data items; and specified relationships between data items. 8. The computer system of claim 6 , wherein the one or more processors are further configured to execute the program instructions to cause the system to: identify the data item using a named entity recognition; and identify a set of relationships that relate the data item to the set of data items through a relation detection. 9. The computer system of claim 6 , wherein the one or more processors are further configured to execute the program instructions to cause the system to: responsive to updating the dynamic data schema, update the dynamic ontology according to the dynamic data schema. 10. The computer system of claim 6 , wherein the one or more processors are further configured to execute the program instructions to cause the system to: classify the unstructured content according to the set of data items and the dynamic ontology. 11. A computer program product for intelligent data discovery with dynamic ontology, the computer program product comprising: a non-transitory computer-readable storage medium having program instructions embodied thereon to perform the steps of: identifying a set of data items in unstructured content using a dynamic data schema populated from a dynamic ontology; responsive to identifying a data item that is not recognized in the dynamic data schema: storing the data item with labels; generating a weight for the data item, wherein the weight is based on a proximity to other data elements in the dynamic ontology, and the weight is dynamically adjusting as additional documents are processed; and responsive to the weight exceeding a threshold, updating the dynamic data schema to include the data item that was not recognized. 12. The computer program product of claim 11 , wherein the schema comprises: known attributes of data items; known aliases of data items; and specified relationships between data items. 13. The computer program product of claim 11 , further comprising: identifying the data item using a named entity recognition; and identifying a set of relationships that relate the data item to the set of data items through a relation detection. 14. The computer program product of claim 11 , further comprising: responsive to updating the dynamic data schema, updating the dynamic ontology according to the dynamic data schema. 15. The computer program product of claim 11 , further comprising: classifying the unstructured content according to the set of data items and the dynamic ontology.
Named entity recognition · CPC title
Ontology · CPC title
Thesauruses; Synonyms · CPC title
Lexical tools · CPC title
Recognition of textual entities · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.