Dynamic ontology for intelligent data discovery

US11977843B2 · US · B2

Patent metadata
Field               Value
Publication number  US-11977843-B2
Application number  US-202217648748-A
Country             US
Kind code           B2
Filing date         Jan 24, 2022
Priority date       Jan 24, 2022
Publication date    May 7, 2024
Grant date          May 7, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method, apparatus, system, and computer program code for intelligent data discovery with dynamic ontology are provided. According to one illustrative embodiment, the method using a number of processors to perform the steps of: identifying a set of data items in unstructured content using a dynamic data schema populated from a dynamic ontology; and responsive to identifying a data item that is not recognized in the data schema: storing the data item with labels; generating a weight for the data item; and responsive to the weight exceeding a threshold, updating the schema to include the data item that was not recognized.
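
As a rough illustration of the flow the abstract describes, the sketch below matches tokens against a schema, stores unrecognized items with labels, and adds an item to the schema once its weight exceeds a threshold. The names `Schema` and `process_document`, the whitespace tokenizer, the "capitalized token" heuristic, and the count-based weight are all assumptions made for illustration; the abstract does not say how items are identified or how the weight is computed.

```python
from dataclasses import dataclass, field


@dataclass
class Schema:
    """Hypothetical stand-in for the dynamic data schema."""
    known: set[str] = field(default_factory=set)
    # Unrecognized items, held with their labels and a running weight.
    pending: dict[str, dict] = field(default_factory=dict)


def process_document(text: str, schema: Schema, label: str,
                     threshold: float = 2.0) -> list[str]:
    """Return recognized items; track unrecognized capitalized tokens."""
    recognized = []
    for token in text.split():
        item = token.strip(".,;:")
        if item in schema.known:
            recognized.append(item)
        elif item.istitle():  # assumption: capitalized tokens are candidates
            entry = schema.pending.setdefault(
                item, {"labels": set(), "weight": 0.0})
            entry["labels"].add(label)   # store the item with labels
            entry["weight"] += 1.0       # assumption: one unit per sighting
            if entry["weight"] > threshold:
                schema.known.add(item)   # update the schema with the item
                del schema.pending[item]
    return recognized
```

Under these assumptions, an unknown name that appears in three separate documents crosses the default threshold of 2.0 on the third sighting and is recognized from then on.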

First claim

Opening claim text (preview).

What is claimed is:

1. A computer-implemented method for intelligent data discovery with dynamic ontology, the method comprising: using a number of processors to perform the steps of: identifying a set of data items in unstructured content using a dynamic data schema populated from a dynamic ontology; responsive to identifying a data item that is not recognized in the dynamic data schema: storing the data item with labels; generating a weight for the data item, wherein the weight is based on a proximity to other data elements in the dynamic ontology, and the weight is dynamically adjusting as additional documents are processed; and responsive to the weight exceeding a threshold, updating the dynamic data schema to include the data item that was not recognized.

2. The method of claim 1, wherein the schema comprises: known attributes of data items; known aliases of data items; and specified relationships between data items.

3. The method of claim 1, further comprising: identifying the data item using a named entity recognition; and identifying a set of relationships that relate the data item to the set of data items through a relation detection.

4. The method of claim 1, further comprising: responsive to updating the dynamic data schema, updating the dynamic ontology according to the dynamic data schema.

5. The method of claim 1, further comprising: classifying the unstructured content according to the set of data items and the dynamic ontology.

6. A computer system for intelligent data discovery with dynamic ontology, the computer system comprising: one or more processors; a set of one or more non-transitory computer-readable storage media; and program instructions, collectively stored in the set of one more non-transitory computer-readable storage media, for causing the one or more processors to perform the following computer operations: identify a set of data items in unstructured content using a dynamic data schema populated from a dynamic ontology; responsive to identifying a data item that is not recognized in the dynamic data schema: store the data item with labels; generate a weight for the data item, wherein the weight is based on a proximity to other data elements in the dynamic ontology, and the weight is dynamically adjusting as additional documents are processed; and responsive to the weight exceeding a threshold, update the dynamic data schema to include the data item that was not recognized.

7. The computer system of claim 6, wherein the schema comprises: known attributes of data items; known aliases of data items; and specified relationships between data items.

8. The computer system of claim 6, wherein the one or more processors are further configured to execute the program instructions to cause the system to: identify the data item using a named entity recognition; and identify a set of relationships that relate the data item to the set of data items through a relation detection.

9. The computer system of claim 6, wherein the one or more processors are further configured to execute the program instructions to cause the system to: responsive to updating the dynamic data schema, update the dynamic ontology according to the dynamic data schema.

10. The computer system of claim 6, wherein the one or more processors are further configured to execute the program instructions to cause the system to: classify the unstructured content according to the set of data items and the dynamic ontology.

11. A computer program product for intelligent data discovery with dynamic ontology, the computer program product comprising: a non-transitory computer-readable storage medium having program instructions embodied thereon to perform the steps of: identifying a set of data items in unstructured content using a dynamic data schema populated from a dynamic ontology; responsive to identifying a data item that is not recognized in the dynamic data schema: storing the data item with labels; generating a weight for the data item, wherein the weight is based on a proximity to other data elements in the dynamic ontology, and the weight is dynamically adjusting as additional documents are processed; and responsive to the weight exceeding a threshold, updating the dynamic data schema to include the data item that was not recognized.

12. The computer program product of claim 11, wherein the schema comprises: known attributes of data items; known aliases of data items; and specified relationships between data items.

13. The computer program product of claim 11, further comprising: identifying the data item using a named entity recognition; and identifying a set of relationships that relate the data item to the set of data items through a relation detection.

14. The computer program product of claim 11, further comprising: responsive to updating the dynamic data schema, updating the dynamic ontology according to the dynamic data schema.

15. The computer program product of claim 11, further comprising: classifying the unstructured content according to the set of data items and the dynamic ontology.
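
The independent claims tie the weight to "a proximity to other data elements in the dynamic ontology" and say it adjusts as more documents are processed, but the claim text gives no formula. One possible reading, sketched below purely as an assumption, treats proximity as token distance within a document and accumulates an inverse-distance score across documents. The function `proximity_score`, the class `CandidateWeights`, the `window` parameter, and the 1/distance scoring are hypothetical and are not taken from the patent.

```python
from collections import defaultdict


def proximity_score(tokens: list[str], candidate: str, known: set[str],
                    window: int = 5) -> float:
    """Sum 1/distance over known elements near each candidate occurrence."""
    score = 0.0
    for i, tok in enumerate(tokens):
        if tok != candidate:
            continue
        lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i and tokens[j] in known:
                score += 1.0 / abs(i - j)  # closer known elements count more
    return score


class CandidateWeights:
    """Running weights for unrecognized items (illustrative only)."""

    def __init__(self, threshold: float) -> None:
        self.threshold = threshold
        self.weights: defaultdict[str, float] = defaultdict(float)

    def observe(self, tokens: list[str], candidate: str,
                known: set[str]) -> bool:
        """Add one document's score; True once the weight exceeds the threshold."""
        self.weights[candidate] += proximity_score(tokens, candidate, known)
        return self.weights[candidate] > self.threshold
```

In this sketch a caller would invoke `observe` once per processed document and update the schema the first time it returns `True`. Other readings of "proximity" (for example, graph distance between ontology nodes) would fit the claim language equally well.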

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11977843B2 cover?
A method, apparatus, system, and computer program code for intelligent data discovery with dynamic ontology are provided. According to one illustrative embodiment, the method using a number of processors to perform the steps of: identifying a set of data items in unstructured content using a dynamic data schema populated from a dynamic ontology; and responsive to identifying a data item that is…
Who is the assignee on this patent?
S&P Global Inc
What technology area does this patent fall under?
Primary CPC classification G06F40/295. Mapped technology areas include Physics.
When was this patent published?
Publication date May 7, 2024 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).