Identifying and scoring data values

US9886489B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9886489-B2
Application numberUS-201514724763-A
CountryUS
Kind codeB2
Filing dateMay 28, 2015
Priority dateSep 23, 2014
Publication dateFeb 6, 2018
Grant dateFeb 6, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Text including at least a first term can be presented on a display. An enterprise glossary is queried to identify other terms that match the first term. Data assets to which each of the other terms are linked and which include data values for the other terms can be identified. A first score indicating a level of relevance of the respective data asset to an enterprise is assigned to each of the data assets. A frequency distribution of the data values in the data assets is determined. Based at least on the first scores indicating the level of relevance of the respective data assets to the enterprise and the frequency distribution of the data values in the data assets, second scores are assigned to each of the data values. A plurality the data values which are assigned highest of the second scores are presented on the display.

First claim

Opening claim text (preview).

What is claimed is: 1. A method, comprising: presenting on a display text including at least a first term; querying an enterprise glossary to identify other terms that match the first term; identifying data assets to which each of the other terms that match the first term are linked and which include data values for the other terms, and assigning to each of the data assets a first score indicating a level of relevance of the respective data asset to an enterprise; determining a frequency distribution of the data values in the data assets; based at least on the first scores indicating the level of relevance of the respective data assets to the enterprise and the frequency distribution of the data values in the data assets, assigning, using a processor, second scores to each of the data values; and presenting on the display a plurality the data values which are assigned highest of the second scores. 2. The method of claim 1 , wherein assigning, using a processor, second scores to each of the data values further is based on, for each of the data values, a third score assigned to a respective other term that matches the first term indicating a level of relevance of the other term to the enterprise. 3. The method of claim 1 , wherein the second score assigned to each data value is based on a use of the data value across a plurality of the data assets. 4. The method of claim 1 , further comprising: identifying a plurality of data assets to which the first term is linked and which include data values for the first term, and assigning to each of the data assets to which the first term is linked a third score indicating a level of relevance to an enterprise; wherein assigning, using the processor, the second scores to each of the data values further is based on the third scores. 5. The method of claim 4 , wherein determining the frequency distribution of the data values in the data assets comprises determining a frequency of distribution of the data values in the plurality of data assets to which the first term is linked. 6. The method of claim 1 , wherein querying the enterprise glossary to identify the other terms that match the first term is responsive to presenting on the display the text including at least the first term. 7. The method of claim 1 , wherein querying the enterprise glossary to identify the other terms that match the first term is responsive to a user selection of the first term.

Assignees

Inventors

Classifications

  • Query execution (filtering based on additional data G06F16/335) · CPC title

  • G06Q10/00Primary

    Administration; Management · CPC title

  • Presentation of query results · CPC title

  • Presentation of query results · CPC title

  • using ranking · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9886489B2 cover?
Text including at least a first term can be presented on a display. An enterprise glossary is queried to identify other terms that match the first term. Data assets to which each of the other terms are linked and which include data values for the other terms can be identified. A first score indicating a level of relevance of the respective data asset to an enterprise is assigned to each of the …
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06Q10/00. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 06 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).