Generating a trust factor for data in non-relational or relational databases

US2018060370A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2018060370-A1
Application numberUS-201615252282-A
CountryUS
Kind codeA1
Filing dateAug 31, 2016
Priority dateAug 31, 2016
Publication dateMar 1, 2018
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A computer-implemented method includes detecting an update to a record in an entity table of a database. At least one of an age score, a lineage score, and a completeness score for the record is calculated, responsive to the update. A trust factor is calculated, by a computer processor, based on the at least one of the age score, the lineage score, and the completeness score for the record. The trust factor indicates a level of trustworthiness of the record. It is decided whether to use data in the record based on the trust factor.

First claim

Opening claim text (preview).

What is claimed is: 1 . A computer-implemented method, comprising: detecting an update to a record in an entity table of a database; calculating at least one of an age score, a lineage score, and a completeness score for the record, responsive to the update; calculating, by a computer processor, a trust factor based on the at least one of the age score, the lineage score, and the completeness score for the record, wherein the trust factor indicates a level of trustworthiness of the record; and deciding, by the computer processor, whether to use data in the record based on the trust factor. 2 . The computer-implemented method of claim 1 , wherein the database is a non-relational database. 3 . The computer-implemented method of claim 1 , wherein the calculating at least one of the age score, the lineage score, and the completeness score for the record comprises: determining an age of the record; identifying a maximum age of a plurality of records in the entity table; calculating the age score of the record based on the age of the record and the maximum age of the plurality of records. 4 . The computer-implemented method of claim 1 , wherein the calculating at least one of the age score, the lineage score, and the completeness score for the record comprises: identifying a data source of the record; identifying a weight associated with the data source; and determining the lineage score based on the weight associated with the data source. 5 . The computer-implemented method of claim 1 , wherein the calculating at least one of the age score, the lineage score, and the completeness score for the record comprises: identifying a first family of columns in a synonym table, wherein the first family of columns comprises a first set of columns deemed to be synonyms to one another; identifying a corresponding weight for each column in the first set of columns of the first family; determining which of the first set of columns are filled in the record; calculating the completeness score based on the corresponding weight for each of the first set of columns that are filled in the record. 6 . The computer-implemented method of claim 5 , wherein the calculating at least one of the age score, the lineage score, and the completeness score for the record comprises: identifying a second family of columns in a synonym table, wherein the second family of columns comprises a second set of columns deemed to be synonyms to one another, wherein the second family is distinct from the first family; identifying a corresponding weight for each column in the second set of columns of the second family; and determining which of the second set of columns are filled in the record; wherein the calculating the completeness score based on the corresponding weight for each of the first set of columns that are filled in the record further comprises basing the completeness score on the corresponding weight for of the second set of columns that are filled in the record. 7 . The computer-implemented method of claim 1 , wherein the calculating at least one of the age score, the lineage score, and the completeness score for the record is based on a configuration table, and further comprising: modifying the configuration table based on historical use of the entity table. 8 . A system comprising: a memory having computer readable instructions; and one or more processors for executing the computer readable instructions, the computer readable instructions comprising: detecting an update to a record in an entity table of a database; calculating at least one of an age score, a lineage score, and a completeness score for the record, responsive to the update; calculating a trust factor based on the at least one of the age score, the lineage score, and the completeness score for the record, wherein the trust factor indicates a level of trustworthiness of the record; and deciding whether to use data in the record based on the trust factor. 9 . The system of claim 8 , wherein the database is a non-relational database. 10 . The system of claim 8 , wherein the calculating at least one of the age score, the lineage score, and the completeness score for the record comprises: determining an age of the record; identifying a maximum age of a plurality of records in the entity table; calculating the age score of the record based on the age of the record and the maximum age of the plurality of records. 11 . The system of claim 8 , wherein the calculating at least one of the age score, the lineage score, and the completeness score for the record comprises: identifying a data source of the record; identifying a weight associated with the data source; and determining the lineage score based on the weight associated with the data source. 12 . The system of claim 8 , wherein the calculating at least one of the age score, the lineage score, and the completeness score for the record comprises: identifying a first family of columns in a synonym table, wherein the first family of columns comprises a first set of columns deemed to be synonyms to one another; identifying a corresponding weight for each column in the first set of columns of the first family; determining which of the first set of columns are filled in the record; calculating the completeness score based on the corresponding weight for each of the first set of columns that are filled in the record. 13 . The system of claim 12 , wherein the calculating at least one of the age score, the lineage score, and the completeness score for the record comprises: identifying a second family of columns in a synonym table, wherein the second family of columns comprises a second set of columns deemed to be synonyms to one another, wherein the second family is distinct from the first family; identifying a corresponding weight for each column in the second set of columns of the second family; and determining which of the second set of columns are filled in the record; wherein the calculating the completeness score based on the corresponding weight for each of the first set of columns that are filled in the record further comprises basing the completeness score on the corresponding weight for of the second set of columns that are filled in the record. 14 . The system of claim 8 , wherein the calculating at least one of the age score, the lineage score, and the completeness score for the record is based on a configuration table, and further comprising: modifying the configuration table based on historical use of the entity table. 15 . A computer program product for determining a trust factor of a record, the computer program product comprising a computer readable storage medium having program instructions embodied therewith, the program instructions executable by a processor to cause the processor to perform a method comprising: detecting an update to a record in an entity table of a database; calculating at least one of an age score, a lineage score, and a completeness score for the record, responsive to the update; calculating a trust factor based on the at least one of the age score, the lineage score, and the completeness score for the record, wherein the trust factor indicates a level of trustworthiness of the record; and deciding whether to use data in the record based on the trust factor. 16 . The computer program product of claim 15 , wherein the database is a non-relational database. 17 . The computer program product of claim 15 , wherein the calculating at least one of the age score, the lineage

Assignees

Inventors

Classifications

  • Physics · mapped topic

  • Physics · mapped topic

  • Physics · mapped topic

  • G06F16/215Primary

    Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors · CPC title

  • G06F16/288Primary

    Entity relationship models · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2018060370A1 cover?
A computer-implemented method includes detecting an update to a record in an entity table of a database. At least one of an age score, a lineage score, and a completeness score for the record is calculated, responsive to the update. A trust factor is calculated, by a computer processor, based on the at least one of the age score, the lineage score, and the completeness score for the record. The…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F17/30339. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Mar 01 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).