Database system for triggering event notifications based on updates to database records
US-2024419652-A1 · Dec 19, 2024 · US
US10089335B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-10089335-B2 |
| Application number | US-201213545398-A |
| Country | US |
| Kind code | B2 |
| Filing date | Jul 10, 2012 |
| Priority date | Jul 10, 2012 |
| Publication date | Oct 2, 2018 |
| Grant date | Oct 2, 2018 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Tracking lineage of data. A method may be practiced in a network computing environment including a plurality of interconnected systems where data is shared between the systems. A method includes accessing a dataset. The dataset is associated with lineage metadata. The lineage metadata includes data indicating the original source of the data, one or more intermediary entities that have performed operations on the dataset, and the nature of operations performed on the dataset. A first entity performs an operation on the dataset. As a result of performing a first operation on the dataset, the method includes updating the lineage metadata to indicate that the first entity performed the operation on the dataset. The method further includes providing functionality for determining if the lineage metadata has been compromised in that the lineage metadata has been at least one of removed from association with the dataset, is corrupted, or is incomplete.
Opening claim text (preview).
What is claimed is: 1. In a network computing environment comprising a plurality of interconnected systems where data is shared between the systems, a method of tracking the source, lineage, and integrity of data, the method comprising: accessing a dataset, the dataset having been signed by a first authority to ensure that the dataset has not been compromised; accessing lineage metadata associated with the dataset, the lineage metadata comprising data indicating the original source of the data and information about one or more operations which have been performed on the dataset, the information for each of the one or more operations including when the each operation was performed, an identity of an entity which performed the each operation, and the nature of the each operation, wherein the lineage metadata is signed by a second authority using a cryptographic certificate which allows dataset users to determine whether the lineage metadata has been compromised and whether to trust the second authority; determining a validity for the dataset by analyzing at least the signature of the first authority; determining a validity for the lineage metadata by analyzing at least the signature of the second authority; determining a trust level for the second authority; and based upon the determined validity of the dataset, the validity of the lineage metadata, and the determined trust level of the authority, performing an action that is associated with the dataset and the determined validity of the dataset, validity of the lineage metadata, and the trust level for the second authority. 2. The method of claim 1 , further comprising determining that the lineage metadata has been compromised including performing a checksum on the dataset and the lineage metadata. 3. The method of claim 1 , further comprising determining that the lineage metadata has been compromised including determining that embedded lineage metadata has been removed from the dataset. 4. The method of claim 1 , wherein invalidating the dataset comprises making the dataset generally unavailable. 5. The method of claim 1 , wherein invalidating the dataset comprises marking the dataset as invalid, but nonetheless allowing entities to obtain the dataset. 6. In a network computing environment comprising a plurality of interconnected systems where data is shared between the systems, a method of tracking lineage of data, the method comprising: accessing a dataset, the dataset having been signed by a first authority to ensure that the dataset has not been compromised; at a first entity, performing an operation on the dataset; accessing lineage metadata associated with the dataset, the lineage metadata comprising data indicating the original source of the data and information about one or more operations which have been performed on the dataset, the information for each of the one or more operations including when the each operation was performed, an identity of an entity which performed the each operation, and the nature of the each operation, wherein the lineage metadata is signed by a second authority using a cryptographic certificate which allows dataset users to determine whether the lineage metadata has been compromised and whether to trust the second authority; as a result of performing an operation on the dataset at the first entity, updating the lineage metadata to indicate that the first entity performed the operation on the dataset such that the lineage metadata includes when the operation performed at the first entity was performed, information about the first entity, an indication that the operation performed at the first entity was performed at the first entity, and the nature of the operation performed at the first entity; and computing a value which another entity can use to determine validity of the dataset and the lineage metadata. 7. The method of claim 6 , further comprising providing functionality for determining if the lineage metadata has been compromised including performing a checksum on the dataset and the lineage metadata. 8. The method of claim 6 , further comprising providing functionality for determining if the lineage metadata has been compromised including signing the dataset and the lineage metadata using an encryption key. 9. The method of claim 8 , wherein the encryption key is part of a chain of keys used by various entities to add lineage metadata as the result of previous operations. 10. The method of claim 6 , wherein the lineage metadata is associated with the dataset by a user manually creating the lineage metadata based on information the user has about the dataset, and the user manually associating the lineage metadata with the dataset. 11. The method of claim 6 , wherein the lineage metadata is associated with the dataset by a system automatically parsing logged operations on data in the dataset. 12. The method of claim 6 , wherein the lineage metadata is associated with the dataset by a system searching database repositories to determine the ultimate source of the dataset. 13. The method of claim 6 , wherein the lineage metadata is associated with the dataset and updated by a central governance entity that has API's allowing other entities to manage lineage metadata. 14. The method of claim 6 , wherein the lineage metadata is configured to be tracked on a database level, table level, row level, and cell level. 15. In a network computing environment comprising a plurality of interconnected systems where data is shared between the systems, a system for tracking lineage of data, the system comprising one or more processors having access to computer executable instructions that, when executed by the one or more processors, enable the processors to: access a dataset, the dataset having been signed by a first authority to ensure that the dataset has not been compromised; at a first entity, perform an operation on the dataset; access lineage metadata associated with the dataset, the lineage metadata comprising data indicating the original source of the data and information about one or more operations which have been performed on the dataset, the information for each of the one or more operations including when the each operation was performed, an identity of an entity which performed the each operation, and the nature of the each operation, wherein the lineage metadata is signed by a second authority using a cryptographic certificate which allows dataset users to determine whether the lineage metadata has been compromised and whether to trust the second authority; as a result of performing a first operation on the dataset, update the lineage metadata to indicate that the first entity performed the operation on the dataset such that the lineage metadata includes when the operation performed at the first entity was performed, information about the first entity, an indication that the operation performed at the first entity was performed at the first entity, and the nature of the operation performed at the first entity; computing a value which another entity can use to determine validity of the dataset and the lineage metadata; manage the lineage metadata at a central governance entity to allow the lineage metadata to enter and leave various repositories while still providing consistent management of the lineage metadata. 16. The system of claim 15 , further comprising providing functionality for determining if the lineage metadata has been compromised including performing a checksum on the dataset and the lineage metadata. 17. The system of claim 15 , further comprising providing function
Managing data history or versioning (querying versioned data G06F16/2474; querying temporal data G06F16/2477) · CPC title
Ensuring data consistency and integrity · CPC title
Change logging, detection, and notification (replication G06F16/27) · CPC title
Physics · mapped topic
Physics · mapped topic
Related publications grouped by family.
Answers are generated from the same data shown on this page.