Active management of files being processed in enterprise data warehouses utilizing time series predictions
US-2024256573-A1 · Aug 1, 2024 · US
US9870419B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9870419-B2 |
| Application number | US-201213407712-A |
| Country | US |
| Kind code | B2 |
| Filing date | Feb 28, 2012 |
| Priority date | Dec 31, 2010 |
| Publication date | Jan 16, 2018 |
| Grant date | Jan 16, 2018 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
In an embodiment of the invention, a method for data profiling incorporating an enterprise service bus (ESB) coupling the target and source systems following an extraction, transformation, and loading (ETL) process for a target system and a source system is provided. The method includes receiving baseline data profiling results obtained during ETL from a source application to a target application, caching the updates, determining current data profiling results within the ESB for cached updates, and triggering an action if a threshold disparity is detected upon the current data profiling results and the baseline data profiling results.
Opening claim text (preview).
We claim: 1. A method for in-memory cache data profiling, comprising: executing an Extract Transfer Load (ETL) process in which data is first moved from a source database of a source application to a target database of a target application during which movement the data is extracted from the source database into a persistency comprising a staging area, an alignment area and a preload area, then transformed to a model that is common to both the source and target databases, and finally cleansed and loaded into the target database so as to initially populate a data warehouse; performing baseline data profiling on the extracted data in the persistency during ETL in order to produce baseline data profiling results; and, subsequent to the ETL, receiving in an enterprise service bus (ESB) updates to the source database and placing the updates in a cache memory on the ESB, determining whether multi-record profiling or only single record profiling has been selected for profiling cached updates, on condition that multi-record profiling is selected, performing data profiling on the updates in the cache on the ESB and determining current data profiling results for the cached updates and, comparing the current data profiling results for the cached updates to the baseline data profiling results, but otherwise performing single record profiling on the updates without comparing the current data profiling results for the cached updates to the baseline data profiling results, and triggering an action if a threshold disparity is detected based upon the current data profiling results. 2. The method of claim 1 , wherein the action is a data governance action. 3. The method of claim 2 , wherein the data governance action is integrated with at least one data governance application. 4. The method of claim 2 , wherein the action is notifying a data steward.
Multi-dimensional databases or data warehouses, e.g. MOLAP or ROLAP · CPC title
Physics · mapped topic
Related publications grouped by family.
Answers are generated from the same data shown on this page.