Data stream quality management for analytic environments

US9460131B2 · US · B2

Patent metadata
Field: Value
Publication number: US-9460131-B2
Application number: US-201213463850-A
Country: US
Kind code: B2
Filing date: May 4, 2012
Priority date: May 4, 2012
Publication date: Oct 4, 2016
Grant date: Oct 4, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

According to one aspect of the present disclosure, a system and technique for data quality management is disclosed. The system includes a processor and an ingress quality specification (IQS) module executable by the processor in a runtime environment with a data stream analytic module. The IQS module is configured to: receive the data stream; analyze a subset of data of the data stream to determine if the subset of data meets a quality expectation of the analytic module; annotate the subset of data to indicate a quality status based on whether the subset of data meets the quality expectation of the analytic module; and output the data stream to the analytic module.
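The flow described in the abstract (receive, analyze, annotate, output) can be illustrated with a minimal sketch. This is not the patented implementation: the names `Record`, `iqs_module`, and the `quality_ok` annotation key are hypothetical, and the range check standing in for the analytic module's quality expectation is an assumption chosen only for illustration.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Iterable, Iterator


@dataclass
class Record:
    """One item of the data stream (hypothetical shape)."""
    value: float
    annotations: Dict[str, bool] = field(default_factory=dict)


def iqs_module(
    stream: Iterable[Record],
    meets_expectation: Callable[[Record], bool],
) -> Iterator[Record]:
    """Annotate each record with a quality status and pass it downstream.

    Nothing is dropped here: every record is output, and the annotation
    tells the analytic module whether its quality expectation was met.
    """
    for record in stream:
        record.annotations["quality_ok"] = meets_expectation(record)
        yield record


# Assumed expectation: a plausible sensor range of -50 to 60.
readings = [Record(20.5), Record(-999.0), Record(21.0)]
annotated = list(iqs_module(readings, lambda r: -50.0 <= r.value <= 60.0))
statuses = [r.annotations["quality_ok"] for r in annotated]
```

In this sketch the out-of-range reading stays in the stream but carries a failing status, which matches the abstract's description of annotating data instead of filtering it at ingress.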

First claim

Opening claim text (preview).

What is claimed is:

1. A system, comprising: an analytic module configured to analyze a data stream output by an object and output an analysis of the object, the analytic module having a data quality expectation of data of the data stream; a memory storing a plurality of ingress quality specification (IQS) modules each corresponding to a different quality characteristic and each associated with the analytic module; and an interface configured to enable a selection of at least one IQS module from the plurality of IQS modules to deploy with the analytic module to: receive the data stream from the object; analyze a subset of data of the data stream to determine if the subset of data meets the quality expectation of the analytic module; modify the subset of data by annotating the subset of data to indicate a quality status based on whether the subset of data meets the quality expectation of the analytic module; and output the data stream to the analytic module; and wherein the analytic module is configured to receive the data stream from the IQS module, identify data not meeting the quality expectation based on the annotations, and omit from its analysis of the object the data not meeting the quality expectation; and wherein the selected IQS module is configured to determine whether a selected subset of data of the data stream includes a minimum quantity of data samples based on the quality expectation of the analytic module.

2. The system of claim 1, wherein the selected IQS module is configured to: apply a predicate to the subset of data.

3. The system of claim 2, wherein the selected IQS module is configured to flag data of the subset passing the predicate.

4. The system of claim 3, wherein the selected IQS module is configured to flag data of the subset failing the predicate.

5. The system of claim 1, wherein the selected IQS module is configured to annotate a field of the data indicating the quality status.

6. The system of claim 1, wherein the selected IQS module includes: a selector module configured to select the subset of the data stream; and a predicate module configured to apply a predicate to the selected subset of data to determine whether the selected subset of data meets the quality expectation of the analytic module.

7. The system of claim 7 depends on claim 6: The system of claim 6, wherein the selector module is configured to select the subset of data based on a predetermined time period.
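Claims 1, 6, and 7 together describe a selector that picks a subset by time period, a predicate that checks a minimum quantity of samples, and an analytic module that omits data annotated as failing. The sketch below shows one way these pieces could fit together. All names (`Sample`, `select_by_time`, `min_samples`, `analytic_module`) are hypothetical, the fixed-width time bucketing assumes a time-ordered stream, and the mean is only a stand-in for whatever analysis the analytic module performs.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, Iterator, List, Optional


@dataclass
class Sample:
    timestamp: float                   # seconds (assumed unit)
    value: float
    quality_ok: Optional[bool] = None  # annotation field set by the IQS module


def select_by_time(stream: Iterable[Sample], period: float) -> Iterator[List[Sample]]:
    """Selector: group a time-ordered stream into fixed windows of `period` seconds."""
    window: List[Sample] = []
    current = None
    for s in stream:
        bucket = int(s.timestamp // period)
        if current is not None and bucket != current:
            yield window
            window = []
        current = bucket
        window.append(s)
    if window:
        yield window


def min_samples(minimum: int) -> Callable[[List[Sample]], bool]:
    """Predicate: the window must hold at least `minimum` samples."""
    return lambda window: len(window) >= minimum


def iqs_module(stream, period, predicate) -> Iterator[Sample]:
    """Annotate every sample with the status of its window, then output it."""
    for window in select_by_time(stream, period):
        ok = predicate(window)
        for s in window:
            s.quality_ok = ok
            yield s


def analytic_module(stream: Iterable[Sample]) -> Optional[float]:
    """Stand-in analysis (a mean) that omits samples annotated as failing."""
    values = [s.value for s in stream if s.quality_ok]
    return sum(values) / len(values) if values else None


samples = [Sample(0.0, 10.0), Sample(1.0, 12.0), Sample(2.0, 14.0), Sample(10.0, 100.0)]
result = analytic_module(iqs_module(samples, 10.0, min_samples(3)))
```

With a 10-second period and a minimum of three samples, the first window passes and the lone sample in the second window is annotated as failing, so the stand-in analysis only sees the first three values.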

Assignees

Inventors

Classifications

  • Ensuring data consistency and integrity · CPC title

  • Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors · CPC title

  • Annotation, e.g. comment data or footnotes · CPC title

  • Data stream processing; Continuous queries · CPC title

  • G06F16/22 · Primary

    Indexing; Data structures therefor; Storage structures · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

Who is the assignee on this patent?
George Randy, Mckeown Robert J, IBM
What technology area does this patent fall under?
Primary CPC classification G06F16/24568. Mapped technology areas include Physics.
When was this patent published?
Publication date: Oct 4, 2016 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).