Method and apparatus for data quality management and control

US10248674B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10248674-B2
Application numberUS-201615230308-A
CountryUS
Kind codeB2
Filing dateAug 5, 2016
Priority dateDec 4, 2015
Publication dateApr 2, 2019
Grant dateApr 2, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Embodiments of the present invention provide a method and an apparatus for data quality management and control. The method includes: receiving application information transmitted by a service sub-system; resolving datasheet operation trigger information to obtain datasheet flow information; receiving user information transmitted by the service sub-system and a target datasheet transmitted by the service sub-system; if a name of the target datasheet is different from a plurality of datasheet names corresponding to the service sub-system identifier, then instructing the service sub-system to store the target datasheet into a data center; if the datasheet operation information is updating a datasheet, instructing the data center to replace datasheet contents corresponding to the datasheet name with contents of the target datasheet.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for data quality management and control, comprising: receiving, by a system for data quality management and control, application information transmitted by a service sub-system, wherein the application information comprises a service sub-system identifier, datasheet operation information and datasheet operation trigger information; resolving, by the system, the datasheet operation trigger information to obtain datasheet flow information; receiving, by the system, user information transmitted by the service sub-system and a target datasheet transmitted by the service sub-system; if the datasheet operation information is adding a datasheet, then querying, by the system, pieces of history information according to the service sub-system identifier to obtain a plurality of datasheet names corresponding to the service sub-system identifier; if a name of the target datasheet is different from the plurality of datasheet names, then instructing, by the system, the service sub-system to store the target datasheet into a data center; if the datasheet operation information is updating a datasheet, then querying, by the system, pieces of history information according to the name of the target datasheet to obtain a datasheet name that is the same as the name of the target datasheet, and instructing, by the system, the data center to replace datasheet contents corresponding to the datasheet name with contents of the target datasheet; wherein the pieces of history information at least comprise the service sub-system identifier and the datasheet name. 2. The method according to claim 1 , after the instructing, by the system, the service sub-system to store the target datasheet into the data center, further comprising: receiving, by the system, a name of a tablespace storing the target datasheet as transmitted by the data center and an interface mode between the service sub-system and the data center transmitted by the data center; and generating, by the system, current record information, wherein the current record information comprises the service sub-system identifier, the name of the tablespace, the name of the target datasheet, the user information, the datasheet flow information, operation authorization information of the target datasheet and the interface mode. 3. The method according to claim 2 , after the receiving, by the system, the user information transmitted by the service sub-system and the target datasheet transmitted by the service sub-system, further comprising: determining, by the system, whether data in the target datasheet is in conformity with a preset data rule; if the data in the target datasheet is not in conformity with the preset data rule, then transmitting, by the system, warning information to the service sub-system to enable a user to modify a data format in the target datasheet. 4. The method according to claim 3 , after the generating, by the system, the current record information, further comprising: analyzing, by the system, an importance degree of each datasheet according to the current record information and the pieces of history information, the more the datasheet flow information corresponding to the datasheet name, the higher the importance degree of the datasheet. 5. The method according to claim 4 , further comprising: setting, by the system, a datasheet collecting rule, and collecting by the system, a plurality of datasheets from the data center according to the datasheet collecting rule; determining, by the system, whether names of any two datasheets in the plurality of the datasheets are the same, if the names of the two datasheets are the same, then determining, by the system, whether contents of the two datasheets are the same; if the contents of the two datasheets are the same, then transmitting, by the system, a first deleting command to the data center to enable the data center to delete any one of the two datasheets; if the contents of the two datasheets are different, then acquiring, by the system, timestamps of the two datasheets from the data center, and transmitting, by the system, a second deleting command to the data center to enable the data center to delete a datasheet having a smaller timestamp in the two datasheets. 6. An apparatus for data quality management and control, comprising: a processor; and a computer-readable medium for storing program codes, which, when executed by the processor, cause the processor to: receive application information transmitted by a service sub-system, wherein the application information comprises a service sub-system identifier, datasheet operation information and datasheet operation trigger information; receive user information transmitted by the service sub-system and a target datasheet transmitted by the service sub-system; resolve the datasheet operation trigger information to obtain datasheet flow information; if the datasheet operation information is adding a datasheet, then query pieces of history information according to the service sub-system identifier to obtain a plurality of datasheet names corresponding to the service sub-system identifier; if the datasheet operation information is updating a datasheet, then query pieces of history information according to a name of the target datasheet to obtain a datasheet name that is the same as the name of the target datasheet; and if the name of the target datasheet is different from the plurality of datasheet names, then instruct the service sub-system to store the target datasheet into a data center; and instruct the data center to replace datasheet contents corresponding to the datasheet name with contents of the target datasheet; wherein the pieces of history information at least comprise the service sub-system identifier and the datasheet name. 7. The apparatus for data quality management and control according to claim 6 , wherein the program codes further cause the processor to: after instruct the service sub-system to store the target datasheet into the data center, receive a name of a tablespace storing the target datasheet as transmitted by the data center and an interface mode between the service sub-system and the data center transmitted by the data center; the program codes further cause the processor to: generate current record information, wherein the current record information comprises the service sub-system identifier, the name of the tablespace, the name of the target datasheet, the user information, the datasheet flow information, operation authorization information of the target datasheet and the interface mode. 8. The apparatus for data quality management and control according to claim 7 , wherein the program codes further cause the processor to: after receive the user information transmitted by the service sub-system and the target datasheet transmitted by the service sub-system, determine whether data in the target datasheet is in conformity with a preset data rule; if the data in the target datasheet is not in conformity with the preset data rule, then transmit warning information to the service sub-system to enable a user to modify a data format in the target datasheet. 9. The apparatus for data quality management and control according to claim 8 , wherein, the program codes further cause the processor to: analyze an importance degree of each datasheet according to the current record information and the pieces of history information, the more the datasheet flow information corresponding to the datasheet name, the higher the importance degree of the datasheet. 10. The apparatus for data quality management and control according to claim 9 , wherein the program codes further cause the processor to: set a datasheet

Assignees

Inventors

Classifications

  • Physics · mapped topic

  • Physics · mapped topic

  • Physics · mapped topic

  • G06F16/219Primary

    Managing data history or versioning (querying versioned data G06F16/2474; querying temporal data G06F16/2477) · CPC title

  • De-duplication implemented within the file system, e.g. based on file segments (de-duplication techniques in storage systems for the management of data blocks G06F3/0641) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10248674B2 cover?
Embodiments of the present invention provide a method and an apparatus for data quality management and control. The method includes: receiving application information transmitted by a service sub-system; resolving datasheet operation trigger information to obtain datasheet flow information; receiving user information transmitted by the service sub-system and a target datasheet transmitted by th…
Who is the assignee on this patent?
Jiangxi Electric Power Corp Information And Communications Branch Of State Grid, State Grid Corp China
What technology area does this patent fall under?
Primary CPC classification G06F17/30303. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 02 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).