Integrated data extraction and retrieval system

US2017223003A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2017223003-A1
Application numberUS-201715486480-A
CountryUS
Kind codeA1
Filing dateApr 13, 2017
Priority dateJan 19, 2015
Publication dateAug 3, 2017
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A first computing device connected to an internal network de-anonymizes data. A record including a surrogate key is received from a second computing device connected to an external network to the internal network. Each identity data record includes a second surrogate key, an entity identifier field value, a record type field value, and a de-identified field value. The second surrogate key uniquely identifies the respective record. The surrogate key is compared to the second surrogate key to identify a matching record. The matching entity identifier field value is selected and compared to the entity identifier field value of the plurality of records to identify a master record for the surrogate key. The record type field value includes an indicator indicating whether the record is the master record. The de-identified field value included in the identified master record is selected. The received record is supplemented with the selected de-identified field value.

First claim

Opening claim text (preview).

What is claimed is: 1 . A non-transitory computer-readable medium having stored thereon computer-readable instructions that when executed by a first computing device control the first computing device to: receive a record from a second computing device connected to an external network separated from an internal network by a firewall, wherein the first computing device is connected to the internal network, wherein the record includes a surrogate key; read identity data stored on the internal network that includes a plurality of records, wherein each record includes a second surrogate key, an entity identifier field value, a record type field value, and a de-identified field value, wherein the second surrogate key uniquely identifies the respective record; compare the surrogate key to the second surrogate key to identify a matching record from the read identity data; select the entity identifier field value included in the identified matching record; compare the selected entity identifier field value to the entity identifier field value of the plurality of records to identify a master record for the surrogate key from the read identity data, wherein the record type field value includes an indicator that indicates whether or not the respective record is the master record; select the de-identified field value included in the identified master record; supplement the received record with the selected de-identified field value; and store the supplemented record on the internal network. 2 . The non-transitory computer-readable medium of claim 1 , wherein each record includes a plurality of de-identified field values, wherein the de-identified field value is one of the plurality of de-identified field values. 3 . The non-transitory computer-readable medium of claim 1 , wherein, before selecting the entity identifier field value, the computer-readable instructions further control the first computing device to: receive a result type value, wherein the result type value indicates whether or not to retrieve the master record; when the received result type value indicates to retrieve the master record, the entity identifier field value is selected, the selected entity identifier field value is compared to the entity identifier field value of the plurality of records, and the de-identified field value included in the identified master record is selected; and when the received result type value does not indicate to retrieve the master record, the de-identified field value is selected from the identified matching record. 4 . The non-transitory computer-readable medium of claim 1 , wherein, before comparing the surrogate key to the second surrogate key, the computer-readable instructions further control the first computing device to decrypt the received record, wherein the decrypted surrogate key is compared to the second surrogate key. 5 . The non-transitory computer-readable medium of claim 1 , wherein, before receiving the record, the computer-readable instructions further control the first computing device to: read retrieval instructions stored on the internal network, wherein the retrieval instructions include connection information used to connect to the second computing device; and connect to the second computing device using the read retrieval instructions. 6 . The non-transitory computer-readable medium of claim 5 , wherein the retrieval instructions are read from a handle lookup table using a predefined output handle name. 7 . The non-transitory computer-readable medium of claim 5 , wherein the connection information includes a protocol, credentials, and a connectivity option. 8 . The non-transitory computer-readable medium of claim 7 , wherein the credentials include a username and a password for connecting to the second computing device. 9 . The non-transitory computer-readable medium of claim 7 , wherein the connectivity option includes a hostname of the second computing device and a port number of the second computing device to which to connect. 10 . The non-transitory computer-readable medium of claim 7 , wherein the connection information further includes an indicator defining whether receipt of the record is sequential or parallel. 11 . The non-transitory computer-readable medium of claim 1 , wherein the computer-readable instructions further control the first computing device to: read a post-process script stored on the internal network; and execute the post-process script with the supplemented record on the internal network. 12 . The non-transitory computer-readable medium of claim 11 , wherein the post-process script streams the supplemented record to an event stream processing engine. 13 . A computing device comprising: a processor; and a non-transitory computer-readable medium operably coupled to the processor, the computer-readable medium having computer-readable instructions stored thereon that, when executed by the processor, control the computing device to receive a record from a second computing device connected to an external network separated from an internal network by a firewall, wherein the computing device is connected to the internal network, wherein the record includes a surrogate key; read identity data stored on the internal network that includes a plurality of records, wherein each record includes a second surrogate key, an entity identifier field value, a record type field value, and a de-identified field value, wherein the second surrogate key uniquely identifies the respective record; compare the surrogate key to the second surrogate key to identify a matching record from the read identity data; select the entity identifier field value included in the identified matching record; compare the selected entity identifier field value to the entity identifier field value of the plurality of records to identify a master record for the surrogate key from the read identity data, wherein the record type field value includes an indicator that indicates whether or not the respective record is the master record; select the de-identified field value included in the identified master record; supplement the received record with the selected de-identified field value; and store the supplemented record on the internal network. 14 . The computing device of claim 13 , wherein the computer-readable instructions further control the computing device to: read a post-process script stored on the internal network; and execute the post-process script with the supplemented record on the internal network. 15 . The computing device of claim 14 , wherein the post-process script streams the supplemented record to an event stream processing engine. 16 . The computing device of claim 13 , wherein each record includes a plurality of de-identified field values, wherein the de-identified field value is one of the plurality of de-identified field values. 17 . The computing device of claim 13 , wherein, before selecting the entity identifier field value, the computer-readable instructions further control the computing device to: receive a result type value, wherein the result type value indicates whether or not to retrieve the master record; when the received result type value indicates to retrieve the master record, the entity identifier field value is selected, the selected entity identifier field value is compared to the entity identifier field value of the plurality of records, and the de-identified field value included in the identified master record is selected; and when the received result type value does not indicat

Assignees

Inventors

Classifications

  • for separating internal from external traffic, e.g. firewalls · CPC title

  • H04L63/08Primary

    for authentication of entities (cryptographic mechanisms or cryptographic arrangements for entity authentication H04L9/32) · CPC title

  • to a single file or object, e.g. in a secure envelope, encrypted and accessed using a key, or with access control rules appended to the object itself · CPC title

  • by anonymising data, e.g. decorrelating personal data from the owner's identification · CPC title

  • G06F16/116Primary

    Details of conversion of file system types or formats · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2017223003A1 cover?
A first computing device connected to an internal network de-anonymizes data. A record including a surrogate key is received from a second computing device connected to an external network to the internal network. Each identity data record includes a second surrogate key, an entity identifier field value, a record type field value, and a de-identified field value. The second surrogate key uniqu…
Who is the assignee on this patent?
Sas Inst Inc
What technology area does this patent fall under?
Primary CPC classification H04L63/08. Mapped technology areas include Electricity.
When was this patent published?
Publication date Thu Aug 03 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).