Method and system for dynamic de-identification of data sets

US9887964B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9887964-B2
Application numberUS-201514694262-A
CountryUS
Kind codeB2
Filing dateApr 23, 2015
Priority dateApr 23, 2015
Publication dateFeb 6, 2018
Grant dateFeb 6, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for combining disparate data sets using dynamic de-identifiers includes: receiving a first data set from a first external entity, the data set including first data entries including attribute values, and wherein the first data set does not include any personally identifiable information; receiving a second data set from a second external entity, the data set including second data entries including attribute values, and wherein the second data set does not include any personally identifiable information; identifying a dynamic de-identifier (DDID) for each first data entry and second data entry; matching each second data entry to a first data entry based on a correspondence between the associated attribute values; identifying insights for at least one first data entry based on at least the attribute values included in the corresponding matched second data entry; and transmitting at least the identified insights to the first external entity.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for combining disparate data sets using dynamic de-identifiers, comprising: receiving, by a receiving device of a processing system, a first data set from a first entity external to the processing system, wherein the first data set includes a plurality of first data entries, each first data entry including a plurality of attribute values and an associated identification key, and wherein the first data set does not include any personally identifiable information; receiving, by the receiving device of the processing system, a second data set from a second entity external to the processing system, wherein the second data set includes a plurality of second data entries, each second data entry including a plurality of attribute values and an associated identification key, and wherein the second data set does not include any personally identifiable information; identifying a dynamic de-identifier (DDID) for each identification key included in each first data entry in the first data set and each second data entry in the second data set, wherein the DDID is one of: (i) based on the corresponding identification key, or (ii) equivalent to the corresponding identification key; matching, by a processing device of the processing system, each second data entry included the second data set to a first data entry included in the first data set based on a correspondence between the identified DDIDs corresponding to the respective associated identification keys and a measurement of commonality between the plurality of attribute values of each first data entry and the plurality of attribute values of each second data entry, wherein the matching is based on the measurement of commonality being higher than a threshold; identifying, by the processing device of the processing system, one or more insights for at least one first data entry included in the first data set based on at least the plurality of attribute values included in the corresponding matched second data entry; and transmitting, by a transmitting device of the processing system, at least the identified one or more insights to the first entity external to the processing system. 2. The method of claim 1 , wherein transmitting the identified one or more insights includes transmitting at least one of: the associated identification key and corresponding DDID for each of the at least one first data entry. 3. The method of claim 1 , wherein the identification of a DDID for each identification key is performed by the processing device of the processing system. 4. The method of claim 1 , wherein the identification of a DDID for each identification key is performed by a first processing device of a first computing device of the processing system, and the matching is performed by a second processing device of a second computing device of the processing system. 5. The method of claim 4 , wherein the second computing device does not receive or possess any identification keys. 6. The method of claim 4 , wherein the receiving device is included in the first computing device of the processing system. 7. The method of claim 4 , wherein the transmitting device is included in the first computing device of the processing system. 8. The method of claim 4 , wherein the first computing device does not possess any data associated with the matching step. 9. The method of claim 1 , wherein if the DDID is based on the corresponding identification key, the identification of a DDID for each identification key is performed upon receipt of the respective data set, and wherein the processing system does not possess any identification keys. 10. The method of claim 1 , wherein each second data entry included in the second data set is matched to a plurality of first data entries included in the first data set based on the correspondence between the identified DDIDs corresponding to the respective associated identification keys. 11. A method for combining disparate data sets using dynamic de-identifiers, comprising: receiving, by a receiving device of a processing system, a first data set from a first entity external to the processing system, wherein the first data set includes a plurality of first data entries, each first data entry including a plurality of attribute values, and wherein the first data set does not include any personally identifiable information; receiving, by the receiving device of the processing system, a second data set from a second entity external to the processing system, wherein the second data set includes a plurality of second data entries, each second data entry including a plurality of attribute values, and wherein the second data set does not include any personally identifiable information; identifying a dynamic de-identifier (DDID) for each first data entry in the first data set and each second data entry in the second data set; matching, by a processing device of the processing system, each second data entry included in the second data set to a first data entry included in the first data set based on a measurement of commonality between one or more of the plurality of attribute values included in each first respective data entry and one or more of the plurality of attribute values included in each second data entry, wherein the matching is based on the measurement of commonality being higher than a threshold; identifying, by the processing device of the processing system, one or more insights for at least one first data entry included in the first data set based on at least the plurality of attribute values included in the corresponding matched second data entry; and transmitting, by a transmitting device of the processing system, at least the identified one or more insights to the first entity external to the processing system. 12. The method of claim 11 , transmitting the identified one or more insights includes transmitting the DDID associated with each of the at least one first data entry. 13. The method of claim 11 , wherein the identification of a DDID for each first data entry and each second data entry is performed by the processing device of the processing system. 14. The method of claim 11 , wherein the identification of a DDID for each first data entry and each second data entry is performed by a first processing device of a first computing device of the processing system, and the matching is performed by a second processing device of a second computing device of the processing system. 15. The method of claim 14 , wherein the second computing device does not receive or possess any DDIDs. 16. The method of claim 14 , wherein the receiving device is included in the first computing device of the processing system. 17. The method of claim 14 , wherein the transmitting device is included in the first computing device of the processing system. 18. The method of claim 14 , wherein the first computing device does not possess any data associated with the matching step. 19. The method of claim 11 , wherein each second data entry included in the second data set is matched to a plurality of first data entries included in the first data set based on the correspondence between one or more of the plurality of attribute values included in each respective data entry. 20. The method of claim 11 , wherein each first data entry included in the first data set further includes an associated identification key. 21. The method of claim 20 , wherein the identified DDID for each first data entry in the first data set is based on the

Assignees

Inventors

Classifications

  • Anonymous communication, i.e. the party's identifiers are hidden from the other party or parties, e.g. using an anonymizer · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9887964B2 cover?
A method for combining disparate data sets using dynamic de-identifiers includes: receiving a first data set from a first external entity, the data set including first data entries including attribute values, and wherein the first data set does not include any personally identifiable information; receiving a second data set from a second external entity, the data set including second data entri…
Who is the assignee on this patent?
Mastercard International Inc
What technology area does this patent fall under?
Primary CPC classification H04L63/0421. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Feb 06 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).