System, method, and recording medium for data mining between private and public domains

US2017364595A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2017364595-A1
Application numberUS-201615187191-A
CountryUS
Kind codeA1
Filing dateJun 20, 2016
Priority dateJun 20, 2016
Publication dateDec 21, 2017
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A data mining method, system, and non-transitory computer readable medium, include defining a set of filter constraints as a filter function for clustering users' private records of data of a private domain, obtaining a set of data from a public domain by applying the filter function to users' public records of data of the public domain, selecting a subset of the users' public records of data that is common with the users' private records of data, and performing data mining on the selected subset of the users' public records of data in combination with the users' private records of data to match a user of the private domain to public records of the user of the private domain.

First claim

Opening claim text (preview).

What is claimed is: 1 . A data mining method, comprising: defining a set of filter constraints as a filter function for clustering users' private records of data of a private domain; obtaining a set of data from a public domain by applying the filter function to users' public records of data of the public domain; selecting a subset of the users' public records of data that is common with the users' private records of data; and performing data mining on the selected subset of the users' public records of data in combination with the users' private records of data to match a user of the private domain to public records of the user of the private domain. 2 . The method of claim 1 , wherein the defining defines the filter function such that a ratio of the obtained users' public records of data to the users' private records of data is less than 1.05. 3 . The method of claim 1 , wherein the defining defines the filter function such that a ratio of the obtained users' public records of data to the users' private records of data is based on a network limitation between the private domain and the public domain. 4 . The method of claim 1 , wherein the defining defines the filter function such that the obtaining obtains a number of the users' public records less a predetermined threshold size of obtained data divided by a data size of each of the users' public records. 5 . The method of claim 1 , wherein the defining further negotiates the filter function with the public domain such that a size of the users' public records obtained by the obtaining is less than a threshold value. 6 . The method of claim 1 , wherein the obtained set of data from the public domain includes a greater number of users' records than a number of users' private records of data clustered in the private domain. 7 . The method of claim 1 , wherein an identity of the users corresponding to the users' private records of data of the private domain is unobtainable by the public domain. 8 . The method of claim 1 , wherein the defining further negotiates the filter function using an obfuscation function such that the set of filter constraints is greater than a first set of filter constraints that overlaps with all of users' private records of data. 9 . A data mining system, comprising: a processor; and a memory, the memory storing instructions to cause the processor to: define a set of filter constraints as a filter function for clustering users' private records of data of a private domain; obtain a set of data from a public domain by applying the filter function to users' public records of data of the public domain; select a subset of the users public records of data that is common with the users' private records of data; and perform data mining on the selected subset of the users' public records of data in combination with the users' private records of data to match a user of the private domain to public records of the user of the private domain. 10 . The system of claim 9 , wherein the defining defines the filter function such that a ratio of the obtained users' public records of data to the users' private records of data is less than 1.05. 11 . The system of claim 9 , wherein the defining defines the filter function such that a ratio of the obtained users' public records of data to the users' private records of data is based on a network limitation between the private domain and the public domain. 12 . The system of claim 9 , wherein the defining defines the filter function such that the obtaining obtains a number of the users' public records less a predetermined threshold size of obtained data divided by a data size of each of the users' public records. 13 . The system of claim 9 , wherein the defining further negotiates the filter function with the public domain such that a size of the users' public records obtained by the obtaining is less than a threshold value. 14 . The system of claim 9 , wherein the obtained set of data from the public domain includes a greater number of users' records than a number of users' private records of data clustered in the private domain. 15 . The system of claim 9 , wherein an identity of the users corresponding to the users' private records of data of the private domain is unobtainable by the public domain. 16 . The system of claim 9 , wherein the defining further negotiates the filter function using an obfuscation function such that the set of filter constraints is greater than a first set of filter constraints that overlaps with all of users' private records of data. 17 . A non-transitory computer-readable recording medium recording a data mining program, the program causing a computer to perform: defining a set of filter constraints as a filter function for clustering users' private records of data of a private domain; obtaining a set of data from a public domain by applying the filter function to users' public records of data of the public domain; selecting a subset of the users public records of data that is common with the users' private records of data; and performing data mining on the selected subset of the users public records of data in combination with the users private records of data to match a user of the private domain to public records of the user of the private domain. 18 . The non-transitory computer-readable recording medium of claim 17 , wherein the defining defines the filter function such that a ratio of the obtained users' public records of data to the users' private records of data is less than 1.05. 19 . The non-transitory computer-readable recording medium of claim 17 , wherein the defining defines the filter function such that a ratio of the obtained users' public records of data to the users' private records of data is based on a network limitation between the private domain and the public domain. 20 . The non-transitory computer-readable recording medium of claim 17 , wherein the defining defines the filter function such that the obtaining obtains a number of the users' public records less a predetermined threshold size of obtained data divided by a data size of each of the users' public records.

Assignees

Inventors

Classifications

  • Integrating or interfacing systems involving database management systems · CPC title

  • Query processing support for facilitating data mining operations in structured databases · CPC title

  • Search customisation based on user profiles and personalisation · CPC title

  • G06F16/906Primary

    Clustering; Classification · CPC title

  • Physics · mapped topic

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2017364595A1 cover?
A data mining method, system, and non-transitory computer readable medium, include defining a set of filter constraints as a filter function for clustering users' private records of data of a private domain, obtaining a set of data from a public domain by applying the filter function to users' public records of data of the public domain, selecting a subset of the users' public records of data t…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F16/2465. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Dec 21 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).