Systems and methods for targeted data discovery

US11797528B2 · US · B2

Patent metadata
Field               Value
Publication number  US-11797528-B2
Application number  US-202117370650-A
Country             US
Kind code           B2
Filing date         Jul 8, 2021
Priority date       Jul 8, 2020
Publication date    Oct 24, 2023
Grant date          Oct 24, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Various embodiments provide methods, apparatus, systems, computing devices, computing entities, and/or the like for identifying targeted data for a data subject across a plurality of data objects in a data source. In accordance with one embodiment, a method is provided comprising: receiving a request to identify targeted data for a data subject; identifying a first data object using metadata for a data source that identifies the first data object as associated with a first targeted data type for a data portion from the request; identifying a first data field from a graph data structure of the first data object that identifies the first data field as used for storing data having the first targeted data type; and querying the first data object based on the first data field and the data for the first targeted data type to identify a first targeted data portion for the data subject.
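The lookup chain in the abstract (metadata selects a data object, the object's graph selects a data field, the field drives the query) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and does not come from the patent: the in-memory dictionaries standing in for a data source, the names `find_targeted_data`, `demo_metadata`, and `demo_objects`, and the exact-match comparison used as the "query".

```python
def find_targeted_data(request, metadata, objects):
    """Return (object name, row) pairs that match the data supplied in the request.

    request:  {targeted data type: value}, e.g. {"email": "ann@example.com"}
    metadata: {targeted data type: [names of data objects holding that type]}
    objects:  {object name: {"graph": {data type: field name}, "rows": [...]}}
    All three shapes are hypothetical stand-ins for a real data source.
    """
    matches = []
    for data_type, value in request.items():
        # Step 1: metadata identifies the data objects associated with the type.
        for name in metadata.get(data_type, []):
            obj = objects[name]
            # Step 2: the object's graph maps the data type to a data field.
            field_name = obj["graph"][data_type]
            # Step 3: query the object on that field with the requested value.
            for row in obj["rows"]:
                if row.get(field_name) == value:
                    matches.append((name, row))
    return matches


# Hypothetical example data.
demo_metadata = {"email": ["users"]}
demo_objects = {
    "users": {
        "graph": {"email": "email_addr"},
        "rows": [
            {"email_addr": "ann@example.com", "city": "Leeds"},
            {"email_addr": "bob@example.com", "city": "York"},
        ],
    }
}
demo_result = find_targeted_data(
    {"email": "ann@example.com"}, demo_metadata, demo_objects
)
```

In this toy setup the graph is reduced to a type-to-field dictionary; a real graph data structure would also carry the relationships between data types that the claims rely on.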

First claim

Opening claim text (preview).

What is claimed is:

1. A method comprising:

    receiving, by computing hardware, a request to identify targeted data for a data subject;

    determining, by the computing hardware accessing metadata for a data source, that a first targeted data type is eligible for querying the targeted data from the data source, wherein the first targeted data type is associated with a data portion from the request the metadata comprising data types corresponding to one or more data objects in the data source and a set of known queryable data types for the data source;

    identifying, by the computing hardware and based on the first targeted data type, a first data object from a plurality of data objects using the metadata for the data source, wherein each data object of the plurality of data objects is used in storing data for the data source and the metadata identifies the first data object as associated with the first targeted data type;

    identifying, by the computing hardware, a first data field from a first graph data structure of the first data object by using the first targeted data type as a start node to traverse the first graph data structure, wherein the first graph data structure provides a graphical representation mapping a first plurality of targeted data types for the targeted data to a first plurality of data fields used for storing first portions of the data having the first plurality of targeted data types in the first data object, the first plurality of targeted data types comprises the first targeted data type, and the first graph data structure identifies that the first data field from the first plurality of data fields is used for storing a first particular portion of the first portions of the data having the first targeted data type;

    querying, by the computing hardware, the first data object based on the first data field and the first particular portion for the first targeted data type to identify a first targeted data portion for the data subject;

    determining, by the computing hardware traversing the first graph data structure from the start node associated with the first targeted data type to a second node and a third node of the first graph data structure, that the first targeted data portion is associated with a second targeted data type corresponding to the second node and a third targeted data type corresponding to the third node;

    determining, by the computing hardware accessing the metadata for the data source and traversing one or more graph data structures associated with the data source comprising the first graph data structure, that the second targeted data type is eligible for querying the targeted data from the data source and the third targeted data type is not eligible for querying the targeted data from the data source;

    identifying, by the computing hardware and based on the second targeted data type and not on the third targeted data type, a second data object from the plurality of data objects using the metadata for the data source, wherein the metadata identifies the second data object as associated with the second targeted data type;

    identifying, by the computing hardware, a second data field from a second graph data structure of the second data object by using the second targeted data type as a start node to traverse the second graph data structure, wherein the second graph data structure provides a graphical representation mapping a second plurality of targeted data types to a second plurality of data fields used for storing second portions of the data having the second plurality of targeted data types in the second data object, the second plurality of targeted data types comprises the second targeted data type, and the second graph data structure identifies that the second data field from the second plurality of data fields is used for storing a second particular portion of the second portions of the data having the second targeted data type;

    querying, by the computing hardware, the second data object based on the second data field and the first targeted data portion associated with the second targeted data type to identify a second targeted data portion for the data subject; and

    performing a targeted data action based on the first targeted data portion or the second targeted data portion.

2. The method of claim 1 further comprising:

    determining, by the computing hardware, that the first targeted data portion is associated with a fourth targeted data type for a second data source;

    identifying, by the computing hardware from second metadata for the second data source, that the fourth targeted data type is eligible for querying the targeted data from the second data source;

    identifying, by the computing hardware and based on the fourth targeted data type, a third data object from the second data source using the second metadata for the second data source, wherein the second metadata identifies the third data object as associated with the fourth targeted data type;

    identifying, by the computing hardware, a third data field from a third graph data structure of the third data object by using the fourth targeted data type as a start node to traverse the third graph data structure, wherein the third graph data structure provides a graphical representation mapping a third plurality of targeted data types to a third plurality of data fields used for storing third portions of the data having the third plurality of targeted data types in the third data object, and the third plurality of targeted data types comprises the fourth targeted data type, and the third graph data structure identifies that the third data field from the third plurality of data fields is used for storing a third particular portion of the third portions of the data having the fourth targeted data type; and

    querying, by the computing hardware, the third data object based on the third data field and the first targeted data portion associated with the fourth targeted data type to identify a third targeted data portion for the data subject, wherein the targeted data action is based on at least one of the first targeted data portion, the second targeted data portion, or the third targeted data portion.

3. The method of claim 1 further comprising generating, by the computing hardware, the metadata for the data source, wherein generating the metadata comprises:

    scanning the data source to identify a plurality of targeted data types found in the data source, the plurality of targeted data types including the first plurality of targeted data types and the second plurality of targeted data types;

    performing a determination that the first targeted data type and the second targeted data type is eligible to query the targeted data from the data source and the third targeted data type is not eligible to query the targeted data from the data source; and

    modifying, in response to determining that the second targeted data type is eligible for querying the targeted data from the data source and the third targeted data type is not eligible for querying the targeted data from the data source, the metadata of the data source to include the first targeted data type and the second targeted data type within the set of known queryable data types for the data source and exclude the third targeted data type from the set of known queryable data types for the data source.

4. The method of claim 3, wherein scanning the data source to identify the plurality of targeted data types found in the data source comprises:

    identifying a plurality of data fields used for storing the targeted data in the plurality of data objects for the data source;

    processing combinations of data fields of the plurality of data fields using a machine learning model to generate an indication that each combination of the combinations of data fields is used for storing data associated with a common targete
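Claims 1 and 3 together describe a chained search: metadata is generated by scanning the data source and marking which targeted data types are queryable, and each matching record then seeds further queries through the other data types in its graph, skipping types that are not eligible. The sketch below is one possible reading of that flow, not the patented implementation. The dictionary layout, the names `build_metadata` and `discover`, and the caller-supplied `is_eligible` rule are all hypothetical; the claim text shown here does not state how eligibility is decided.

```python
from collections import deque


def build_metadata(objects, is_eligible):
    """Scan each object's graph, record where each data type lives (claim 3)."""
    objects_by_type = {}
    for name, obj in objects.items():
        for data_type in obj["graph"]:
            objects_by_type.setdefault(data_type, []).append(name)
    # Assumption: eligibility is decided by a caller-supplied predicate.
    queryable = {t for t in objects_by_type if is_eligible(t)}
    return {"objects_by_type": objects_by_type, "queryable": queryable}


def discover(request, objects, metadata):
    """Follow eligible data types from record to record (claim 1)."""
    queue = deque(request.items())
    seen, hits, found = set(), set(), []
    while queue:
        data_type, value = queue.popleft()
        # Skip types outside the known queryable set and repeated lookups.
        if data_type not in metadata["queryable"] or (data_type, value) in seen:
            continue
        seen.add((data_type, value))
        for name in metadata["objects_by_type"].get(data_type, []):
            obj = objects[name]
            field_name = obj["graph"][data_type]  # start node -> data field
            for idx, row in enumerate(obj["rows"]):
                if row.get(field_name) != value:
                    continue
                if (name, idx) not in hits:
                    hits.add((name, idx))
                    found.append((name, row))
                # Traverse to the other nodes of this object's graph and
                # queue their values as candidate follow-up queries.
                for other_type, other_field in obj["graph"].items():
                    if other_type != data_type and other_field in row:
                        queue.append((other_type, row[other_field]))
    return found


# Hypothetical data source with two data objects.
source_objects = {
    "users": {
        "graph": {"email": "email_addr", "customer_id": "cust_id",
                  "first_name": "fname"},
        "rows": [
            {"email_addr": "ann@example.com", "cust_id": "C1", "fname": "Ann"},
            {"email_addr": "other@example.com", "cust_id": "C2", "fname": "Ann"},
        ],
    },
    "orders": {
        "graph": {"customer_id": "customer"},
        "rows": [
            {"customer": "C1", "item": "book"},
            {"customer": "C2", "item": "pen"},
        ],
    },
}
# Assumed rule: a first name is too ambiguous to query on.
source_metadata = build_metadata(source_objects, lambda t: t != "first_name")
found_records = discover({"email": "ann@example.com"}, source_objects,
                         source_metadata)
```

In this example the email match yields a customer ID and a first name. Only the customer ID is followed into `orders`; if `first_name` were treated as queryable, the second "Ann" row, which belongs to a different person, would be pulled in as well. That is one plausible motivation for the eligibility check, offered here as an interpretation rather than something the claims state.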

Assignees

OneTrust LLC

Inventors

Classifications

  • Interactive query statement specification based on a database schema

  • for performance assessment

  • Change logging, detection, and notification (replication G06F16/27)

  • with adaptation to user needs

  • Data format conversion from or to a database

Patent family

Related publications grouped by family.

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11797528B2 cover?
Various embodiments provide methods, apparatus, systems, computing devices, computing entities, and/or the like for identifying targeted data for a data subject across a plurality of data objects in a data source. In accordance with one embodiment, a method is provided comprising: receiving a request to identify targeted data for a data subject; identifying a first data object using metadata fo…
Who is the assignee on this patent?
OneTrust LLC
What technology area does this patent fall under?
Primary CPC classification G06F16/2423. Mapped technology areas include Physics.
When was this patent published?
Publication date Oct 24, 2023 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).