Focused probabilistic entity resolution from multiple data sources

US12229154B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12229154-B2
Application numberUS-202318330746-A
CountryUS
Kind codeB2
Filing dateJun 7, 2023
Priority dateAug 19, 2016
Publication dateFeb 18, 2025
Grant dateFeb 18, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Various systems and methods are provided for performing soft entity resolution. A plurality of data objects are retrieved from a plurality of data stores to create aggregated data objects for one or more entities. One or more retrieved data objects may be associated with the same entity, based at least in part upon one or more attribute types and attribute values of the data objects. In response to a determination that the one or more of the retrieved data objects should be associated with the same entity, metadata is generated that associates the data objects with the entity, the metadata being stored separately from the data objects, such that the underlying data objects remain unchanged. In addition, one or more additional attributes may be determined for the entity, based upon the data objects associated with the entity.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method, comprising: creating an aggregated data object for an entity, wherein creating the aggregated data object for the entity comprises: determining a confidence value for an association between one or more of a plurality of data objects and the entity based at least in part upon one or more attribute values associated with the one or more data objects; and generating metadata associating the one or more data objects with the entity based at least in part on the confidence value, wherein the metadata is stored separately from the one or more data objects; and in response to receiving one or more events: calculating a score for the aggregated data object; and providing an alert associated with the score. 2. The computer-implemented method of claim 1 , wherein calculating the score for the aggregated data object further comprises: applying a scoring model to the aggregated data object, wherein applying the scoring model further comprises: extracting a plurality of scoring factors associated with the aggregated data object; and combining the plurality of scoring factors and a plurality of weights, wherein combining the plurality of scoring factors and the plurality of weights results in the score. 3. The computer-implemented method of claim 2 , further comprising: receiving training data comprising a plurality of entities and a plurality of events; and training the scoring model with the training data. 4. The computer-implemented method of claim 1 , wherein calculating the score for the aggregated data object further comprises: determining that an event corresponds to a certain type of event; and in response to determining that the event corresponds to the certain type of event, increasing a value of the score to an updated score. 5. The computer-implemented method of claim 1 , wherein calculating the score for the aggregated data object further comprises: determining that a plurality of events occurred within a period of time; and in response to determining that a plurality of events occurred within the period of time, increasing a value of the score to an updated score. 6. The computer-implemented method of claim 1 , wherein calculating the score for the aggregated data object further comprises: extracting a first attribute from the aggregated data object, wherein the first attribute indicates a first geographic location of the entity; extracting a second attribute from an event, wherein the second attribute indicates a second geographic location of the event; determining that the first geographic location is within a threshold distance of the second geographic location; and in response to determining that the first geographic location is within the threshold distance of the second geographic location, increasing a value of the score to an updated score. 7. The computer-implemented method of claim 1 , further comprising: causing presentation, in a user interface, of (i) a representation of the entity and (ii) the score. 8. The computer-implemented method of claim 7 , further comprising: causing presentation, in the user interface, of (i) representations for a plurality of entities and (ii) a corresponding score for each entity of the plurality of entities. 9. The computer-implemented method of claim 8 , further comprising: causing presentation, in the user interface, of a geographic location associated with each entity of the plurality of entities. 10. The computer-implemented method of claim 1 , wherein providing the alert further comprises: transmitting the alert to a user computing device. 11. A computer system, comprising: one or more computer readable storage mediums configured to store computer executable instructions; and one or more computer processors in communication with the one or more computer readable storage mediums and configured to execute the computer executable instructions to cause the computer system to: create an aggregated data object for an entity, wherein creating the aggregated data object for the entity comprises: determining a confidence value for an association between one or more of a plurality of data objects and the entity based at least in part upon one or more attribute values associated with the one or more data objects; and generating metadata associating the one or more data objects with the entity based at least in part on the confidence value, wherein the metadata is stored separately from the one or more data objects; and in response to receive one or more events: calculate a score for the aggregated data object; and provide an alert associated with the score. 12. The computer system of claim 11 , wherein calculating the score for the aggregated data object further comprises: applying a scoring model to the aggregated data object, wherein applying the scoring model further comprises: extracting a plurality of scoring factors associated with the aggregated data object; and combining the plurality of scoring factors and a plurality of weights, wherein combining the plurality of scoring factors and the plurality of weights results in the score. 13. The computer system of claim 12 , wherein the one or more computer processors are configured to execute the computer executable instructions to further cause the computer system to: receive training data comprising a plurality of entities and a plurality of events; and train the scoring model with the training data. 14. The computer system of claim 11 , wherein calculating the score for the aggregated data object further comprises: determining that an event corresponds to a certain type of event; and in response to determining that the event corresponds to the certain type of event, increasing a value of the score to an updated score. 15. The computer system of claim 11 , wherein calculating the score for the aggregated data object further comprises: determining that a period of time has passed following an event; and in response to determining that the period of time has passed following the event, decreasing a value of the score to an updated score. 16. The computer system of claim 15 , wherein decreasing the value of the score further comprises: decreasing a weight associated with the event. 17. The computer system of claim 11 , wherein calculating the score for the aggregated data object further comprises: extracting a first attribute from the aggregated data object, wherein the first attribute indicates a first geographic location of the entity; extracting a second attribute from an event, wherein the second attribute indicates a second geographic location of the event; determining that the first geographic location is within a threshold distance of the second geographic location; and in response to determining that the first geographic location is within the threshold distance of the second geographic location, increasing a value of the score to an updated score. 18. The computer system of claim 11 , wherein the one or more computer processors are configured to execute the computer executable instructions to further cause the computer system to: cause presentation, in a user interface, of (i) a representation of the entity and (ii) the score. 19. The computer system of claim 18 , wherein the one or more computer processors are configured to execute the computer executable instructions to further cause the computer system to: cause presentation, in the user interface, of (i) representations for a plurality of entitie

Assignees

Inventors

Classifications

  • Filtering based on additional data, e.g. user or group profiles (filtering in web context G06F16/9535, G06F16/9536) · CPC title

  • Document management systems · CPC title

  • Query execution · CPC title

  • Selection of displayed objects or displayed text elements (G06F3/0482 takes precedence) · CPC title

  • Temporal data queries · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12229154B2 cover?
Various systems and methods are provided for performing soft entity resolution. A plurality of data objects are retrieved from a plurality of data stores to create aggregated data objects for one or more entities. One or more retrieved data objects may be associated with the same entity, based at least in part upon one or more attribute types and attribute values of the data objects. In respons…
Who is the assignee on this patent?
Palantir Technologies Inc
What technology area does this patent fall under?
Primary CPC classification G06F16/24578. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 18 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).