System and method for aggregating data from a plurality of data sources

US10198515B1 · US · B1

Patent metadata
FieldValue
Publication numberUS-10198515-B1
Application numberUS-201514816599-A
CountryUS
Kind codeB1
Filing dateAug 3, 2015
Priority dateDec 10, 2013
Publication dateFeb 5, 2019
Grant dateFeb 5, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

According to certain aspects, a computer system may be configured to aggregate and analyze data from a plurality of data sources. The system may obtain data from a plurality of data sources, each of which can include various types of data, including email data, system logon data, system logoff data, badge swipe data, employee data, job processing data, etc. associated with a plurality of individuals. The system may also transform data from each of the plurality of data sources into a format that is compatible for combining the data from the plurality of data sources. The system can resolve the data from each of the plurality of data sources to unique individuals of the plurality of individuals. The system can also determine an efficiency indicator based at least in part on a comparison of individuals of the unique individuals that have at least one common characteristic.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer system comprising: a hardware computer processor configured to execute code to cause the computer system to: access first data from a first data source, the first data source comprising electronic communication data; determine, for each of a plurality of electronic communications of the first data, an individual associated with the electronic communication; generate, for each individual, a summary of electronic communications associated with the individual; obtain second data from a second data source, the second data source comprising one or more logs of work activities, wherein the second data source is different from the first data source; determine, for each of a plurality of work activity logs of the second data, an individual associated with the work activity log; generate, for each individual, a second summary of work activity logs associated with the individual, wherein at least the summary of electronic communications and the second summary of work activity logs are each accessible by the computer system; determine, a first group of unique individuals each sharing a first common characteristic indicated in the second data and a second group of unique individuals each sharing a second common characteristic indicated in the second data; generate a first intermediate output aggregating summaries of electronic communications of individuals in the first group, wherein the intermediate output comprises a reduced version of at least some of the summaries of electronic communications; generate a second intermediate output aggregating summaries of electronic communications of individuals in the second group; determine a first efficiency indicator for the first group based at least in part on comparison of the first intermediate output associated with the first group and the second intermediate output associated with the second group, generate user interface data for displaying a user interface on a user computing device, the user interface including an indication of the first group, an indication of the second group, and the determined first efficiency indicator; receive, via input from the user interface, selection of a comparison characteristic; determine, a third group of unique individuals each sharing the comparison characteristic; generate a third intermediate output aggregating summaries of electronic communications of individuals in the third group; determine a second efficiency indicator for the first group based at least in part on comparison of the first intermediate output associated with the first group and the third intermediate output associated with the third group, and update the user interface data so that the user interface includes an indication of the first group, an indication of the third group and the determined second efficiency indicator. 2. The computer system of claim 1 , wherein the hardware computer processor is further configured to execute the code in order to cause the computer system to: access expected format information indicating an expected format of data from the first or second data source; and detect an inconsistency in a format of the data from the first or second data source as compared to the expected format. 3. The computer system of claim 2 , wherein the hardware computer processor is further configured to execute the code in order to cause the computer system to: in response to the detection of the inconsistency, obtain the first or second data from the respective first or second data source such that the first or second data no longer has the inconsistency. 4. The computer system of claim 2 , wherein the hardware computer processor is further configured to execute the code in order to cause the computer system to: display, in the user interface, an indicator of the inconsistency. 5. The computer system of claim 1 , wherein the hardware computer processor is further configured to execute the code in order to cause the computer system to: determine a first file size for the first or second data; access a previous file size for a previous version of the first or second data; detect a discrepancy in size between the previous file size and the first file size; and in response to the detection of the discrepancy, obtain the first or second data from the respective first or second data source such that the first or second data no longer has the discrepancy. 6. The computer system of claim 1 , wherein the first common characteristic comprises: a badge swipe time within a timespan; a software version; or a number of email messages sent or received. 7. A computer implement method comprising: accessing first data from a first data source, the first data source comprising electronic communication data; determining, for each of a plurality of electronic communications of the first data, an individual associated with the electronic communication; generating, for each individual, a summary of electronic communications associated with the individual; obtaining second data from a second data source, the second data source comprising one or more logs of work activities, wherein the second data source is different from the first data source; determining, for each of a plurality of work activity logs of the second data, an individual associated with the work activity log; generate, for each individual, a second summary of work activity logs associated with the individual, wherein at least the summary of electronic communications and the second summary of work activity logs are each accessible by the computer system; determining, a first group of unique individuals each sharing a first common characteristic indicated in the second data and a second group of unique individuals each sharing a second common characteristic indicated in the second data: generating a first intermediate output aggregating summaries of electronic communications of individuals in the first group, wherein the intermediate output comprises a reduced version of at least some of the summaries of electronic communications; generating a second intermediate output aggregating summaries of electronic communications of individuals in the second group; determining a first efficiency indicator for the first group based at least in part on comparison of the first intermediate output associated with the first group and the second intermediate output associated with the second group, generating user interface data for displaying a user interface on a user computing device, the user interface including an indication of the first group, an indication of the second group, and the determined first efficiency indicator; receiving, via input from the user interface, selection of a comparison characteristic; determining, a third group of unique individuals each sharing the comparison characteristic; generating a third intermediate output aggregating summaries of electronic communications of individuals in the third group; determining a second efficiency indicator for the first group based at least in part on comparison of the first intermediate output associated with the first group and the third intermediate output associated with the third group, and updating the user interface data so that the user interface includes an indication of the first group, an indication of the third group and the determined second efficiency indicator. 8. A non-transitory computer readable medium storing software instructions configured to cause a computing system to: access first data from a first data source, the first data source comprising electronic communication data; determine, for each of a plurality of electronic communications of the first data, an individual associated with the electronic communication; generate, for each

Assignees

Inventors

Classifications

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10198515B1 cover?
According to certain aspects, a computer system may be configured to aggregate and analyze data from a plurality of data sources. The system may obtain data from a plurality of data sources, each of which can include various types of data, including email data, system logon data, system logoff data, badge swipe data, employee data, job processing data, etc. associated with a plurality of indivi…
Who is the assignee on this patent?
Palantir Technologies Inc
What technology area does this patent fall under?
Primary CPC classification G06F17/30867. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 05 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).