Identifying users' identity through tracking common activity

US10715612B2 · US · B2

Patent metadata
Publication number: US-10715612-B2
Application number: US-201514854726-A
Country: US
Kind code: B2
Filing date: Sep 15, 2015
Priority date: Sep 15, 2015
Publication date: Jul 14, 2020
Grant date: Jul 14, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods for identifying users according to their activity are disclosed. The identification of a user includes accessing a user activity log having a plurality of identifiers and corresponding activity information for each identifier, determining identifiers having correlating activity information, and assigning identifiers having correlating activity information to a common user.
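The idea in the abstract can be illustrated with a minimal sketch. It assumes the activity log is a list of (identifier, activity) pairs and treats two identifiers as "correlating" when they share at least `min_shared` activities. The function name `assign_common_users`, the `min_shared` threshold, the use of union-find for merging, and the sample log are illustrative assumptions, not details taken from the patent.

```python
from collections import defaultdict
from itertools import combinations


def assign_common_users(activity_log, min_shared=2):
    """Group identifiers whose activity overlaps in at least `min_shared` items.

    Hypothetical helper: the overlap rule and threshold are assumptions.
    """
    # Collect the set of activities seen for each identifier.
    activities = defaultdict(set)
    for identifier, activity in activity_log:
        activities[identifier].add(activity)

    # Union-find parent table: every identifier starts as its own user.
    parent = {identifier: identifier for identifier in activities}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    # Merge identifiers whose activity sets share enough items.
    for a, b in combinations(sorted(activities), 2):
        if len(activities[a] & activities[b]) >= min_shared:
            parent[find(a)] = find(b)

    # Collect the identifiers assigned to each common user.
    users = defaultdict(set)
    for identifier in activities:
        users[find(identifier)].add(identifier)
    return sorted(sorted(group) for group in users.values())


# Made-up sample log: a cookie and a device that visit the same two sites.
log = [
    ("cookie-1", "nicheblog.example"),
    ("cookie-1", "localclub.example"),
    ("device-9", "nicheblog.example"),
    ("device-9", "localclub.example"),
    ("cookie-2", "nicheblog.example"),
]
groups = assign_common_users(log)
print(groups)
```

Here `cookie-1` and `device-9` share two sites and are assigned to one user, while `cookie-2` shares only one site with each and stays separate.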

First claim

Opening claim text (preview).

The invention claimed is:

1. A method, comprising: accessing a user activity log comprising a plurality of identifiers and corresponding activity information for each identifier, the user activity log based on activities of mobile devices and Internet accounts of one or more users; filtering the user activity log to exclude popular activity information associated with activities exceeding a popularity threshold and generate a filtered user activity log; building an inverted index of the filtered user activity log, the inverted index having at least some activity information remaining in the user activity log after the filtering as a key and sets of identifiers associated with the at least some activity information as data; enumerating possible combinations of identifiers to generate a plurality of potential user groups, wherein each potential user group (i) includes a plurality of users, (ii) is associated with a common set of one or more websites, and (iii) is generated based upon a determination that each of the plurality of users of the potential user group are determined to have visited the common set of one or more websites associated with the potential user group, wherein generating the plurality of potential user groups comprises: generating a first potential user group, including a first plurality of users, in association with a first common set of one or more websites based upon a determination that the first plurality of users visited the first common set of one or more websites, wherein generating the first potential user group comprises grouping a first user of the first plurality of users and a second user of the first plurality of users into the first potential user group based upon (i) a determination that the first user visited a first website of the first common set of one or more websites and a second website of the first common set of one or more websites and (ii) a determination that the second user visited the first website and the second website; and generating a second potential user group, including a second plurality of users, in association with a second common set of one or more websites based upon a determination that the second plurality of users visited the second common set of one or more websites, wherein generating the second potential user group comprises grouping a third user of the second plurality of users and a fourth user of the second plurality of users into the second potential user group based upon (i) a determination that the third user visited a third website of the second common set of one or more websites and a fourth website of the second common set of one or more websites and (ii) a determination that the fourth user visited the third website and the fourth website; scoring each potential user group of the plurality of potential user groups based on a quantity of websites in the common set of one or more websites for the potential user group; determining that users of at least one potential user group of the plurality of potential user groups are associated with a common identity of a user based upon the scoring; selecting advertisements to be targeted to the user; and controlling transmission of the advertisements to the user.

2. The method of claim 1 wherein the plurality of identifiers of the user activity log comprise at least one of hardware identifiers, cookies, or device profiles.

3. The method of claim 1, wherein the corresponding activity information of the user activity log comprises at least one of internet browsing activity, online shopping activity, or internet app usage.

4. The method of claim 1, wherein enumerating the possible combinations of identifiers comprises: enumerating a limited number of user combinations; grouping the user combinations according to common sites; and merging two or more user combinations having one or more common sites.

5. The method of claim 1, wherein filtering the user activity log data comprises ranking at least some second activity information according to inverse popularity and selecting internet activities having a ranking higher than a threshold.

6. The method of claim 1, wherein scoring each potential user group of the plurality of potential user groups comprises: scoring the first potential user group with a first score based upon a first quantity of websites in the first common set of one or more websites that the first plurality of users visited; and scoring the second potential user group with a second score based upon a second quantity of websites in the second common set of one or more websites that the second plurality of users visited.

7. The method of claim 1, wherein scoring each potential user group of the plurality of potential user groups comprises scoring each potential user group based upon are scored according to a sum of an inverse document frequency for each common website of the common set of one or more websites associated with the potential user group.

8. A system for identifying a group of activities having a common user, the system comprising: a processor configured to implement computer instructions; and memory storing computer executable instructions, that when implemented by the processor, cause the system to perform functions comprising: accessing data comprising a plurality of identifiers and corresponding activity information for each identifier; filtering the data to exclude popular activity information associated with activities exceeding a popularity threshold and generate filtered data; building an inverted index of the filtered data, the inverted index having at least some activity information remaining in the data after the filtering as a key and sets of identifiers associated with the at least some activity information as data; enumerating possible combinations of identifiers to generate a plurality of potential user groups, wherein each potential user group (i) includes a plurality of users, (ii) is associated with a common set of one or more websites, and (iii) is generated based upon a determination that each of the plurality of users of the potential user group are determined to have visited the common set of one or more websites associated with the potential user group, wherein generating the plurality of potential user groups comprises: generating a first potential user group, including a first plurality of users, in association with a first common set of one or more websites based upon a determination that the first plurality of users visited the first common set of one or more websites, wherein generating the first potential user group comprises grouping a first user of the first plurality of users and a second user of the first plurality of users into the first potential user group based upon (i) a determination that the first user visited a first website of the first common set of one or more websites and a second website of the first common set of one or more web sites and (ii) a determination that the second user visited the first website and the second website; and generating a second potential user group, including a second plurality of users, in association with a second common set of one or more websites based upon a determination that the second plurality of users visited the second common set of one or more websites; scoring each potential user group of the plurality of potential user groups based on a quantity of websites in the common set of one or more websites for the potential user group; determining that users of at least one potential user group of the plurality of potential user groups are associated with a common identity of a user based upon the scoring; selecting content to be targeted to the user; and controlling transmission of the content to the user.
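The main steps recited in the claims (popularity filtering, an inverted index keyed by activity, enumeration of identifier combinations, and scoring by the number of common websites or by a sum of inverse document frequencies) can be sketched as follows. This is an illustration under stated assumptions rather than the patented implementation: combinations are restricted to pairs, popularity is measured as the number of distinct identifiers per site, the IDF formula is the common log(N / df) form, and every function name, threshold, and log entry is hypothetical. The advertisement selection and transmission steps are omitted.

```python
import math
from collections import defaultdict
from itertools import combinations


def filter_popular(log, popularity_threshold):
    """Drop sites visited by more than `popularity_threshold` identifiers."""
    visitors = defaultdict(set)
    for identifier, site in log:
        visitors[site].add(identifier)
    return [(i, s) for i, s in log if len(visitors[s]) <= popularity_threshold]


def build_inverted_index(log):
    """Map each site (key) to the set of identifiers that visited it (data)."""
    index = defaultdict(set)
    for identifier, site in log:
        index[site].add(identifier)
    return index


def enumerate_groups(index):
    """Map each identifier pair to the set of sites both identifiers visited.

    Assumption: only pairs are enumerated, to keep the sketch small.
    """
    groups = defaultdict(set)
    for site, identifiers in index.items():
        for pair in combinations(sorted(identifiers), 2):
            groups[pair].add(site)
    return groups


def score_by_count(common_sites):
    """Score a group by the quantity of websites in its common set."""
    return len(common_sites)


def score_by_idf(common_sites, index, total_identifiers):
    """Score a group by summing log(N / df) over its common sites (assumed form)."""
    return sum(math.log(total_identifiers / len(index[s])) for s in common_sites)


# Made-up log: one popular portal plus a few niche sites.
log = [
    ("id-a", "portal.example"), ("id-a", "hobby.example"), ("id-a", "forum.example"),
    ("id-b", "portal.example"), ("id-b", "hobby.example"), ("id-b", "forum.example"),
    ("id-c", "portal.example"), ("id-c", "news.example"),
    ("id-d", "portal.example"), ("id-d", "news.example"),
]

filtered = filter_popular(log, popularity_threshold=3)
index = build_inverted_index(filtered)
groups = enumerate_groups(index)

count_scores = {pair: score_by_count(sites) for pair, sites in groups.items()}
total = len({identifier for identifier, _ in filtered})
idf_scores = {pair: score_by_idf(sites, index, total) for pair, sites in groups.items()}

# Assumed decision rule: pairs sharing at least two niche sites are one user.
matched = sorted(pair for pair, score in count_scores.items() if score >= 2)
print(matched)
```

In this sample the portal is visited by all four identifiers and is filtered out, so it cannot link unrelated identifiers. Only `id-a` and `id-b` share two of the remaining sites and are treated as one user under the assumed rule.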

Assignees

Yahoo Holdings Inc, Oath Inc

Inventors

Classifications

  • H04L67/535 · Primary

    Tracking the activity of the user (network monitoring arrangements H04L43/00; recording of computer activity G06F11/34) · CPC title

  • G06Q30/02 · Primary

    Marketing; Price estimation or determination; Fundraising · CPC title

  • H04L67/22 · Primary

    Electricity · mapped topic

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10715612B2 cover?
Systems and methods for identifying users according to their activity are disclosed. The identification of a user includes accessing a user activity log having a plurality of identifiers and corresponding activity information for each identifier, determining identifiers having correlating activity information, and assigning identifiers having correlating activity information to a common user.
Who is the assignee on this patent?
Yahoo Holdings Inc, Oath Inc
What technology area does this patent fall under?
Primary CPC classification H04L67/535. Mapped technology areas include Electricity.
When was this patent published?
Published Jul 14, 2020 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).