Machine Learning and Security Classification of User Accounts

US2022051144A1 · US · A1

Patent metadata
Field               Value
Publication number  US-2022051144-A1
Application number  US-202117515407-A
Country             US
Kind code           A1
Filing date         Oct 30, 2021
Priority date       Jul 2, 2018
Publication date    Feb 17, 2022
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Machine learning techniques are used in combination with graph data structures to perform automated classification of accounts. Graphs may be constructed using a seed node and then expanded outward to second-degree nodes and third-degree nodes that are connected to a seed user account node via direct interaction between the accounts. Characterization information regarding the interaction between accounts can be stored in the graph (e.g., quantity of interactions, types of interactions) as well as other metrics and metadata. A classifier, using random forest or another technique, may be trained using a number of different graphs that can then be used to reach a determination as to whether a user account falls into one particular category or another. These techniques can identify accounts that may be violating terms of service, committing a security violation, and/or performing illegal actions in a way that is not ascertainable from human analysis.
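The graph expansion described in the abstract can be sketched as follows. This is a minimal illustration, not the patent's implementation: the `(sender, receiver, amount, kind)` transaction tuple, the function name `build_seed_graph`, and the edge attribute names `count`, `total`, and `kinds` are all assumptions made for the example. Edges between two second-degree nodes are skipped here as a simplification.

```python
from collections import defaultdict


def build_seed_graph(seed, transactions):
    """Build a three-level graph around `seed`.

    `transactions` is an iterable of hypothetical
    (sender, receiver, amount, kind) tuples.
    """
    # Index every account's direct counterparties.
    neighbors = defaultdict(set)
    for sender, receiver, _amount, _kind in transactions:
        if sender != receiver:
            neighbors[sender].add(receiver)
            neighbors[receiver].add(sender)

    # Second-degree: accounts that transacted directly with the seed.
    second = set(neighbors.get(seed, ()))
    # Third-degree: accounts that transacted with a second-degree account
    # and are not already in the graph.
    third = set()
    for account in second:
        third |= neighbors[account]
    third -= second | {seed}

    degree = {seed: 1}
    degree.update((a, 2) for a in second)
    degree.update((a, 3) for a in third)

    # Keep seed<->second and second<->third edges, and store
    # characterization data (interaction count, total, types) on each edge.
    edges = {}
    for sender, receiver, amount, kind in transactions:
        if sender not in degree or receiver not in degree:
            continue
        if {degree[sender], degree[receiver]} not in ({1, 2}, {2, 3}):
            continue
        key = tuple(sorted((sender, receiver)))
        attrs = edges.setdefault(key, {"count": 0, "total": 0.0, "kinds": set()})
        attrs["count"] += 1
        attrs["total"] += amount
        attrs["kinds"].add(kind)

    return {"seed": seed, "degree": degree, "edges": edges}


# Made-up sample data: D only transacts with C, so it falls outside the graph.
example_transactions = [
    ("A", "B", 10.0, "payment"),
    ("A", "B", 5.0, "refund"),
    ("B", "C", 7.0, "payment"),
    ("C", "D", 1.0, "payment"),
    ("A", "E", 2.0, "payment"),
]
graph = build_seed_graph("A", example_transactions)
```

Graphs built this way for many labeled seed accounts could then be flattened into feature vectors and passed to a classifier such as a random forest, which the abstract names as one option.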

First claim

Opening claim text (preview).

1. (canceled)

2. A system, comprising: a processor; and a non-transitory computer readable medium having stored thereon instructions executable to cause the system to perform operations comprising: creating a seed node based on a seed account that satisfies one or more selection criteria; creating a graph data structure corresponding to the seed node that includes information on other nodes, corresponding to other user accounts, connected to the seed node, including: determining a plurality of second-degree accounts with which the seed account has transacted; adding plurality of second-degree accounts as second-degree nodes connected to the seed node in the graph data structure, wherein each second-degree node added has less than a threshold number of connections to other nodes; creating a first group of edges in the graph data structure indicating one or more respective transactions between the seed node and each of the one or more second degree nodes; and for each of the plurality of more second-degree nodes: determining, for that second-degree node, whether one or more third-degree accounts exist that have transacted with a second-degree account for that second-degree node; if the one or more third-degree accounts exist, adding the one or more third-degree accounts as one or more third-degree nodes to the graph data structure; and if the one or more third-degree accounts exist, creating a second respective group of edges in the graph data structure indicating links between that second-degree node and the one or more third-degree nodes; for each of the edges in the graph data structure, calculating and storing, in the graph data structure one, or more attribute values based on one or more transactions occurring between the nodes connected to that edge; and providing the graph data structure as an input to a machine learning model.

3. The system of claim 2, wherein providing the graph data structure as input to the machine learning model comprises providing a label value for the seed node to the machine learning model, wherein the label value indicates whether the seed node corresponds to a user account that has engaged in behavior prohibited by an authorized use policy applicable to the user account.

4. The system of claim 2, wherein the operations further comprise: providing a plurality of graph data structures to the machine learning model; and the machine learning model producing a trained classifier, based on the plurality of graph data structures, that is configured to accept an unclassified graph data structure and predict a classification value for an unclassified seed node for the unclassified graph data structure.

5. The system of claim 4, wherein the classification value is a categorization of whether an account is likely to engage in behavior prohibited by an authorized use policy applicable to the account.

6. The machine learning system of claim 4, wherein the classification value has a corresponding confidence value.

7. The system of claim 2, wherein the one or more attribute values for at least one of the edges in the graph include a dispute claim type for one or more transactions.

8. The system of claim 2, wherein the operations further comprise calculating and storing graph-level attributes for the graph data structure based on attribute values for the nodes in the graph.

9. The system of claim 8, wherein the graph-level attributes include a proportion of nodes in the graph corresponding to accounts believed to have violated an acceptable use policy.

10. A method for machine-learning based account classification, comprising: accessing, by a computer system, a graph data structure having a seed node that corresponds to an unclassified seed account; providing, by the computer system, the graph data structure to a trained machine learning (ML) classifier, wherein the ML classifier was trained using a plurality of graph data structures each built using operations comprising: determining one or more second-degree accounts with which a seed account for the graph data structure has transacted; adding the one or more second-degree accounts as one or more second-degree nodes connected to the seed node in the graph data structure; creating a first group of edges in the graph data structure indicating links between the seed node and each of the one or more second degree nodes; and for each of the one or more second-degree nodes: determining, for that second-degree node, whether one or more third-degree accounts exist that have transacted with a second-degree account for that second-degree node; if the one or more third-degree accounts exist, adding the one or more third-degree accounts as one or more third-degree nodes to the graph data structure; and if the one or more third-degree accounts exist, creating a second respective group of edges in the graph data structure indicating links between that second-degree node and the one or more third-degree nodes; receiving, by the computer system from the trained ML classifier, a classification of the seed account.

11. The method of claim 10, wherein the computer system comprises a plurality of computing devices linked via one or more networks, and wherein the computer system comprises the trained ML classifier.

12. The method of claim 10, further comprising: determining, by the computer system, whether to take a corrective action against the seed account based on the classification.

13. The method of claim 12, wherein the classification indicates the seed account is believed to have engaged in collusion, further comprising taking corrective action including causing a suspension of transaction privileges for the seed account.

14. The method of claim 10, wherein the operations to build each of the plurality of graph data structures further comprise: for each of the edges in the graph data structure, calculating and storing, in the graph data structure, one or more attribute values based on one or more transactions occurring between the nodes connected to that edge.

15. The method of claim 10, wherein the ML classifier comprises an artificial neural network (ANN) based classifier.

16. The method of claim 10, wherein the classification of the seed account indicates that the seed account is likely to violate or has violated an authorized use policy (AUP) applicable to the seed account.

17. The method of claim 10, further comprising: based on the classification of the seed account, denying one or more electronic transactions requested by the seed account.

18. The method of claim 17, wherein the denied one or more electronic transactions include one or more electronic monetary transactions to purchase an item.

19. A non-transitory computer-readable medium having stored thereon instructions that are executable by a computer system to cause the computer system to perform operations comprising: accessing a graph data structure having a seed node that corresponds to an unclassified seed account; providing the graph data structure to a trained machine learning (ML) classifier, wherein the ML classifier was trained using a plurality of graph data structures each built using operations comprising: determining one or more second-degree accounts with which a seed account for the graph data structure has transacted; adding the one or more second-degree accounts as one or more second-degree nodes connected to the seed node in the graph data structure; creating a first group of edges in the graph data stru
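Two elements of the claims lend themselves to a short illustration: the connection threshold on second-degree nodes (claim 2) and the graph-level attribute giving the proportion of nodes believed to have violated an acceptable use policy (claims 8 and 9). The sketch below is hypothetical. The function names, the `connection_counts` mapping, the `flagged` set, and the threshold value of 1000 are assumptions for the example and do not come from the patent.

```python
def filter_second_degree(candidates, connection_counts, threshold):
    """Keep only candidate accounts with fewer than `threshold` connections.

    `connection_counts` is a hypothetical mapping from account id to the
    number of other accounts it is connected to.
    """
    return [a for a in candidates if connection_counts.get(a, 0) < threshold]


def graph_level_attributes(degree, flagged):
    """Compute simple graph-level attributes from node-level data.

    `degree` maps each node in the graph to its degree level (1, 2, or 3);
    `flagged` is a hypothetical set of accounts believed to have violated
    an acceptable use policy.
    """
    nodes = list(degree)
    flagged_count = sum(1 for n in nodes if n in flagged)
    return {
        "node_count": len(nodes),
        "flagged_proportion": flagged_count / len(nodes) if nodes else 0.0,
    }


# Made-up example: "HUB" exceeds the assumed threshold and is excluded.
kept = filter_second_degree(
    ["B", "E", "HUB"],
    {"B": 3, "E": 1, "HUB": 5000},
    threshold=1000,
)
attrs = graph_level_attributes({"A": 1, "B": 2, "E": 2, "C": 3}, flagged={"C"})
```

One effect of such a threshold is that a very highly connected account is left out of the graph, which keeps the graph small. The claims do not state the motivation, so this reading is an assumption.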

Assignees

Inventors

Classifications

  • Dynamic search techniques; Heuristics; Dynamic trees; Branch-and-bound · CPC title

  • Combinations of networks · CPC title

  • Probabilistic graphical models, e.g. probabilistic networks · CPC title

  • Supervised learning · CPC title

  • G06N20/00 · Primary

    Machine learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2022051144A1 cover?
Machine learning techniques are used in combination with graph data structures to perform automated classification of accounts. Graphs may be constructed using a seed node and then expanded outward to second-degree nodes and third-degree nodes that are connected to a seed user account node via direct interaction between the accounts. Characterization information regarding the interaction betwee…
Who is the assignee on this patent?
PayPal Inc
What technology area does this patent fall under?
Primary CPC classification G06N20/00. Mapped technology areas include Physics.
When was this patent published?
Publication date Feb 17, 2022 (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).