Fraud detection using graph databases

US11316874B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11316874-B2
Application numberUS-202016944932-A
CountryUS
Kind codeB2
Filing dateJul 31, 2020
Priority dateJan 10, 2020
Publication dateApr 26, 2022
Grant dateApr 26, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Aspects discussed herein relate to the storage of data in graph databases and detecting fraudulent behavior in the stored data. Fraud detection systems may use graph databases to store data, allowing for querying the graph database to obtain data using a variety of graph semantics such as nodes, edges, and properties. Graph databases in accordance with embodiments of the invention may include account nodes and attribute nodes, where nodes of the same type are not directly linked to each other. When a particular node is updated, an updated node may be created with a higher version number than the existing node. Each node may include an indication of the node being associated with fraudulent activity. Fraud indicators may be calculated based on the relationships between the nodes and fraud indicators for the nodes.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method for fraud detection, comprising: obtaining account data comprising an account number; determining a corresponding account node having an account number corresponding to the account number, the corresponding account node being stored in a fraud detection database comprising a set of account nodes, a set of attribute nodes, and a set of edge data, each edge in the set of edge data identifying an account node in the set of account nodes, an attribute node in the set of attribute nodes, and a label corresponding to a value in the attribute node; generating a revised account node comprising a unique identifier, the account number, a suspicious account identifier, and version data indicating that the revised account node is a later version of the corresponding account node; generating revised edge data based on at least one edge linking the revised account node to an attribute node in the set of attribute nodes; inserting the revised account node into the fraud detection database; inserting the revised edge data into the fraud detection database, wherein each edge in the revised edge data links the revised account node to each attribute node indicated in the revised edge data; calculating a fraud score for the revised account node, the fraud score being calculated based on the revised edge data and each attribute node indicated in the revised edge data; and setting, based on the fraud score, a suspicious account indicator for the revised account node such that the revised account node is indicated as a suspicious account. 2. The computer-implemented method of claim 1 , further comprising preprocessing an attribute node in the set of attribute nodes to format the value of the attribute node in a standardized format based on a label indicated in an edge linking the attribute node to an account node. 3. The computer-implemented method of claim 1 , further comprising: identifying an abbreviated keyword in an attribute node; and updating the value of the attribute node to replace the abbreviated keyword with a standardized keyword determined based on a label indicated in an edge linking the attribute node to an account node. 4. The computer-implemented method of claim 1 , further comprising encrypting the value for each attribute node based on a random string and a hashing algorithm. 5. The computer-implemented method of claim 4 , further comprising: regenerating the random string on a predetermined schedule; and encrypting the value for each attribute node based on the regenerated random string and the hashing algorithm on the predetermined schedule. 6. The computer-implemented method of claim 1 , further comprising: determining a whitelist comprising a set of values occurring in the set of attribute nodes; and deleting, from the fraud detection database, attributes nodes having a value corresponding to the set of values in the whitelist. 7. The computer-implemented method of claim 6 , further comprising determining the set of values in the whitelist based on a number of times the value occurs in an attribute node in the set of attribute nodes. 8. The computer-implemented method of claim 6 , further comprising determining the set of values in the whitelist based on a set of default values corresponding to a particular label associated with one or more edges in the set of edge data. 9. The computer-implemented method of claim 1 , further comprising querying the fraud detection database by: obtaining an account identifier; mapping the account identifier to an account number; obtaining a related query node associated with the account number; determining a current account node based on the account nodes linked to the related query node, wherein the current account node comprises the account node linked to the related query node and having version data indicating the current account node is the latest version; determining a set of related attribute nodes based on a set of attribute notes nodes linked to the current account node; and generating a query result based on the current account node and the set of related attribute nodes. 10. The computer-implemented method of claim 1 , wherein: each account node in the set of account nodes comprising a unique identifier, an account number, version data, and a suspicious account indicator indicating if the account node corresponds to a suspicious account; each attribute node in the set of attribute nodes comprising a value; each edge in the set of edge data linking an account node to an attribute node and comprising a label corresponding to the value in the attribute node and an edge weight; and wherein the fraud detection database further comprises: a set of query nodes, each query node in the set of query nodes comprising an account number from the set of account nodes; and a set of query edge data, each query edge in the set of query edge data linking a query node to the account node having the account number corresponding to the account number in the linked query node. 11. A non-transitory machine-readable medium storing instructions that, when executed by one or more processors, cause the one or more processors to perform steps comprising: obtaining account data comprising an account number; determining a corresponding account node having an account number corresponding to the account number, the corresponding account node being stored in a fraud detection database comprising a set of account nodes, a set of attribute nodes, and a set of edge data, each edge in the set of edge data identifying an account node in the set of account nodes, an attribute node in the set of attribute nodes, and a label corresponding to a value in the attribute node; generating a revised account node comprising a unique identifier, the account number, a suspicious account identifier, and version data indicating that the revised account node is a later version of the corresponding account node; generating revised edge data based on at least one edge linking the revised account node to an attribute node in the set of attribute nodes; inserting the revised account node into the fraud detection database; inserting the revised edge data into the fraud detection database, wherein each edge in the revised edge data links the revised account node to each attribute node indicated in the revised edge data; calculating a fraud score for the revised account node, the fraud score being calculated based on the revised edge data and the each attribute node indicated in the revised edge data; and setting, based on the fraud score, a suspicious account indicator for the revised account node such that the revised account node is indicated as a suspicious account. 12. The non-transitory machine-readable medium of claim 11 , wherein the instructions, when executed by one or more processors, further cause the one or more processors to perform steps comprising preprocessing an attribute node in the set of attribute nodes to format the value of the attribute node in a standardized format based on a label indicated in an edge linking the attribute node to an account node. 13. The non-transitory machine-readable medium of claim 11 , wherein the instructions, when executed by one or more processors, further cause the one or more processors to perform steps comprising: obtaining whitelist data comprising a set of whitelist values; and for each attribute node in the fraud detection database having a value corresponding to a whitelist value in the whitelist data, setting an edge weight of each edge linking the attribute node to an account node to zero. 14.

Assignees

Inventors

Classifications

  • involving fraud or risk level assessment in transaction processing · CPC title

  • Graphs; Linked lists (G06F16/9027 takes precedence) · CPC title

  • Query processing · CPC title

  • Traffic logging, e.g. anomaly detection · CPC title

  • Use of message hashing · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11316874B2 cover?
Aspects discussed herein relate to the storage of data in graph databases and detecting fraudulent behavior in the stored data. Fraud detection systems may use graph databases to store data, allowing for querying the graph database to obtain data using a variety of graph semantics such as nodes, edges, and properties. Graph databases in accordance with embodiments of the invention may include a…
Who is the assignee on this patent?
Capital One Services Llc
What technology area does this patent fall under?
Primary CPC classification G06F16/9024. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 26 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 11 related publications on this page (citations in our corpus or others sharing the same primary CPC).