System and method for detecting fraudulent documents

US10839208B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10839208-B2
Application numberUS-201816215159-A
CountryUS
Kind codeB2
Filing dateDec 10, 2018
Priority dateDec 10, 2018
Publication dateNov 17, 2020
Grant dateNov 17, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system and method to detect fraudulent documents is disclosed. The system uses a generative adversarial network to generate synthetic document data including new fraud patterns. The synthetic document data is used to train a fraud classifier to detect potentially fraudulent documents as part of a document validation workflow. The method includes extracting document features from sample data corresponding to target regions of the documents, such as logo regions and watermark regions. The method may include updating a cost function of the generators to reduce the tendency of the system to generate repeated fraud patterns.

First claim

Opening claim text (preview).

We claim: 1. A method of document fraud detection, the method comprising the steps of: generating a set of synthetic training documents, wherein generating the set of synthetic training documents comprises: receiving an initial set of training documents; extracting a document feature from the training documents; and using a generative adversarial network to generate the set of synthetic training documents, wherein the generative adversarial network receives information about the document feature as input, wherein the generative adversarial network comprises a generator and a discriminator, and the generator comprises a neural network with a generator cost function; identifying a fraud pattern associated with the set of synthetic training documents; updating the generator cost function; and training a document fraud detection system using the set of synthetic training documents. 2. The method according to claim 1 , the method further comprising steps of: receiving a document; using the document fraud detection system to detect tampered regions in the document; and providing an alert that tampered regions have been detected in the document. 3. The method according to claim 1 , wherein extracting the document feature includes using a conditional generative adversarial network. 4. The method according to claim 1 , wherein the document fraud detection system includes a semi-supervised learning algorithm, and wherein training the document fraud detection system comprises training the semi-supervised learning algorithm. 5. The method according to claim 1 , wherein the document feature is associated with a region of a document. 6. The method according to claim 5 , wherein the region is associated with a logo in the document. 7. The method according to claim 5 , wherein the region is associated with a watermark in the document. 8. The method according to claim 5 , wherein the region is associated with a signature in the document. 9. The method according to claim 1 , further comprising: penalizing the generator for producing repeated fraud patterns in the set of synthetic training documents. 10. The method according to claim 1 , wherein updating the generator cost function further comprises updating the generator cost function to reduce the likelihood that the generator will produce another synthetic document with the identified fraud pattern. 11. A non-transitory computer-readable medium storing software comprising instructions that are executable by one or more device processors to detect fraudulent documents by: generating a set of synthetic training documents, wherein generating the set of synthetic training documents comprises: receiving an initial set of training documents; extracting a document feature from the training documents; using a generative adversarial network to generate the set of synthetic training documents, wherein the generative adversarial network receives information about the document feature as input, wherein the generative adversarial network comprises a generator and a discriminator and the generator comprises a neural network with a generator cost function; identifying a fraud pattern associated with the set of synthetic training documents; updating the generator cost function; and training a document fraud detection system using the set of synthetic training documents. 12. The non-transitory computer-readable medium according to claim 11 , wherein extracting the document feature includes using a conditional generative adversarial network. 13. The non-transitory computer-readable medium according to claim 11 , wherein the document fraud detection system includes a semi-supervised learning algorithm, and wherein training the document fraud detection system comprises training the semi-supervised learning algorithm. 14. The non-transitory computer-readable medium according to claim 11 , wherein the document feature is associated with a region of a document. 15. The non-transitory computer-readable medium according to claim 11 , wherein the region is associated with a logo in the document. 16. The non-transitory computer-readable medium according to claim 11 , wherein the region is associated with a watermark in the document. 17. The non-transitory computer-readable medium according to claim 11 , wherein the region is associated with a signature in the document. 18. A system for detecting fraudulent documents, the system comprising: a device processor; and a non-transitory computer readable medium storing instructions that are executable by the device processor to: generate a set of synthetic training documents by: receiving an initial set of training documents; extracting a document feature from the training documents; and using a generator of a generative adversarial network to generate the set of synthetic training documents, wherein the generator receives information about the document feature as input; train a document fraud detection system using the set of synthetic training documents; identify a fraud pattern associated with the set of synthetic training documents; and penalize the generator for producing repeated fraud patterns in the set of synthetic training documents. 19. The system according to claim 18 , wherein the document feature is associated with a region of a document. 20. The system according to claim 18 , wherein the generative adversarial network comprises a discriminator, wherein the generator comprises a neural network with a generator cost function, and wherein the non-transitory computer readable medium storing instructions are also executable by the device processor to: update the generator cost function.

Assignees

Inventors

Classifications

  • Document-oriented image-based pattern recognition · CPC title

  • Classification techniques · CPC title

  • using neural networks · CPC title

  • G06N3/08Primary

    Learning methods · CPC title

  • Document matching, e.g. of document images · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10839208B2 cover?
A system and method to detect fraudulent documents is disclosed. The system uses a generative adversarial network to generate synthetic document data including new fraud patterns. The synthetic document data is used to train a fraud classifier to detect potentially fraudulent documents as part of a document validation workflow. The method includes extracting document features from sample data c…
Who is the assignee on this patent?
Accenture Global Solutions Ltd
What technology area does this patent fall under?
Primary CPC classification G06N3/08. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Nov 17 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 9 related publications on this page (citations in our corpus or others sharing the same primary CPC).