Machine learning module training using input reconstruction techniques and unlabeled transactions

US11734558B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11734558-B2
Application numberUS-202016900129-A
CountryUS
Kind codeB2
Filing dateJun 12, 2020
Priority dateJun 12, 2020
Publication dateAug 22, 2023
Grant dateAug 22, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Techniques are disclosed relating to improving machine learning classification using both labeled and unlabeled data, including electronic transactions. A computing system may train a machine learning module using a first set of transactions (of any classifiable data) with label information that indicates designated classifications for those transactions and a second set of transactions without label information. This can allow for improved classification error rates, particularly when additional labeled data may not be present (e.g., if a transaction was disallowed, it may not be later labeled as fraudulent or not). The training process may include generating first error data based on classification results for the first set of transactions, generating second error data based on reconstruction results for both the first and second sets of transactions, and updating the machine learning module based on the first and second error data.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for training machine learning models using input reconstruction results for unlabeled, incomplete transactions, comprising: training, by a computing system, a machine learning module to classify electronic transactions, such that the machine learning module learns a characteristic of the electronic transactions that is indicative of different error data for those transactions, wherein the training uses a set of labeled transactions with labels indicating designated classifications for those transactions and a set of unlabeled transactions, and wherein the training includes: generating first error data based on classification results generated by the machine learning module for the set of labeled transactions; generating second, different error data based on reconstruction results generated via reconstruction of both the set of labeled transactions and the set of unlabeled transactions input into the machine learning module; and updating the machine learning module based on the first and second error data; and determining, by the computing system in response to receiving a request for authorization of a newly initiated transaction, to authorize the newly initiated transaction, wherein the determining is performed using the machine learning module that is trained based on reconstruction results for both labeled and unlabeled transactions. 2. The method of claim 1 , wherein the set of unlabeled transactions are incomplete transactions and the set of labeled transactions are completed transactions. 3. The method of claim 1 , wherein transactions in the set of labeled transactions and the set of unlabeled transactions are electronic monetary transactions and wherein classifications output by the trained machine learning module specify whether transactions are authorized by an entity that had been requested to authorize those transactions. 4. The method of claim 1 , further comprising: determining whether classification results for the set of unlabeled transactions match, within a threshold degree, with an expected distribution of classification results for unlabeled transactions. 5. The method of claim 4 , further comprising: subsequent to the updating, further updating the machine learning module based on error data from the set of labeled transactions but not the set of unlabeled transactions based on determining that the classification results for the set of unlabeled transactions do not match the expected distribution. 6. The method of claim 1 , wherein the reconstruction results indicate whether a model implemented by the machine learning module is stale. 7. The method of claim 1 , further comprising: tagging transactions in the set of unlabeled transactions to indicate that these transactions do not have label information, wherein one or more elements of the machine learning module are configured not to generate error data based on classification results for transactions of the set of unlabeled transactions based on detecting the tagging. 8. The method of claim 1 , further comprising: determining whether classification results for the set of unlabeled transactions include a threshold percentage of results that have a particular classification; and rolling back updates to the machine learning module, wherein the rolling back is performed based on determining that the classification results for the set of unlabeled transactions do not include the threshold percentage. 9. The method of claim 1 , wherein the reconstruction results are generated by generating output for the set of labeled transactions and the set of unlabeled transactions based on output of a first hidden layer of a neural network and not based on output of subsequent layers of the neural network. 10. A non-transitory computer-readable medium having instructions stored thereon that are executable by a computing device to perform operations comprising: training a machine learning module to classify electronic transactions, such that the machine learning module learns a characteristic of the electronic transactions that is indicative of different error data for those transactions, wherein the training uses a set of labeled transactions with labels that indicate designated classifications for those transactions and a second, different set of unlabeled transactions, wherein the training includes: generating first error data based on classification results generated by the machine learning module for the set of labeled transactions, wherein transactions in the set of labeled transactions are completed transactions; generating second error data based on reconstruction results generated by an input reconstruction module for both the set of labeled transactions and the set of unlabeled transactions, wherein transactions in the set of unlabeled transactions are incomplete transactions; and updating the machine learning module based on the first and second error data; and determining, in response to receiving a request for authorization of a transaction, to authorize the transaction, wherein the determining is performed using the machine learning module that is trained based on reconstruction results for both labeled and unlabeled transactions. 11. The non-transitory computer-readable medium of claim 10 , wherein transactions in the sets of transactions are electronic monetary transactions, and wherein transactions in the set of labeled transactions are approved transactions and transactions in the set of unlabeled transactions are denied transactions. 12. The non-transitory computer-readable medium of claim 10 , wherein the operations further comprise: determining whether classification results for the set of unlabeled transactions include a threshold percentage of results that have a particular classification. 13. The non-transitory computer-readable medium of claim 12 , wherein the operations further comprise: rolling back updates to the machine learning module, wherein the rolling back is performed based on determining that the classification results for the set of unlabeled transactions do not include the threshold percentage. 14. The non-transitory computer-readable medium of claim 10 , wherein the operations further comprise: tagging transactions in the set of unlabeled transactions to indicate that these transactions do not have label information, wherein one or more elements of the machine learning module are configured not to generate error data based on classification results for transactions of the set of unlabeled transactions based on detecting the tagging. 15. The non-transitory computer-readable medium of claim 10 , wherein the first error data indicates differences between the classification results and the designated classifications, and wherein the second error data indicates differences between the reconstruction results and input being reconstructed. 16. A method, comprising receiving, by a computing system, a request for authorization of an electronic transaction; accessing, by the computing system, a machine learning module that was trained such that the machine learning module learns a characteristic of the electronic transactions that is indicative of different error data for those transactions, wherein the machine learning model is trained prior to the accessing, by: generating first error data based on classification results generated by the machine learning module for a first set of labeled transactions with labels indicating correct classifications for those transactions; generating second, different error data based on reconstruction results generated via inpu

Assignees

Inventors

Classifications

  • Auto-encoder networks; Encoder-decoder networks · CPC title

  • Weakly supervised learning, e.g. semi-supervised or self-supervised learning · CPC title

  • Supervised learning · CPC title

  • Feedforward networks · CPC title

  • G06N3/08Primary

    Learning methods · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11734558B2 cover?
Techniques are disclosed relating to improving machine learning classification using both labeled and unlabeled data, including electronic transactions. A computing system may train a machine learning module using a first set of transactions (of any classifiable data) with label information that indicates designated classifications for those transactions and a second set of transactions without…
Who is the assignee on this patent?
Paypal Inc
What technology area does this patent fall under?
Primary CPC classification G06N3/08. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 22 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).