Risky transaction identification method and apparatus

US11087180B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11087180-B2
Application numberUS-202016802627-A
CountryUS
Kind codeB2
Filing dateFeb 27, 2020
Priority dateMar 27, 2018
Publication dateAug 10, 2021
Grant dateAug 10, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A feature extraction is performed on transaction data to obtain a user classification feature and a transaction classification feature. A first dimension feature is constructed based on the user classification feature and the transaction classification feature. A dimension reduction processing is performed on the first dimension feature to obtain a second dimension feature. A probability that the transaction data relates to a risky transaction is determined based on a decision classification of the second dimension feature, where the decision classification is based on a pre-trained deep forest network including a plurality of levels of decision tree forest sets.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method, comprising: training a deep forest model that includes multiple levels of respective base classifiers on training data that specifies a plurality of transaction samples, comprising: collecting a plurality of black samples and white samples, wherein each black sample relates to a risky transaction, and wherein each white sample relates to a normal transaction; extracting feature data from data associated with the black samples and data associated with the white samples; generating sampled feature data from the feature data; and iteratively performing a training process on the deep forest model, wherein the training process comprises, for a current level of respective base classifiers: training each base classifier included in the current level on the sampled feature data; concatenating one or more output features of the current level to features from the sampled feature data to generate concatenated features; training each base classifier included in a next level by using the concatenated features; and terminating the training process upon determining that a predetermined termination condition is satisfied; after the training, obtaining new feature data describing a transaction initiated by a user of a transaction service, wherein the new feature data comprises a set of features belonging to respective feature categories; for each feature category: determining, based at least on a sampling density used in collecting the features belonging to the feature category, a respective rate for use in selecting sampled features; and selecting, from the features belonging to the feature category and in accordance with the respective rate, a plurality of sampled features; and generating, based on processing the sampled features using the deep forest model, an output specifying a predicted classification of the transaction. 2. The computer-implemented method of claim 1 , wherein generating the sampled feature data from the feature data comprises: performing a dimension reduction process on the feature data having a first dimension to obtain the sampled feature data that has a lower dimension than the first dimension. 3. The computer-implemented method of claim 1 , wherein a number of the black samples is not equal to a number of the white samples, and the method further comprises, prior to training each base classifier: dividing data associated with the black samples and data with the white samples through a k-fold cross validation into one or more training datasets and one or more corresponding validation datasets; training a base classifier on the training datasets; and testing the base classifier on the validation datasets to obtain an indicator that evaluates a performance of the base classifier. 4. The computer-implemented method of claim 1 , further comprising: determining a maximum decision tree depth threshold based on a black-to-white sample ratio; and setting a maximum value of the decision tree depth to the maximum depth threshold. 5. A non-transitory, computer-readable medium storing one or more instructions executable by a computer system to perform operations comprising: training a deep forest model that includes multiple levels of respective base classifiers on training data that specifies a plurality of transaction samples, comprising: collecting a plurality of black samples and white samples, wherein each black sample relates to a risky transaction, and wherein each white sample relates to a normal transaction; extracting feature data from data associated with the black samples and data associated with the white samples; generating sampled feature data from the feature data; and iteratively performing a training process on the deep forest model, wherein the training process comprises, for a current level of respective base classifiers: training each base classifier included in the current level on the sampled feature data; concatenating one or more output features of the current level to features from the sampled feature data to generate concatenated features; training each base classifier included in a next level by using the concatenated features; and terminating the training process upon determining that a predetermined termination condition is satisfied; after the training, obtaining new feature data describing a transaction initiated by a user of a transaction service, wherein the new feature data comprises a set of features belonging to respective feature categories; for each feature category: determining, based at least on a sampling density used in collecting the features belonging to the feature category, a respective rate for use in selecting sampled features; and selecting, from the features belonging to the feature category and in accordance with the respective rate, a plurality of sampled features; and generating, based on processing the sampled features using the deep forest model, an output specifying a predicted classification of the transaction. 6. The non-transitory, computer-readable medium of claim 5 , wherein generating the sampled feature data from the feature data comprises: performing a dimension reduction process on the feature data having a first dimension to obtain the sampled feature data that has a lower dimension than the first dimension. 7. A computer-implemented system, comprising: one or more computers; and one or more computer memory devices interoperably coupled with the one or more computers and having tangible, non-transitory, machine-readable media storing one or more instructions that, when executed by the one or more computers, perform one or more operations comprising: training a deep forest model that includes multiple levels of respective base classifiers on training data that specifies a plurality of transaction samples, comprising: collecting a plurality of black samples and white samples, wherein each black sample relates to a risky transaction, and wherein each white sample relates to a normal transaction; extracting feature data from data associated with the black samples and data associated with the white samples; generating sampled feature data from the feature data; and iteratively performing a training process on the deep forest model, wherein the training process comprises, for a current level of respective base classifiers: training each base classifier included in the current level on the sampled feature data; concatenating one or more output features of the current level to features from the sampled feature data to generate concatenated features; training each base classifier included in a next level by using the concatenated features; and terminating the training process upon determining that a predetermined termination condition is satisfied; after the training, obtaining new feature data describing a transaction initiated by a user of a transaction service, wherein the new feature data comprises a set of features belonging to respective feature categories; for each feature category: determining, based at least on a sampling density used in collecting the features belonging to the feature category, a respective rate for use in selecting sampled features; and selecting, from the features belonging to the feature category and in accordance with the respective rate, a plurality of sampled features; and generating, based on processing the sampled features using the deep forest model, an output specifying a predicted classification of the transaction. 8. The computer-implemented system of claim 7 , wherein generating the sampled feature data from the feature data comprises: performing a dimension reduction process on the feature data having a first dimension to obtain

Assignees

Inventors

Classifications

  • based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate · CPC title

  • involving fraud or risk level assessment in transaction processing · CPC title

  • Tree-organised classifiers · CPC title

  • Generating training patterns; Bootstrap methods, e.g. bagging or boosting · CPC title

  • Machine learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11087180B2 cover?
A feature extraction is performed on transaction data to obtain a user classification feature and a transaction classification feature. A first dimension feature is constructed based on the user classification feature and the transaction classification feature. A dimension reduction processing is performed on the first dimension feature to obtain a second dimension feature. A probability that t…
Who is the assignee on this patent?
Advanced New Technologies Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06Q20/4016. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 10 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).