Federated learning for improving matching efficiency

US12468985B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12468985-B2
Application numberUS-202117461979-A
CountryUS
Kind codeB2
Filing dateAug 30, 2021
Priority dateOct 16, 2020
Publication dateNov 11, 2025
Grant dateNov 11, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method includes: sending, by one or more computers, in response to the number of data providers for federated learning being greater than a first threshold, a data field required for the federated learning to a coordinator, the coordinator comprising a computer; receiving, by one or more computers, from the coordinator, information about one or more data providers comprising the required data field, for determining the data providers comprising the required data field as the remaining data providers, wherein the coordinator stores a data field of each data provider; and performing, by one or more computers, federated learning-based modeling with each of the remaining data providers.

First claim

Opening claim text (preview).

The invention claimed is: 1 . A computer-implemented method comprising: sending, by one or more computers, in response to a number of data providers for federated learning being greater than a first threshold, a data field required for the federated learning to a coordinator, the coordinator comprising a computer; receiving, by the one or more computers, from the coordinator, information about one or more data providers comprising the required data field, for determining the data providers comprising the required data field as remaining data providers, wherein the coordinator stores a data field of each data provider; acquiring, by the one or more computers, in response to a number of the remaining data providers being greater than the first threshold, a first public key from the coordinator to encrypt a local user ID, wherein the local user ID is a user ID of a data user; acquiring a user ID, by the one or more computers, which has been encrypted with a second public key, of each of the remaining data providers, wherein the second public key for the remaining data provider is acquired from the coordinator; acquiring, by the one or more computers, an overlap rate by comparing the local user ID with the acquired user ID of each of the remaining data providers; and screening, by the one or more computers, to obtain one or more data providers with an overlap rate being greater than a second threshold, to use the one or more data providers obtained by screening as new remaining data providers; and performing, by the one or more computers, federated learning-based modeling with each of the remaining data providers. 2 . The method of claim 1 , further comprising: acquiring, by the one or more computers, in response to the number of the remaining data providers being greater than the first threshold, a statistical indicator of respective data fields of the remaining data providers; and performing, by the one or more computers, screening on the remaining data providers based on the statistical indicator, to use one or more data providers obtained by screening as new remaining data providers. 3 . The method of claim 2 , wherein the statistical indicator comprises one or more of the following: a mean value, a maximum value, and a minimum value. 4 . The method of claim 1 , wherein performing federated learning-based modeling with each of the remaining data providers comprises: screening, by the one or more computers, feature data that is common with each of the remaining data providers by invoking a feature screening model; and performing, by the one or more computers, federated learning-based modeling with each of the remaining data providers based on the feature data. 5 . The method of claim 1 , wherein the required data field comprises a time interval. 6 . An electronic device, comprising: a memory storing one or more programs configured to be executed by one or more processors, the one or more programs including instructions for causing the electronic device to perform operations comprising: sending, in response to a number of data providers for federated learning being greater than a first threshold, a data field required for the federated learning to a coordinator; receiving, from the coordinator, information about one or more data providers comprising the required data field, for determining the data providers comprising the required data field as remaining data providers, wherein the coordinator stores a data field of each data provider; acquiring, in response to a number of the remaining data providers being greater than the first threshold, a first public key from the coordinator to encrypt a local user ID, wherein the local user ID is a user ID of a data user; acquiring a user ID, which has been encrypted with a second public key, of each of the remaining data providers, wherein the second public key for the remaining data provider is acquired from the coordinator; acquiring an overlap rate by comparing the local user ID with the acquired user ID of each of the remaining data providers; and screening to obtain one or more data providers with an overlap rate being greater than a second threshold, to use the one or more data providers obtained by screening as new remaining data providers; and performing federated learning-based modeling with each of the remaining data providers. 7 . The electronic device of claim 6 , the one or more programs further including instructions for causing the electronic device to perform operations comprising: acquiring, in response to the number of the remaining data providers being greater than the first threshold, a statistical indicator of respective data fields of the remaining data providers; and performing screening on the remaining data providers based on the statistical indicator, to use one or more data providers obtained by screening as new remaining data providers. 8 . The electronic device of claim 7 , wherein the statistical indicator comprises one or more of the following: a mean value, a maximum value, and a minimum value. 9 . The electronic device of claim 6 , wherein performing federated learning-based modeling with each of the remaining data providers comprises: screening, feature data that is common with each of the remaining data providers by invoking a feature screening model; and performing federated learning-based modeling with each of one or more remaining data providers based on the feature data. 10 . A non-transitory computer-readable storage medium that stores one or more programs comprising instructions that, when executed by one or more processors of an electronic device, cause the electronic device to implement operations comprising: sending, in response to a number of data providers for federated learning being greater than a first threshold, a data field required for the federated learning to a coordinator, the coordinator comprising a computer; receiving, from the coordinator, information about one or more data providers comprising the required data field, for determining the data providers corresponding to information comprising the required data field as remaining data providers, wherein the coordinator stores a data field of each data provider; acquiring, in response to a number of the remaining data providers being greater than the first threshold, a first public key from the coordinator to encrypt a local user ID, wherein the local user ID is a user ID of a data user; acquiring a user ID, which has been encrypted with a second public key, of each of the remaining data providers, wherein the second public key for the remaining data provider is acquired from the coordinator; acquiring an overlap rate by comparing the local user ID with the acquired user ID of each of the remaining data providers; and screening to obtain one or more data providers with an overlap rate being greater than a second threshold, to use the one or more data providers obtained by screening as new remaining data providers, and performing federated learning-based modeling with each of the remaining data providers. 11 . The non-transitory computer-readable storage medium of claim 10 , the operations further comprising: acquiring, in response to the number of the remaining data providers being greater than the first threshold, a statistical indicator of respective data fields of the remaining data providers; and performing screening on the remaining data providers based on the statistical indicator, to use one or more data providers obtained by screening as new remaining data providers. 12 . The non-transitory computer-readable storage medium of claim 11 , wherein the statistical in

Assignees

Inventors

Classifications

  • G06Q40/03Primary

    Credit; Loans; Processing thereof · CPC title

  • Public key, i.e. encryption algorithm being computationally infeasible to invert or user's encryption keys not requiring secrecy · CPC title

  • Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor · CPC title

  • Ensuring data consistency and integrity · CPC title

  • Providing cryptographic facilities or services · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12468985B2 cover?
A method includes: sending, by one or more computers, in response to the number of data providers for federated learning being greater than a first threshold, a data field required for the federated learning to a coordinator, the coordinator comprising a computer; receiving, by one or more computers, from the coordinator, information about one or more data providers comprising the required data…
Who is the assignee on this patent?
Beijing Baidu Netcom Sci & Tech Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06Q40/03. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Nov 11 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).