Code generation through reinforcement learning using code-quality rewards
US-2023195428-A1 · Jun 22, 2023 · US
US11935137B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11935137-B2 |
| Application number | US-202318357213-A |
| Country | US |
| Kind code | B2 |
| Filing date | Jul 24, 2023 |
| Priority date | Aug 5, 2022 |
| Publication date | Mar 19, 2024 |
| Grant date | Mar 19, 2024 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A method for distributing an equity reward for federated learning based on an equity theory includes the following steps: applying Adams' equity theory to federated learning, analyzing, by a participant, all factors invested in a federated task comprehensively, then giving an expected reward for this task, calculating, by the task publisher, the reputation of the participant; participating, by the participant, in each round of a training task using a local data to evaluate data contribution, model contribution, and a waiting-time allowance of the participant, then combining contribution results of the three factors to evaluate the contribution of the participant; after a global model converges, dynamically adjusting weights of the three factors according to an objective function of the equity reward, with a goal that an actual reward of the participant is as close as possible to the expected reward, and obtaining and distributing the actual reward of the participant.
Opening claim text (preview).
What is claimed is: 1. A method for distributing an equity reward for federated learning based on an equity theory, wherein Adams' equity theory is applied to the federated learning that is a decentralized machine learning scheme to facilitate collaborative training across participants by a coordinator thereby improving model performance of the participants, the participants are intelligent clients ranging from resource-rich devices including a large-scale server to resource-constrained device, and the coordinator served by a blockchain, and inputs and benefits of Adams' equity theory are transformed into an expected reward and an actual reward in the federated learning, comprising the following steps: giving, by a participant, the expected reward for this task after comprehensively analyzing all factors invested in the federated task, and calculating, by a task publisher, the reputation of the participant using a historical task of the participant, wherein the participant is an intelligent client, the task publisher is a blockchain smart contract; participating, by the participant, in each round of a training task using local data, evaluating, by the task publisher, three factors of data contribution, model contribution, and waiting-time allowance of the participant, and evaluating, by the task publisher, the contribution of the participant by combining contribution results of the three factors; entering, by the task publisher, a reward distribution phase after a global model converges, dynamically adjusting weights of the three factors according to an objective function of the equity reward, with a goal that the actual reward of the participant is as close as possible to the expected reward, and obtaining and distributing, by the task publisher, the actual reward of the participant, the steps of giving, by the participant, the expected reward for this task after comprehensively analyzing, by the participant, all the factors invested in the federated task, and calculating, by the task publisher, the reputation of the participant using the historical task of the participant comprise: in the preparation phase, giving, by participant i, the expected reward ER for this task after comprehensively analyzing all the factors invested in the federated task (E i ); the task publisher combines a historical federated learning task of each participant and evaluates a degree of match between ER and AR for historical tasks, the task publisher utilizes a forgetting factor as a coefficient, the closer the task to a current moment, the higher the weight of the task, and the task publisher obtains the reputation V i of the participant i based on a historical reputation record of the participant with the following calculation formula: V i = ∑ t = t 0 t n o w ( 1 - 2 π · tan - 1 ( A i t - E i t A i t ) ) · e - ( t - t n o w ) ( 7 ) wherein A i t and E i t are the actual rewards ARs and the expected rewards ERs of the participant i in a t-th task, and e −(t−t now ) is a forgetting factor of the t-th task, the steps of participating, by the participant, in each round of the training task using the local data, evaluating, by the task publisher, the three factors of the data contribution, the model contribution, and the waiting-time allowance of the participant, and evaluating the contribution of the participant by combining the contribution results of the three factors comprises: in a training process, participating, by the participant, in each round of the training task according to training rules of federated learning, and in an r-th round, downloading an aggregated global model of the r−1-th round from a blockchain, training the aggregated global model using the local data, uploading an updated local model to the blockchain, and repeating this training process, after the global model of the r-th round is aggregated, until the model converges; in a contribution evaluation process, taking the data contribution, the model contribution, and the waiting-time allowance as the three factors, and calculating, by the participant i, the data contribution u 1i according to a formula u 1 i =
Finance or payroll · CPC title
Distributed learning, e.g. federated learning · CPC title
Machine learning · CPC title
Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.