What technology area does this patent fall under?

Primary CPC classification G06N20/00. Mapped technology areas include Physics.

When was this patent published?

Publication date Thu Jun 06 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Systems and Methods for Distributed On-Device Learning with Data-Correlated Availability

US2019171978A1 · US · A1

Patent metadata
Field	Value
Publication number	US-2019171978-A1
Application number	US-201715834001-A
Country	US
Kind code	A1
Filing date	Dec 6, 2017
Priority date	Dec 6, 2017
Publication date	Jun 6, 2019
Grant date	—

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present disclosure provides systems and methods for distributed training of machine learning models. In one example, a computer-implemented method is provided for training machine-learned models. The method includes obtaining, by one or more computing devices, a plurality of regions based at least in part on temporal availability of user devices; selecting a plurality of available user devices within a region; and providing a current version of a machine-learned model associated with the region to the plurality of selected user devices within the region. The method includes obtaining, from the plurality of selected user devices, updated machine-learned model data generated by the plurality of selected user devices through training of the current version of the machine-learned model associated with the region using data local to each of the plurality of selected user devices and generating an updated machine-learned model associated with the region based on the updated machine-learned model data.

First claim

Opening claim text (preview).

What is claimed is: 1 . A computer-implemented method for training machine-learned models, the method comprising: obtaining, by one or more computing devices, a plurality of regions based at least in part on temporal availability of user devices; selecting, by the one or more computing devices, a plurality of available user devices within a region; providing, by the one or more computing devices, a current version of a machine-learned model associated with the region to the plurality of selected user devices within the region; obtaining, by the one or more computing devices from the plurality of selected user devices, updated machine-learned model data generated by the plurality of selected user devices through training of the current version of the machine-learned model associated with the region using data local to each of the plurality of selected user devices; and generating, by the one or more computing devices, an updated machine-learned model associated with the region based on the updated machine-learned model data. 2 . The computer-implemented method of claim 1 , wherein obtaining, by one or more computing devices, a plurality of regions based at least in part on temporal availability of user devices further comprises generating the plurality of regions based at least in part on one or more of: time zones, latitude ranges, longitude ranges, semantic boundaries, user population, or diurnal availability patterns. 3 . The computer-implemented method of claim 2 , wherein each region is generated such that each region comprises a user population having a similar diurnal cycle. 4 . The computer-implemented method of claim 1 , further comprising associating, by the one or more computing devices, a copy of a global machine-learned model with each region wherein the machine-learned model associated with the region is trained using federated learning based on users in the region. 5 . The computer-implemented method of claim 1 , further comprising: providing, by the one or more computing devices, a regularization term to the plurality of selected user devices within the region, wherein the regularization term is added to the loss function for training of the current version of the machine-learned model associated with the region, and wherein the regularization term represents a sum of distances, measured in parameter space, of the model associated with the region from at least one model of at least one other region. 6 . The computer-implemented method of claim 1 , wherein generating, by the one or more computing devices, the updated machine-learned model associated with the region based on the updated machine-learned model data further comprises performing, by the one or more computing devices, multitask learning to bias the machine-learned model associated with the region toward at least one machine-learned model associated with at least one other region. 7 . The computer-implemented method of claim 1 , further comprising: computing, by the one or more devices, a centroid of at least one model of at least one other region in parameter space; and providing, by the one or more computing devices, the centroid to the plurality of selected user devices within the region, wherein each of the plurality of selected user devices computes a distance, measured in parameter space, of the model associated with the region from the centroid as a regularization term that is added to the loss function for training of the current version of the machine-learned model associated with the region. 8 . A computing device comprising: one or more processors; and one or more non-transitory computer-readable media that store instructions that, when executed by the one or more processors, cause the computing device to: generate a plurality of regions based at least in part on temporal availability of user devices; select a plurality of available user devices within a region; provide a current version of a machine-learned model associated with the region to the plurality of selected user devices within the region, wherein each of the plurality of selected user devices performs training of the current version of the machine-learned model associated with the region using data local to each of the plurality of selected user devices; obtain, from the plurality of selected user devices, updated machine-learned model data; and generate an updated machine-learned model associated with the region based on the updated machine-learned model data. 9 . The computing device of claim 8 , wherein generating a plurality of regions based at least in part on temporal availability of user devices further comprises generating the plurality of regions based at least in part on one or more of: time zones, latitude ranges, longitude ranges, semantic boundaries, user population, or diurnal availability patterns. 10 . The computing device of claim 9 , wherein each region is generated such that each region comprises a user population having a similar diurnal cycle. 11 . The computing device of claim 8 , further comprising instructions that, when executed by the one or more processors, cause the computing device to associate a copy of a global machine-learned model with each region wherein the machine-learned model associated with the region is trained using federated learning based on users in the region. 12 . The computing device of claim 8 , further comprising instructions that, when executed by the one or more processors, cause the computing device to: compute a regularization term which represents a sum of distances, measured in parameter space, of the model associated with the region from at least one other model of at least one other region; and provide the regularization term to the plurality of selected user devices within the region, wherein the regularization term is added to the loss function for training of the current version of the machine-learned model associated with the region. 13 . The computing device of claim 8 , wherein generating the updated machine-learned model associated with the region based on the updated machine-learned model data further comprises performing multitask learning to bias the machine-learned model associated with the region toward at least one machine-learned model associated with at least one other region. 14 . The computing device of claim 8 , further comprising instructions that, when executed by the one or more processors, cause the computing device to: compute a centroid of at least one model of at least one other region in parameter space; and provide the centroid to the plurality of selected user devices within the region, wherein each of the plurality of selected user devices computes a distance, measured in parameter space, of the model associated with the region from the centroid as a regularization term that is added to the loss function for training of the current version of the machine-learned model associated with the region. 15 . A system comprising: a server; and a plurality of user devices; the server comprising: one or more processors; and one or more non-transitory computer-readable media that store instructions that, when executed by the one or more processors, cause the one or more processors to perform operations, the operations comprising: obtaining a plurality of regions based at least in part on temporal availability of user devices; selecting, from the plurality of user devices, a plurality of available user devices within a region; providing a current version of a machine-learned model associated with the region to the plur

Assignees

Google Llc

Inventors

Bonawitz Keith

Classifications

G06N20/00Primary
Machine learning · CPC title
G06N99/005Primary
Physics · mapped topic
G06F16/95Primary
Retrieval from the web · CPC title
G06N20/20
Ensemble learning · CPC title

Patent family

Related publications grouped by family.

View patent family 63794623

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2019171978A1 cover?: The present disclosure provides systems and methods for distributed training of machine learning models. In one example, a computer-implemented method is provided for training machine-learned models. The method includes obtaining, by one or more computing devices, a plurality of regions based at least in part on temporal availability of user devices; selecting a plurality of available user devi…
Who is the assignee on this patent?: Google Llc
What technology area does this patent fall under?: Primary CPC classification G06N20/00. Mapped technology areas include Physics.
When was this patent published?: Publication date Thu Jun 06 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).