Personalized Federated Learning for Assistant Systems

US2021117780A1 · US · A1

Patent metadata
Publication number: US-2021117780-A1
Application number: US-202016815990-A
Country: US
Kind code: A1
Filing date: Mar 11, 2020
Priority date: Oct 18, 2019
Publication date: Apr 22, 2021
Grant date:

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

In one embodiment, a method includes receiving, by a first client system, from one or more remote servers, a current version of a global neural network model including multiple federated model parameters, accessing, from a local data store, multiple examples and a local personalization model including multiple local model parameters, wherein each of the examples includes one or more features and one or more labels, training the global neural network model and the local personalization model together on the examples to generate multiple updated federated model parameters and multiple updated local model parameters, storing, in the local data store, the trained local personalization model including the updated local model parameters, and sending, to one or more of the remote servers, the trained global neural network model including the updated federated model parameters.
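The round described in the abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the patented implementation: the "global model" is reduced to a linear weight vector, the "local personalization model" to a single per-user bias, and the names `LocalStore`, `client_round`, `lr`, and `epochs` are hypothetical. Squared error and plain SGD are chosen here only for brevity.

```python
from dataclasses import dataclass, field


@dataclass
class LocalStore:
    """Hypothetical on-device data store: examples plus the local model."""
    examples: list  # list of (features, label) pairs
    local_params: dict = field(default_factory=lambda: {"bias": 0.0})


def client_round(federated_params, store, lr=0.1, epochs=2000):
    """One client round: train both parameter sets together on local examples."""
    w = list(federated_params)      # copy of the received federated parameters
    b = store.local_params["bias"]  # local personalization parameter
    for _ in range(epochs):
        for x, y in store.examples:
            pred = sum(wi * xi for wi, xi in zip(w, x)) + b  # candidate label
            err = pred - y                                   # measure of error
            # SGD step on squared error, applied to both parameter sets.
            w = [wi - lr * err * xi for wi, xi in zip(w, x)]
            b -= lr * err
    store.local_params["bias"] = b  # stored locally, not returned
    return w                        # only federated parameters go to the server


# Toy data (assumed): three examples with two features each.
store = LocalStore(examples=[([1.0, 0.0], 3.0), ([0.0, 1.0], 1.0), ([1.0, 1.0], 3.0)])
updated = client_round([0.0, 0.0], store)
```

In this sketch the return value stands in for "sending" the trained global model, while the bias never leaves `store`. The toy examples are consistent with weights (2, 0) and a local bias of 1, so training should approach those values.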

First claim

Opening claim text (preview).

What is claimed is:

1. A method comprising, by a first client system: receiving, from one or more remote servers, a current version of a global neural network model comprising a plurality of federated model parameters; accessing, from a local data store, a plurality of examples and a local personalization model comprising a plurality of local model parameters, wherein each of the plurality of examples comprises one or more features and one or more labels; training the global neural network model and the local personalization model together on the plurality of examples to generate a plurality of updated federated model parameters and a plurality of updated local model parameters; storing, in the local data store, the trained local personalization model comprising the plurality of updated local model parameters; and sending, to one or more of the remote servers, the trained global neural network model comprising the plurality of updated federated model parameters.

2. The method of claim 1, wherein the global neural network model is configured to generate, responsive to a first example being input into the global neural network model, one or more candidate labels corresponding to the first example.

3. The method of claim 1, wherein the global neural network model is a natural-language generation model, and wherein, for each of the plurality of examples, at least one of the labels is a linguistic response comprising one or more n-grams.

4. The method of claim 1, wherein the global neural network model is a data classification model, and wherein, for each of the plurality of examples, at least one of the labels is a data classification associated with the example.

5. The method of claim 1, wherein the global neural network model and the local personalization model are iteratively trained together using Stochastic Gradient Descent (SGD).

6. The method of claim 1, wherein training the global neural network model and the local personalized model together on the plurality of examples comprises: inputting each of the plurality of examples into the global neural network model; generating, for each of the plurality of examples, one or more candidate labels based on the input example, one or more of the local model parameters, and one or more of the federated model parameters; and generating the plurality of updated federated model parameters and the plurality of updated local model parameters based on the generated candidate labels.

7. The method of claim 1, wherein the plurality of updated federated model parameters and the plurality of updated local model parameters are generated based on, for each of the plurality of examples, a determination of whether one or more of the candidate labels generated by the global neural network model matches one or more of the labels.

8. The method of claim 1, wherein the plurality of updated federated model parameters and the plurality of updated local model parameters are generated based on, for each of the plurality of examples, a measure of error between one or more of the candidate labels generated by the global neural network model and one or more of the labels.

9. The method of claim 1, wherein one or more of the local model parameters correspond to one or more respective user representations in a user representation matrix.

10. The method of claim 1, wherein training the global neural network model and the local personalization model together on the plurality of examples is based at least in part on one or more respective user characteristics associated with the first client system.

11. The method of claim 10, wherein the user characteristics associated with the first client system comprise one or more of linguistic fluency, location, age, gender, occupation, income, marital status, number of children, or number of relationships.

12. The method of claim 1, wherein one or more of the local model parameters correspond to one or more respective linguistic characteristics associated with the first client system.

13. The method of claim 12, wherein the linguistic characteristics associated with the first client system comprise one or more of linguistic formality, punctuation usage, emoticon usage, or grammatical style.

14. The method of claim 1, wherein one or more of the local model parameters correspond to a probability that an example stored on the first client system is a linguistic conversation having greater than a threshold number of participants.

15. The method of claim 1, wherein the local personalization model further comprises a personalized portion of the global neural network model.

16. The method of claim 1, wherein the plurality of examples comprises, for each of a plurality of task categories, a set of examples associated with the task category, and wherein the training of the global neural network model and the local personalization model is repeated for each set of examples.

17. The method of claim 1, wherein the updated local model parameters comprise one or more new local model parameters created by the trained global neural network model.

18. The method of claim 17, wherein the trained global neural network model sent to the one or more remote servers is configured to modify one or more local personalization models of one or more other client systems, respectively, to include the one or more new local model parameters.

19. A system comprising: one or more processors; and a memory coupled to the processors comprising instructions executable by the processors, the processors operable when executing the instructions to: receive, from one or more remote servers, a current version of a global neural network model comprising a plurality of federated model parameters; access, from a local data store, a plurality of examples and a local personalization model comprising a plurality of local model parameters, wherein each of the plurality of examples comprises one or more features and one or more labels; train the global neural network model and the local personalization model together on the plurality of examples to generate a plurality of updated federated model parameters and a plurality of updated local model parameters; store, in the local data store, the trained local personalization model comprising the plurality of updated local model parameters; and send, to one or more of the remote servers, the trained global neural network model comprising the plurality of updated federated model parameters.

20. One or more computer-readable non-transitory storage media embodying software that is operable when executed to: receive, from one or more remote servers, a current version of a global neural network model comprising a plurality of federated model parameters; access, from a local data store, a plurality of examples and a local personalization model comprising a plurality of local model parameters, wherein each of the plurality of examples comprises one or more features and one or more labels; train the global neural network model and the local personalization model together on the plurality of examples to generate a plurality of updated federated model parameters and a plurality of updated local model parameters; store, in the local data store, the trained local personalization model comprising the plurality of updated local model parameters; and send, to one or more of the remote servers, the trained global neural network model comprising the plurality of updated federated model parameters.
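Claims 7 and 9 describe updates driven by whether a candidate label matches the true label, with local parameters held as a user representation. One way this might look, purely as an illustrative sketch: a perceptron-style rule (a swapped-in technique, not named in the claims) over per-class federated weights plus a per-user, per-class offset. The names `predict`, `train_on_matches`, `user_matrix`, and the "formal"/"casual" classes are assumptions made for the example.

```python
def predict(W, u, x):
    """Candidate label: class with the highest federated score plus user offset."""
    scores = {c: sum(wi * xi for wi, xi in zip(W[c], x)) + u[c] for c in W}
    return max(scores, key=scores.get)  # ties resolve to the first class listed


def train_on_matches(W, u, examples, epochs=10):
    """Update both parameter sets only when the candidate label misses the label."""
    W = {c: list(ws) for c, ws in W.items()}  # copy federated parameters
    u = dict(u)                               # copy this user's representation row
    for _ in range(epochs):
        mistakes = 0
        for x, label in examples:
            guess = predict(W, u, x)
            if guess == label:
                continue                      # match: leave parameters unchanged
            mistakes += 1
            for i, xi in enumerate(x):        # federated update
                W[label][i] += xi
                W[guess][i] -= xi
            u[label] += 1.0                   # local (personalized) update
            u[guess] -= 1.0
        if mistakes == 0:
            break                             # every example now matches
    return W, u


classes = ["formal", "casual"]
W0 = {c: [0.0, 0.0] for c in classes}
user_matrix = {"user_42": {c: 0.0 for c in classes}}  # one row per user (assumed)
examples = [([1.0, 0.0], "formal"), ([0.0, 1.0], "casual"), ([0.0, 0.0], "casual")]
W1, user_matrix["user_42"] = train_on_matches(W0, user_matrix["user_42"], examples)
```

The third example has an all-zero feature vector, so the federated weights cannot separate it; in this sketch only the user's own offsets can, which is the role the local parameters play.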

Assignees

Inventors

Classifications

  • G06Q10/40 · Primary

    Business processes related to social networking or social networking services · CPC title

  • using neural networks · CPC title

  • using classification, e.g. of video objects · CPC title

  • Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items (segmenting video sequences G06V20/49) · CPC title

  • Facial expression recognition · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2021117780A1 cover?
In one embodiment, a method includes receiving, by a first client system, from one or more remote servers, a current version of a global neural network model including multiple federated model parameters, accessing, from a local data store, multiple examples and a local personalization model including multiple local model parameters, wherein each of the examples includes one or more features an…
Who is the assignee on this patent?
Facebook Tech Llc
What technology area does this patent fall under?
Primary CPC classification G06Q10/40. Mapped technology areas include Physics.
When was this patent published?
Publication date: Apr 22, 2021 (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).