Communication efficient machine learning of data across multiple sites

US11699080B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11699080-B2
Application numberUS-201816131150-A
CountryUS
Kind codeB2
Filing dateSep 14, 2018
Priority dateSep 14, 2018
Publication dateJul 11, 2023
Grant dateJul 11, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

In one embodiment, a service receives machine learning-based generative models from a plurality of distributed sites. Each generative model is trained locally at a site using unlabeled data observed at that site to generate synthetic unlabeled data that mimics the unlabeled data used to train the generative model. The service receives, from each of the distributed sites, a subset of labeled data observed at that site. The service uses the generative models to generate synthetic unlabeled data. The service trains a global machine learning-based model using the received subsets of labeled data received from the distributed sites and the synthetic unlabeled data generated by the generative models.

First claim

Opening claim text (preview).

What is claimed is: 1. A method comprising: receiving, at a service, machine learning-based generative models from a plurality of distributed sites, wherein each generative model is trained locally at a site using unlabeled data observed at that site to generate synthetic unlabeled data that mimics the unlabeled data used to train the generative model; receiving, at the service and from each of the distributed sites, a subset of labeled data observed at that site; using, by the service, the generative models to generate synthetic unlabeled data; and training, by the service, a global machine learning-based model using the received subsets of labeled data received from the distributed sites and the synthetic unlabeled data generated by the generative models. 2. The method as in claim 1 , wherein each generative model generates an amount of synthetic unlabeled data in proportion to the unlabeled data observed at the corresponding site at which the model was trained. 3. The method as in claim 1 , wherein the generative models are trained using labeled data having one or more labels that differ from a label that the global machine learning-based model uses. 4. The method as in claim 1 , further comprising: providing, by the service, the global machine learning-based model to one or more of the distributed sites, wherein the one or more distributed sites use the global machine learning-based model to select which labeled data is to be sent to the service. 5. The method as in claim 1 , further comprising: receiving, at the service, an unlabeled data sample from a particular one of the sites; using, by the service, the global machine learning-based model to apply a label to the received data sample; and providing, by the service, an indication of the label applied to the received data sample to the particular site from which the data sample was received. 6. The method as in claim 1 , wherein the labeled and unlabeled data observed at the sites comprise image data captured by cameras deployed at sites or network telemetry data. 7. The method as in claim 1 , further comprising: requesting, by the service, additional labeled data from one or more of the sites, based on a loss function associated with the machine learning-based model. 8. The method as in claim 1 , wherein the generative models are generative adversarial network (GAN) models. 9. The method as in claim 1 , wherein training the global machine learning-based model using the received subsets of labeled data received from the distributed sites and the synthetic unlabeled data generated by the generative models comprises: mapping the received subsets of labeled data to a feature space, wherein the global machine learning-based model is trained using semi-supervised learning within the feature space. 10. The method as in claim 1 , wherein training the global machine learning-based model comprises using an autoencoder on the synthetic data to generate features for the global machine learning-based model. 11. An apparatus, comprising: one or more network interfaces to communicate with a network; a processor coupled to the network interfaces and configured to execute one or more processes; and a memory configured to store a process executable by the processor, the process when executed configured to: receive machine learning-based generative models from a plurality of distributed sites, wherein each generative model is trained locally at a site using unlabeled data observed at that site to generate synthetic unlabeled data that mimics the unlabeled data used to train the generative model; receive, from each of the distributed sites, a subset of labeled data observed at that site; use the generative models to generate synthetic unlabeled data; and train a global machine learning-based model using the received subsets of labeled data received from the distributed sites and the synthetic unlabeled data generated by the generative models. 12. The apparatus as in claim 11 , wherein each generative model generates an amount of synthetic unlabeled data in proportion to the unlabeled data observed at the corresponding site at which the model was trained. 13. The apparatus as in claim 11 , wherein the distributed sites each comprise a local area network, and wherein the apparatus provides a cloud-based service to the local area networks. 14. The apparatus as in claim 11 , wherein the process when executed is further configured to: provide the global machine learning-based model to one or more of the distributed sites, wherein the one or more distributed sites use the global machine learning-based model to select which labeled data is to be sent to the service. 15. The apparatus as in claim 11 , wherein the process when executed is further configured to: receive an unlabeled data sample from a particular one of the sites; use the global machine learning-based model to apply a label to the received data sample; and provide an indication of the label applied to the received data sample to the particular site from which the data sample was received. 16. The apparatus as in claim 11 , wherein the subset of labeled data observed at the site are selected randomly for sending to the apparatus. 17. The apparatus as in claim 11 , wherein the generative models are trained using labeled data having one or more labels that differ from a label that the global machine learning-based model uses. 18. The apparatus as in claim 11 , wherein the apparatus trains the global machine learning-based model using the received subsets of labeled data received from the distributed sites and the synthetic unlabeled data generated by the generative models by: mapping the received subsets of labeled data to a feature space, wherein the global machine learning-based model is trained using semi-supervised learning within the feature space. 19. A tangible, non-transitory, computer-readable medium storing program instructions that cause a service to execute a process comprising: receiving, at the service, machine learning-based generative models from a plurality of distributed sites, wherein each generative model is trained locally at a site using unlabeled data observed at that site to generate synthetic unlabeled data that mimics the unlabeled data used to train the generative model; receiving, at the service and from each of the distributed sites, a subset of labeled data observed at that site; using, by the service, the generative models to generate synthetic unlabeled data; and training, by the service, a global machine learning-based model using the received subsets of labeled data received from the distributed sites and the synthetic unlabeled data generated by the generative models. 20. The computer-readable medium as in claim 19 , wherein the process further comprises: receiving, at the service, an unlabeled data sample from a particular one of the sites; using, by the service, the global machine learning-based model to apply a label to the received data sample; and providing, by the service, an indication of the label applied to the received data sample to the particular site from which the data sample was received.

Assignees

Inventors

Classifications

  • G06N3/088Primary

    Non-supervised learning, e.g. competitive learning · CPC title

  • of extracted features · CPC title

  • Machine learning · CPC title

  • Arrangements in telecontrol or telemetry systems for selectively calling a substation from a main station, in which substation desired apparatus is selected for applying a control signal thereto or for obtaining measured values therefrom · CPC title

  • characterised by the incorporation of unlabelled data, e.g. multiple instance learning [MIL], semi-supervised techniques using expectation-maximisation [EM] or naïve labelling · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11699080B2 cover?
In one embodiment, a service receives machine learning-based generative models from a plurality of distributed sites. Each generative model is trained locally at a site using unlabeled data observed at that site to generate synthetic unlabeled data that mimics the unlabeled data used to train the generative model. The service receives, from each of the distributed sites, a subset of labeled dat…
Who is the assignee on this patent?
Cisco Tech Inc
What technology area does this patent fall under?
Primary CPC classification G06N3/088. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 11 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).