Updating machine learning models across devices

US12094451B1 · US · B1

Patent metadata
FieldValue
Publication numberUS-12094451-B1
Application numberUS-202217677614-A
CountryUS
Kind codeB1
Filing dateFeb 22, 2022
Priority dateFeb 22, 2022
Publication dateSep 17, 2024
Grant dateSep 17, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system may create a localized machine learning model including one or more customized local parameter values using a global model and variance data. The localized machine learning model may be used by a device or cohort of devices to perform evaluations of data. The localized model may be trained based off a global model that is adjusted and then trained a certain number of steps, where the number of steps is based at least in part on the variance data. The variance data may include variance data from other device cohorts which is received from a remote device, which can also re-train the global model using the variance data and/or the localized machine learning model(s).

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method for updating machine learning models, the method comprising: determining first model data corresponding to a first machine learning model corresponding to a first function, the first model data comprising a first value corresponding to a first parameter; sending the first model data to a first device having a first characteristic; sending the first model data to a second device having a second characteristic different from the first characteristic; receiving first variance data corresponding to the first parameter, the first variance data based at least in part on a first range of values corresponding to a first plurality of devices having the first characteristic, wherein the first plurality of devices comprises the first device; receiving second variance data corresponding to the first parameter, the second variance data based at least in part on a second range of values corresponding to a second plurality of devices having the second characteristic; based at least in part on the first variance data and the second variance data, determining third variance data corresponding to the first parameter, the third variance data corresponding to at least the first plurality of devices and the second plurality of devices; sending the third variance data to the first device; sending the third variance data to the second device; based at least in part on the first variance data, the second variance data, and the first model data, determining second model data corresponding to a second machine learning model corresponding to the first function, the second model data comprising a second value corresponding to the first parameter; sending the second model data to the first device, wherein the first device is configured to perform processing of further input data using the second machine learning model; and sending the second model data to the second device. 2. The computer-implemented method of claim 1 , further comprising, by the first device: receiving first input data; processing the first input data using the first machine learning model to determine first output data; receiving second input data; processing the second input data using the first machine learning model to determine second output data; based at least in part on the first output data and the second output data, determining a first mean value corresponding to the first parameter; based at least in part on the first output data, the second output data, and the first mean value, determining standard deviation data corresponding to the first parameter; and determining the first variance data using the standard deviation data. 3. The computer-implemented method of claim 1 , further comprising: after sending the third variance data to the first device, receiving third model data corresponding to a third machine learning model corresponding to the first function, the third machine learning model trained for operation by devices having the first characteristic; and after sending the third variance data to the second device, receiving fourth model data corresponding to a fourth machine learning model corresponding to the first function, the fourth machine learning model trained for operation by devices having the second characteristic, wherein determining the second model data is further based at least in part on the third model data and the fourth model data. 4. The computer-implemented method of claim 3 , further comprising, by the first device: processing the first variance data and the second variance data to determine at least one value; applying the at least one value to the first model data to determine adjusted model data; processing the first variance data and the second variance data to determine a first number of training steps; and performing machine learning training by modifying the adjusted model data over the first number of training steps to determine the third model data. 5. A computer-implemented method for updating machine learning models, the method comprising: receiving first variance data corresponding to a first parameter of a first machine learning model, the first variance data corresponding to a first plurality of devices having a first characteristic, wherein the first plurality of devices comprises a first device; receiving second variance data corresponding to the first parameter, the second variance data corresponding to a second plurality of devices having a second characteristic; based at least in part on the first variance data and the second variance data, determining third variance data corresponding to the first parameter, the third variance data corresponding to at least the first plurality of devices and the second plurality of devices; sending the third variance data to at least one device corresponding to the first characteristic; sending the third variance data to a second device corresponding to the second characteristic; based at least in part on the first variance data, the second variance data, and the first machine learning model, determining an updated machine learning model; and sending updated model data representing the updated machine learning model to the first device and the second device, wherein the first device is configured to perform processing of further input data using the updated machine learning model. 6. The computer-implemented method of claim 5 , further comprising: after sending the third variance data to the first device, receiving first model data corresponding to a third machine learning model trained for operation by devices having the first characteristic; and after sending the third variance data to the second device, receiving second model data corresponding to a fourth machine learning model trained for operation by devices having the second characteristic, wherein determining the updated machine learning model is further based at least in part on the first model data and the second model data. 7. The computer-implemented method of claim 6 , further comprising, by the first device: processing the first variance data and the second variance data to determine at least one value; determining adjusted model data using the at least one value and the first machine learning model; and determining the third machine learning model from the adjusted model data. 8. The computer-implemented method of claim 6 , further comprising, by the first device: processing the first variance data and the second variance data to determine a first number of training steps; and training the third machine learning model using the first number of training steps. 9. The computer-implemented method of claim 5 , further comprising, by the first device: receiving first input data; processing the first input data using the first machine learning model to determine first output data; receiving second input data; processing the second input data using the first machine learning model to determine second output data; based at least in part on the first output data and the second output data, determining a first mean value corresponding to the first parameter; based at least in part on the first output data, the second output data, and the first mean value, determining standard deviation data corresponding to the first parameter; and determining the first variance data using the standard deviation data. 10. The computer-implemented method of claim 5 , further comprising: determining an estimated value for the first parameter corresponding to operation of the first machine learning model by at least a third device corresponding to a third characteristic; and determining the third variance data based at least in

Assignees

Inventors

Classifications

  • Architecture of speech synthesisers · CPC title

  • Detection of presence or absence of voice signals (switching of direction of transmission by voice frequency in two-way loud-speaking telephone systems H04M9/10) · CPC title

  • for estimating an emotional state · CPC title

  • Training · CPC title

  • Machine learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12094451B1 cover?
A system may create a localized machine learning model including one or more customized local parameter values using a global model and variance data. The localized machine learning model may be used by a device or cohort of devices to perform evaluations of data. The localized model may be trained based off a global model that is adjusted and then trained a certain number of steps, where the n…
Who is the assignee on this patent?
Amazon Tech Inc
What technology area does this patent fall under?
Primary CPC classification G06N20/20. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Sep 17 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).