What technology area does this patent fall under?

Primary CPC classification G06N20/20. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Sep 17 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B1). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Updating machine learning models across devices

US12094451B1 · US · B1

Patent metadata
Field	Value
Publication number	US-12094451-B1
Application number	US-202217677614-A
Country	US
Kind code	B1
Filing date	Feb 22, 2022
Priority date	Feb 22, 2022
Publication date	Sep 17, 2024
Grant date	Sep 17, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system may create a localized machine learning model including one or more customized local parameter values using a global model and variance data. The localized machine learning model may be used by a device or cohort of devices to perform evaluations of data. The localized model may be trained based off a global model that is adjusted and then trained a certain number of steps, where the number of steps is based at least in part on the variance data. The variance data may include variance data from other device cohorts which is received from a remote device, which can also re-train the global model using the variance data and/or the localized machine learning model(s).

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method for updating machine learning models, the method comprising: determining first model data corresponding to a first machine learning model corresponding to a first function, the first model data comprising a first value corresponding to a first parameter; sending the first model data to a first device having a first characteristic; sending the first model data to a second device having a second characteristic different from the first characteristic; receiving first variance data corresponding to the first parameter, the first variance data based at least in part on a first range of values corresponding to a first plurality of devices having the first characteristic, wherein the first plurality of devices comprises the first device; receiving second variance data corresponding to the first parameter, the second variance data based at least in part on a second range of values corresponding to a second plurality of devices having the second characteristic; based at least in part on the first variance data and the second variance data, determining third variance data corresponding to the first parameter, the third variance data corresponding to at least the first plurality of devices and the second plurality of devices; sending the third variance data to the first device; sending the third variance data to the second device; based at least in part on the first variance data, the second variance data, and the first model data, determining second model data corresponding to a second machine learning model corresponding to the first function, the second model data comprising a second value corresponding to the first parameter; sending the second model data to the first device, wherein the first device is configured to perform processing of further input data using the second machine learning model; and sending the second model data to the second device. 2. The computer-implemented method of claim 1 , further comprising, by the first device: receiving first input data; processing the first input data using the first machine learning model to determine first output data; receiving second input data; processing the second input data using the first machine learning model to determine second output data; based at least in part on the first output data and the second output data, determining a first mean value corresponding to the first parameter; based at least in part on the first output data, the second output data, and the first mean value, determining standard deviation data corresponding to the first parameter; and determining the first variance data using the standard deviation data. 3. The computer-implemented method of claim 1 , further comprising: after sending the third variance data to the first device, receiving third model data corresponding to a third machine learning model corresponding to the first function, the third machine learning model trained for operation by devices having the first characteristic; and after sending the third variance data to the second device, receiving fourth model data corresponding to a fourth machine learning model corresponding to the first function, the fourth machine learning model trained for operation by devices having the second characteristic, wherein determining the second model data is further based at least in part on the third model data and the fourth model data. 4. The computer-implemented method of claim 3 , further comprising, by the first device: processing the first variance data and the second variance data to determine at least one value; applying the at least one value to the first model data to determine adjusted model data; processing the first variance data and the second variance data to determine a first number of training steps; and performing machine learning training by modifying the adjusted model data over the first number of training steps to determine the third model data. 5. A computer-implemented method for updating machine learning models, the method comprising: receiving first variance data corresponding to a first parameter of a first machine learning model, the first variance data corresponding to a first plurality of devices having a first characteristic, wherein the first plurality of devices comprises a first device; receiving second variance data corresponding to the first parameter, the second variance data corresponding to a second plurality of devices having a second characteristic; based at least in part on the first variance data and the second variance data, determining third variance data corresponding to the first parameter, the third variance data corresponding to at least the first plurality of devices and the second plurality of devices; sending the third variance data to at least one device corresponding to the first characteristic; sending the third variance data to a second device corresponding to the second characteristic; based at least in part on the first variance data, the second variance data, and the first machine learning model, determining an updated machine learning model; and sending updated model data representing the updated machine learning model to the first device and the second device, wherein the first device is configured to perform processing of further input data using the updated machine learning model. 6. The computer-implemented method of claim 5 , further comprising: after sending the third variance data to the first device, receiving first model data corresponding to a third machine learning model trained for operation by devices having the first characteristic; and after sending the third variance data to the second device, receiving second model data corresponding to a fourth machine learning model trained for operation by devices having the second characteristic, wherein determining the updated machine learning model is further based at least in part on the first model data and the second model data. 7. The computer-implemented method of claim 6 , further comprising, by the first device: processing the first variance data and the second variance data to determine at least one value; determining adjusted model data using the at least one value and the first machine learning model; and determining the third machine learning model from the adjusted model data. 8. The computer-implemented method of claim 6 , further comprising, by the first device: processing the first variance data and the second variance data to determine a first number of training steps; and training the third machine learning model using the first number of training steps. 9. The computer-implemented method of claim 5 , further comprising, by the first device: receiving first input data; processing the first input data using the first machine learning model to determine first output data; receiving second input data; processing the second input data using the first machine learning model to determine second output data; based at least in part on the first output data and the second output data, determining a first mean value corresponding to the first parameter; based at least in part on the first output data, the second output data, and the first mean value, determining standard deviation data corresponding to the first parameter; and determining the first variance data using the standard deviation data. 10. The computer-implemented method of claim 5 , further comprising: determining an estimated value for the first parameter corresponding to operation of the first machine learning model by at least a third device corresponding to a third characteristic; and determining the third variance data based at least in

Assignees

Amazon Tech Inc

Inventors

Classifications

G10L13/047
Architecture of speech synthesisers · CPC title
G10L25/78
Detection of presence or absence of voice signals (switching of direction of transmission by voice frequency in two-way loud-speaking telephone systems H04M9/10) · CPC title
G10L25/63
for estimating an emotional state · CPC title
G10L15/063
Training · CPC title
G06N20/00
Machine learning · CPC title

Patent family

Related publications grouped by family.

View patent family 92716311

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12094451B1 cover?: A system may create a localized machine learning model including one or more customized local parameter values using a global model and variance data. The localized machine learning model may be used by a device or cohort of devices to perform evaluations of data. The localized model may be trained based off a global model that is adjusted and then trained a certain number of steps, where the n…
Who is the assignee on this patent?: Amazon Tech Inc
What technology area does this patent fall under?: Primary CPC classification G06N20/20. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Sep 17 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).

How to read this patent

Abstract

First claim

Assignees

Inventors

Classifications

Patent family

External sources

Related patents

Deep multi-channel acoustic modeling

Machine Learning in Edge Analytics

Systems and Methods of Distributed Optimization

Personalized Machine Learning System

Learning model selection in a distributed network

Frequently asked questions