Implementing scalable storage of personalized machine learning models

US2026037784A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2026037784-A1
Application numberUS-202418792519-A
CountryUS
Kind codeA1
Filing dateAug 1, 2024
Priority dateAug 1, 2024
Publication dateFeb 5, 2026
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present disclosure describes techniques for implementing scalable storage of personalized machine learning models. A plurality of personalized machine learning models are generated based on finetuning a base machine learning model. The base machine learning model comprises a first set of layers. Each of the plurality of personalized machine learning models comprises a second set of layers. A plurality of difference models are generated by computing differences between the first set of layers and the second set of layers. The plurality of difference models corresponds to the plurality of personalized machine learning models, respectively. The plurality of difference models are processed by compressing parameters of each of the plurality of difference models to generate a plurality of compressed models. The plurality of compressed models are stored for future use of the plurality of personalized machine learning models.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method of implementing scalable storage of personalized machine learning models, comprising: generating a plurality of personalized machine learning models based on finetuning a base machine learning model, wherein each of the plurality of personalized machine learning models corresponds to a particular user from a plurality of users, wherein the base machine learning model comprises a first set of layers, and wherein each of the plurality of personalized machine learning models comprises a second set of layers; generating a plurality of difference models by computing differences between the first set of layers and the second set of layers, wherein the plurality of difference models correspond to the plurality of personalized machine learning models, respectively; processing the plurality of difference models by compressing parameters of each of the plurality of difference models to generate a plurality of compressed models; and storing the plurality of compressed models for future use of the plurality of personalized machine learning models, wherein the plurality of compressed models minimize storage costs without affecting performance quality of the plurality of personalized machine learning models. 2 . The method of claim 1 , further comprising: generating each of the plurality of personalized machine learning models by finetuning the base machine learning model based on at least one image received from the particular user. 3 . The method of claim 1 , further comprising: generating the plurality of difference models by computing differences between matrices of the first set of layers and matrices of the second set of layers. 4 . The method of claim 1 , wherein the processing the plurality of difference models further comprises: determining whether a difference between a certain layer of the first set of layers and the second set of layers in each of the plurality of personalized machine learning models is less than a threshold; and dropping out the certain layer from one of the plurality of difference models corresponding to each of the plurality of personalized machine learning models in response to determining that the difference is less than the threshold. 5 . The method of claim 1 , wherein the compressing parameters of each of the plurality of difference models further comprises: decomposing the parameters of each of the plurality of difference models into low-rank matrices, wherein the parameters of each of the plurality of difference models comprise large high-rank matrices. 6 . The method of claim 5 , further comprising: decomposing each of the large high-rank matrices into two low-rank matrices using a singular value decomposition (SVD) algorithm. 7 . The method of claim 6 , further comprising: storing the two low-rank matrices for each layer of each of the plurality of difference models. 8 . The method of claim 1 , further comprising: recovering one of the plurality of personalized machine learning models by implementing a revered process on one of the plurality of compressed models, wherein the one of the plurality of compressed models corresponds the one of the plurality of personalized machine learning models. 9 . The method of claim 8 , further comprising: computing each of large high-rank matrices based on corresponding low-rank matrices stored for the one of the plurality of compressed models; and recovering the one of the plurality of personalized machine learning model by adding the large high-rank matrices back to the base machine learning model. 10 . A system for implementing scalable storage of personalized machine learning models, comprising: at least one processor; and at least one memory communicatively coupled to the at least one processor and comprising computer-readable instructions that upon execution by the at least one processor cause the at least one processor to perform operations comprising: generating a plurality of personalized machine learning models based on finetuning a base machine learning model, wherein each of the plurality of personalized machine learning models corresponds to a particular user from a plurality of users, wherein the base machine learning model comprises a first set of layers, and wherein each of the plurality of personalized machine learning models comprises a second set of layers; generating a plurality of difference models by computing differences between the first set of layers and the second set of layers, wherein the plurality of difference models correspond to the plurality of personalized machine learning models, respectively; processing the plurality of difference models by compressing parameters of each of the plurality of difference models to generate a plurality of compressed models; and storing the plurality of compressed models for future use of the plurality of personalized machine learning models, wherein the plurality of compressed models minimize storage costs without affecting performance quality of the plurality of personalized machine learning models. 11 . The system of claim 10 , the operations further comprising: generating each of the plurality of personalized machine learning models by finetuning the base machine learning model based on at least one image received from the particular user. 12 . The system of claim 10 , the operations further comprising: generating the plurality of difference models by computing differences between matrices of the first set of layers and matrices of the second set of layers. 13 . The system of claim 10 , wherein the processing the plurality of difference models further comprises: determining whether a difference between a certain layer of the first set of layers and the second set of layers in each of the plurality of personalized machine learning models is less than a threshold; and dropping out the certain layer from one of the plurality of difference models corresponding to each of the plurality of personalized machine learning models in response to determining that the difference is less than the threshold. 14 . The system of claim 10 , wherein the compressing parameters of each of the plurality of difference models further comprises: decomposing the parameters of each of the plurality of difference models into low-rank matrices, wherein the parameters of each of the plurality of difference models comprise large high-rank matrices. 15 . The system of claim 14 , the operations further comprising: decomposing each of the large high-rank matrices into two low-rank matrices using a singular value decomposition (SVD) algorithm; and storing the two low-rank matrices for each layer of each of the plurality of difference models. 16 . A non-transitory computer-readable storage medium, storing computer-readable instructions that upon execution by a processor cause the processor to implement operations comprising: generating a plurality of personalized machine learning models based on finetuning a base machine learning model, wherein each of the plurality of personalized machine learning models corresponds to a particular user from a plurality of users, wherein the base machine learning model comprises a first set of layers, and wherein each of the plurality of personalized machine learning models comprises a second set of layers; generating a plurality of difference models by computing differences between the first set of layers and the second set of layers, wherein the plurality of difference models correspond to the plurality of personalized machine learning models, respectively; proc

Assignees

Inventors

Classifications

  • G06N3/045Primary

    Combinations of networks · CPC title

  • G06N3/0495Primary

    Quantised networks; Sparse networks; Compressed networks · CPC title

  • modifying the architecture, e.g. adding, deleting or silencing nodes or connections · CPC title

  • Transfer learning · CPC title

  • G06N20/00Primary

    Machine learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2026037784A1 cover?
The present disclosure describes techniques for implementing scalable storage of personalized machine learning models. A plurality of personalized machine learning models are generated based on finetuning a base machine learning model. The base machine learning model comprises a first set of layers. Each of the plurality of personalized machine learning models comprises a second set of layers. …
Who is the assignee on this patent?
Lemon Inc
What technology area does this patent fall under?
Primary CPC classification G06N3/045. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Feb 05 2026 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).