Layerwise Multi-Objective Neural Architecture Search for Optimization of Machine-Learned Models

US2025238683A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2025238683-A1
Application numberUS-202418596110-A
CountryUS
Kind codeA1
Filing dateMar 5, 2024
Priority dateJan 23, 2024
Publication dateJul 24, 2025
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A search option can be selected for each model layer N of a plurality of model layers M. based on the model layer N, the one or more search options can be used to construct one or more candidate model layers for a model layer N+1 of the plurality of model layers. The one or more candidate model layers are respectively associated with one or more cost metrics. A cost metric is indicative of a cost associated with inclusion of the candidate model layer in an optimized machine-learned model. An optimized machine-learned model comprising M model layers can be constructed based on a cost function. The cost function maximizes an accuracy of the optimized machine-learned model subject to a sum of the cost metrics associated with each candidate model layer included in the optimized machine-learned model being less than a maximum cost.

First claim

Opening claim text (preview).

What is claimed is: 1 . A computing system for layer-wise neural architecture search with polynomial complexity to combinatorically construct an optimized machine-learned model, comprising: one or more processors; one or more non-transitory computer-readable media that store instructions that, when executed by the one or more processors, cause the participant computing device to perform operations, the operations comprising: iteratively constructing, for each model layer of a plurality of model layers, one or more candidate model layers; determining a cost metric for each of the candidate model layers, wherein a cost metric is indicative of a cost associated with inclusion of the candidate model layer in an optimized machine-learned model; for each model layer, grouping the one or more respective candidate model layers into one or more candidate layer clusters, wherein each candidate layer cluster is associated with a range of cost metrics; filtering at least one candidate model layer based the cost metric associated with the candidate model layer being greater than a threshold cost; and constructing an optimized machine-learned model comprising a candidate model layer for each of the plurality of layers based on a cost function, wherein the cost function maximizes a performance metric of the optimized machine-learned model subject to a sum of the cost metrics associated with each candidate model layer included in the optimized machine-learned model being less than a maximum cost. 2 . The computing system of claim 1 , wherein the cost associated with selection of the candidate layer comprises one or more constraints, comprising: a size of the candidate layer; a degree of energy consumption associated with the candidate layer; or an inference latency associated with the candidate layer. 3 . The computing system of claim 1 , wherein the operations further comprise: determining, by the computing system, that the performance metric for the optimized machine-learned model is less than a threshold degree of performance; and constructing, by the computing system, a second optimized machine-learned model comprising a candidate model layer for each of the plurality of layers based on the cost function,, wherein the second cost function maximizes the performance metric of the optimized machine-learned model subject to a sum of the cost metrics associated with each candidate model layer included in the optimized machine-learned model being less than a second maximum cost greater than the maximum cost. 4 . A computer-implemented method to implement layerwise optimization of machine-learned models, the method comprising: for each model layer N of a plurality of model layers M: selecting, by a computing system comprising one or more computing devices, one or more layer search options from a plurality of layer search options; based on the model layer N, using, by the computing system, the one or more search options to construct one or more candidate model layers for a model layer N+1 of the plurality of model layers, wherein the one or more candidate model layers are respectively associated with one or more cost metrics, wherein a cost metric is indicative of a cost associated with inclusion of the candidate model layer in an optimized machine-learned model; and constructing, by the computing system, an optimized machine-learned model comprising M model layers based on a cost function, wherein the cost function maximizes a performance metric of the optimized machine-learned model subject to a sum of the cost metrics associated with each candidate model layer included in the optimized machine-learned model being less than a maximum cost. 5 . The computer-implemented method of claim 4 , wherein the cost associated with selection of the candidate layer comprises one or more constraints, comprising: a size of the candidate layer; a degree of energy consumption associated with the candidate layer; or an inference latency associated with the candidate layer. 6 . The computer-implemented method of claim 4 , wherein the one or more candidate layers comprises a plurality of candidate layers respectively associated with a plurality of cost metrics, wherein each of the plurality of cost metrics is different, and wherein each candidate layer represents an optimal layer for a respectively associated cost metric; and wherein using the one or more search options to identify one or more candidate layers further comprises: grouping, by the computing system, the plurality of candidate layers into a plurality of candidate layer clusters, wherein each candidate layer cluster is associated with a range of cost metrics. 7 . The computer-implemented method of claim 4 , wherein constructing the optimized machine-learned model comprises, for each layer of the optimized machine-learned model: determining, by the computing system, a candidate layer cluster of the plurality of candidate layer clusters for the layer based on the cost function and the range of cost metrics associated with the candidate layer cluster; and selecting, by the computing system, a candidate layer from the candidate layer cluster based on the cost function and the cost metrics associated with one or more layers selected prior to the candidate layer. 8 . The computer-implemented method of claim 4 , wherein grouping the plurality of candidate layers into the plurality of candidate layer clusters further comprises storing, by the computing system, layer selection information indicative of the plurality of candidate layer clusters and each of the plurality of candidate layers; and wherein determining the candidate layer cluster of the plurality of candidate layer clusters comprises determining, by the computing system, the candidate layer cluster of the plurality of candidate layer clusters for the layer based on the cost function and layer selection information. 9 . The computer-implemented method of claim 4 , wherein constructing the optimized machine-learned model based on the cost function comprises, for a model layer N of the plurality of model layers M: determining, by the computing system for a candidate model layer for the model layer N, that the cost metrics associated with each candidate model layer constructed for the model layer N+1 based on the candidate model layer are greater than a maximum cost; and filtering, by the computing system, the candidate model layer from inclusion in the optimized machine-learned model. 10 . The computer-implemented method of claim 4 , wherein the method further comprises: determining, by the computing system, that the performance metric for the optimized machine-learned model is less than a threshold degree of performance; and constructing, by the computing system, a second optimized machine-learned model comprising M model layers based on a second cost function, wherein the second cost function maximizes an accuracy of the optimized machine-learned model subject to a sum of the cost metrics associated with each candidate model layer included in the optimized machine-learned model being less than a second maximum cost greater than the maximum cost. 11 . The computer-implemented method of claim 4 , wherein, prior to selecting the one or more search options from a plurality of layer search options, the method comprises receiving, by the computing system, an optimization request indicative of a quantity of layers M and the maximum cost. 12 . One or more non-transitory computer-readable media that store instructions that, when executed by the one or more processors, cause the participant computing device to perform operations, the operations comprisin

Assignees

Inventors

Classifications

  • Convolutional networks [CNN, ConvNet] · CPC title

  • G06N3/045Primary

    Combinations of networks · CPC title

  • modifying the architecture, e.g. adding, deleting or silencing nodes or connections · CPC title

  • G06N3/0985Primary

    Hyperparameter optimisation; Meta-learning; Learning-to-learn · CPC title

  • Architecture, e.g. interconnection topology · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2025238683A1 cover?
A search option can be selected for each model layer N of a plurality of model layers M. based on the model layer N, the one or more search options can be used to construct one or more candidate model layers for a model layer N+1 of the plurality of model layers. The one or more candidate model layers are respectively associated with one or more cost metrics. A cost metric is indicative of a co…
Who is the assignee on this patent?
Google Llc
What technology area does this patent fall under?
Primary CPC classification G06N3/045. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Jul 24 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).