Dynamically selecting artificial intelligence models and hardware environments to execute tasks

US2025238333A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2025238333-A1
Application numberUS-202418732320-A
CountryUS
Kind codeA1
Filing dateJun 3, 2024
Priority dateJan 22, 2024
Publication dateJul 24, 2025
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present disclosure relates to systems, non-transitory computer-readable media, and methods for selecting machine-learning models and hardware environments for executing a task. In particular, in one or more embodiments, the disclosed systems select a designated machine-learning model for executing a task based on workload features of the task and task routing metrics for a plurality of machine-learning models. In addition, in one or more embodiments, the disclosed systems select a designated hardware environment for executing the task based on workload features for the task and task routing metrics for a plurality of hardware environments. In some embodiments, the disclosed systems select a fallback machine-learning model and a fallback hardware environment for executing the task if the designated machine-learning model or designated hardware environment are unavailable. Moreover, in one or more embodiments, the disclosed systems can pause and initiate tasks based on bandwidth availability.

First claim

Opening claim text (preview).

1 . A computer-implemented method comprising: receiving, from a device connected by a network to a content management system, workload data requesting execution of a task using a machine-learning model; extracting, from the workload data, workload features defining characteristics of the task; selecting, based on the workload features, a primary machine-learning model for executing the task and a fallback machine-learning model for executing the task if the primary machine-learning model is unavailable; and based on detecting that the primary machine-learning model is unavailable prior to executing the task, providing the workload data to a computing environment of the fallback machine-learning model for executing the task. 2 . The computer-implemented method of claim 1 , wherein extracting workload features defining characteristics of the task further comprises determining an estimated processing requirement and an estimated storage requirement for executing the task. 3 . The computer-implemented method of claim 1 , wherein selecting the primary machine-learning model and the fallback machine-learning model further comprises: generating optimization metrics for each machine-learning model of a plurality of machine-learning models; and selecting the primary machine-learning model and the fallback machine-learning model based on the optimization metrics. 4 . The computer-implemented method of claim 1 , wherein selecting the primary machine-learning model and the fallback machine-learning model further comprises: determining a model state for a plurality of machine-learning models; selecting the primary machine-learning model based in part on a first model state of the primary machine-learning model; and selecting the fallback machine-learning model based in part on a second model state of the fallback machine-learning model. 5 . The computer-implemented method of claim 1 , wherein selecting the primary machine-learning model and the fallback machine-learning model further comprises: determining, based on the workload features, a financial cost metric, an execution time metric, an execution cost metric, or a model fit metric for executing the task on each machine-learning models in a plurality of machine-learning models; and selecting the primary machine-learning model and the fallback machine-learning model is based on two or more of the financial cost metric, the execution time metric, the execution cost metric, or the model fit metric. 6 . The computer-implemented method of claim 1 , further comprising: selecting an additional machine-learning model for executing the task based on the workload features; and based on detecting that the fallback machine-learning model is unavailable, providing the workload data to a computing environment of the additional machine-learning model for executing the task. 7 . The computer-implemented method of claim 1 , further comprising selecting a trained machine-learning model as the primary machine-learning model and a third-party trained machine-learning model as the fallback machine-learning model. 8 . The computer-implemented method of claim 1 , further comprising: detecting that the primary machine-learning model is unavailable based on identifying that a first hardware environment associated with the primary machine-learning model is unavailable to execute the task; and providing the workload data to a second hardware environment associated with the fallback machine-learning model based on identifying that the second hardware environment is available to execute the task. 9 . The computer-implemented method of claim 1 , wherein selecting the primary machine-learning model and the fallback machine-learning model further comprises: performing a software domain analysis of each machine-learning model of a plurality of machine-learning models for executing the task; and selecting the primary machine-learning model and the fallback machine-learning model based on the software domain analysis. 10 . A non-transitory computer-readable medium storing instructions that, when executed by at least one processor, cause a computer system to: receive, from a device connected by a network to a content management system, workload data requesting execution of a task using a machine-learning model; extract, from the workload data, workload features defining characteristics of the task; determine task routing metrics for a plurality of machine-learning models hosted in respective network environments; select, based on the workload features and the task routing metrics, a primary machine-learning model for executing the task and a fallback machine-learning model from the plurality of machine-learning models for executing the task if the primary machine-learning model is unavailable; and based on detecting that the primary machine-learning model is unavailable prior to executing the task, provide the workload data to a computing environment of the fallback machine-learning model for executing the task. 11 . The non-transitory computer-readable medium of claim 10 , further comprising instructions that, when executed by the at least one processor, cause the computer system to select the primary machine-learning model and the fallback machine-learning model by: determining task routing metrics for the plurality of machine-learning models by determining a financial cost metric, an execution time metric, an execution cost metric, or a model fit metric for executing the task on each machine-learning models of the plurality of machine-learning models; and selecting, from the plurality of machine-learning models, the primary machine-learning model and the fallback machine-learning model based on two or more of the financial cost metric, the execution time metric, the execution cost metric, or the model fit metric for executing the task. 12 . The non-transitory computer-readable medium of claim 10 , further comprising instructions that, when executed by the at least one processor, cause the computer system to select the primary machine-learning model and the fallback machine-learning model by: generating, based on the workload data and the task routing metrics, an optimization metric for each machine-learning model of the plurality of machine-learning models; and selecting the primary machine-learning model and the fallback machine-learning model from the plurality of machine-learning models based on a first optimization metric for the primary machine-learning model and a second optimization metric for the fallback machine-learning model. 13 . The non-transitory computer-readable medium of claim 10 , further comprising instructions that, when executed by the at least one processor, cause the computer system to select the primary machine-learning model and the fallback machine-learning model by: accessing historical user feedback data about executing tasks using one or more machine-learning models of the plurality of machine-learning models; determining a historical quality metric for each machine-learning model of the plurality of machine-learning models based on the historical user feedback data; and selecting the primary machine-learning model and the fallback machine-learning model from the plurality of machine-learning models based in part on a first historical quality metric for the primary machine-learning model and a second historical quality metric for the fallback machine-learning model. 14 . The non-transitory computer-readable medium of claim 10 , further comprising instructions that, when executed by the at least one processor, cause the computer system to select the pr

Assignees

Inventors

Classifications

  • Task life-cycle, e.g. stopping, restarting, resuming execution (G06F9/4881 takes precedence) · CPC title

  • the resources being hardware resources other than CPUs, Servers and Terminals · CPC title

  • Packet rate · CPC title

  • H04L41/16Primary

    using machine learning or artificial intelligence · CPC title

  • Offload · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2025238333A1 cover?
The present disclosure relates to systems, non-transitory computer-readable media, and methods for selecting machine-learning models and hardware environments for executing a task. In particular, in one or more embodiments, the disclosed systems select a designated machine-learning model for executing a task based on workload features of the task and task routing metrics for a plurality of mach…
Who is the assignee on this patent?
Dropbox Inc
What technology area does this patent fall under?
Primary CPC classification H04L41/16. Mapped technology areas include Electricity.
When was this patent published?
Publication date Thu Jul 24 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).