Inference model selection method considering task request rate

US2025094840A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2025094840-A1
Application numberUS-202418443445-A
CountryUS
Kind codeA1
Filing dateFeb 16, 2024
Priority dateSep 14, 2023
Publication dateMar 20, 2025
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Disclosed is an inference model selection method and apparatus considering a task request rate. The inference model selection method is performed by a computing device including a processor and includes monitoring computing resources of the computing device; receiving a task; selecting an inference model to perform inference for the received task; inputting the task into a queue of the selected inference model; and performing an inference operation.

First claim

Opening claim text (preview).

What is claimed is: 1 . A n inference model selection method performed by a computing device comprising a processor, the interference model selection method comprising: monitoring computing resources of the computing device; receiving a task; selecting an inference model to perform inference for the received task; inputting the task into a queue of the selected inference model; and performing an inference operation. 2 . The inference model selection method of claim 1 , wherein the monitoring is periodically performed at predetermined periods. 3 . The inference model selection method of claim 2 , wherein the selecting of the inference model comprises selecting an appropriate inference model for tasks received over the predetermined period, and the inference model includes at least one Single-task Learning (STL) model and at least one Multi-task Learning (MTL) model. 4 . The inference model selection method of claim 3 , wherein the selecting of the inference model comprises solving a Q-function derived as a result of transforming a Markov Decision Process (MDP) model to Q-Learning, and the Q-function is represented as Q,(S,A)=max π [Q π (S,A)], where S denotes a state space, A denotes an action space, and π denotes a policy. 5 . The inference model selection method of claim 4 , wherein the performing of the inference operation comprises performing the inference operation by inputting a task input to a queue to the STL model without a waiting time if the STL model is selected. 6 . The inference model selection method of claim 5 , wherein the performing of the inference operation comprises performing the inference operation by, when the queue is full of tasks, inputting the task input to the queue to the MTL model if the MTL model is selected. 7 . The inference model selection method of claim 5 , wherein the performing of the inference operation comprises performing the inference operation by, when a sum of a time elapsed from a point in time at which at least one task stored in the queue is received and a computing time required for inference is equal to a value acquired by subtracting a predetermined period of time from a required delay time, inputting all the tasks stored in the queue to the MTL model, if the MTL model is selected. 8 . The inference model selection method of claim 5 , wherein the performing of the inference operation comprises performing the inference operation by, when a sum of a waiting time of a task in the queue and a computing time required for inference is equal to a value acquired by subtracting a predetermined period of time from a required delay time, inputting all the tasks stored in the queue to the MTL model, if the MTL model is selected. 9 . The inference model selection method of claim 8 , further comprising: transmitting an inference result.

Assignees

Inventors

Classifications

  • Probabilistic graphical models, e.g. probabilistic networks · CPC title

  • Reinforcement learning · CPC title

  • G06N5/04Primary

    Inference or reasoning models · CPC title

  • based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO] · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2025094840A1 cover?
Disclosed is an inference model selection method and apparatus considering a task request rate. The inference model selection method is performed by a computing device including a processor and includes monitoring computing resources of the computing device; receiving a task; selecting an inference model to perform inference for the received task; inputting the task into a queue of the selected…
Who is the assignee on this patent?
Univ Korea Res & Bus Found
What technology area does this patent fall under?
Primary CPC classification G06N5/04. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Mar 20 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).