Edge network computing system with deep reinforcement learning based task scheduling

US12223336B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12223336-B2
Application numberUS-202117490861-A
CountryUS
Kind codeB2
Filing dateSep 30, 2021
Priority dateSep 30, 2021
Publication dateFeb 11, 2025
Grant dateFeb 11, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An edge network computing system includes: a plurality of terminal devices; a plurality of edge servers connected to the terminal device through an access network; and a plurality of cloud servers connected to the plurality of edge servers through a core network. Each edge server is configured to: receive a plurality of computing tasks originated from one of the plurality of terminal devices; use a deep Q-learning neural network (DQN) with experience replay to select one of the plurality of could servers to offload a portion of the plurality of computing tasks; and send the portion of the plurality of computing tasks to the selected cloud server and forward results of the portion of the plurality of computing tasks received from the selected cloud server to the originating terminal device.

First claim

Opening claim text (preview).

What is claimed is: 1. An edge network computing system, comprising: a plurality of terminal devices; a plurality of edge servers comprising at least one processor connected to the plurality of terminal devices through an access network; and a plurality of cloud servers connected to the plurality of edge servers through a core network, wherein each edge server is configured to: receive a plurality of computing tasks originated from one of the plurality of terminal devices; use a deep Q-learning neural network (DQN) with experience replay to select one of the plurality of cloud servers to offload a portion of the plurality of computing tasks, such that each edge server selects an offloading cloud server without being correlated to selection of the offloading cloud server by other edge servers; send the portion of the plurality of computing tasks to the selected cloud server; and execute the portion of the plurality of computing tasks and forward results of the portion of the plurality of computing tasks received from the selected cloud server to the originating terminal device; wherein: a first time is a time for completing a remaining portion of the plurality of computing tasks performed by one of the plurality of edge servers; a second time is a time for offloading and completing the portion of the plurality of computing tasks offloaded to the selected cloud server, and includes an uplink access latency, a data transmission time for transmitting the portion of the plurality of computing tasks from one of the plurality of edge servers to the selected cloud server, a computing time for completing the portion of the plurality of computing tasks by the selected cloud server, and a downlink access latency; when the plurality of computing tasks are independent of each other, a total time for completing the plurality of computing tasks is the greater of the first time and the second time, and an optimal task partitioning is achieved when the first time is equal to the second time; when the plurality of computing tasks are required to be performed sequentially, the total time for completing the plurality of computing tasks includes the greater of the first time and a sum of the uplink access latency and the data transmission time, the computing time, and the downlink access latency, and the optimal task partitioning is achieved when the first time is equal to the sum of the uplink access latency and the data transmission time; each edge server includes an agent of multi-agent reinforcement learning in a Markov decision process (MDP), and the agent learns from an environment, and takes action to maximize a long-term accumulative reward; a state space includes the uplink access latency, the downlink access latency, and a data rate of a backhaul link between the edge server and the cloud server, observed at a current time slot; an action space includes the plurality of cloud servers as candidates for task offloading, and a task partitioning ratio having a number of discrete values; and a reward includes minus the total time for completing the plurality of computing tasks. 2. The system according to claim 1 , wherein when using the deep Q-learning neural network (DQN) with experience replay to select the cloud server to offload the portion of the plurality of computing tasks to the cloud server, the edge server is configured to: each time after receiving a plurality of computing tasks, use the DQN to determine a cloud server and a task partition ratio to offload some of the plurality of computing tasks to the cloud server; store each experience including a current state, a current action, a current reward, and a next state in a replay memory; randomly select a minibatch of past experiences from the replay memory to input to the DQN and a target network; optimize the DQN with updated weights based on Q-values from the DQN and Q-values from the target network; and periodically update weights of the target network with the weights of the DQN. 3. The system according to claim 2 , wherein: the DQN includes an input layer of neurons representing the state space and the action space, and an output layer of Q-values for the state space and the action space at the input layer; training the DQN includes mapping inputs of state-action pairs in the minibatch of past experiences to their corresponding Q-values by approximating Q-values and minimizing a loss function of mean square error between the Q-values outputted by the DON and the Q-values outputted by the target network; and the target network includes a structure same as the DQN and weights periodically updated by the weights of the DQN obtained from training. 4. The system according to claim 2 , wherein: each experience stored in the replay memory includes an importance weight to naturally decay obsolete data. 5. The system according to claim 1 , wherein: the edge network computing system is a mobile edge computing system; the plurality of terminal devices are mobile devices; and the plurality of mobile devices are connected to the plurality of edge servers through a radio access network. 6. A task offloading method for an edge server comprising at least one processor in an edge network computing system, comprising: receiving a plurality of computing tasks from a terminal device; using a deep Q-learning neural network (DQN) with experience replay to select a cloud server to offload a portion of the plurality of computing tasks; sending the portion of the plurality of computing tasks to the cloud server; and executing the portion of the plurality of computing tasks and forwarding results of the portion of the plurality of computing tasks received from the cloud server to the terminal device; wherein: a first time is a time for completing a remaining portion of the plurality of computing tasks performed by one of the plurality of edge servers; a second time is a time for offloading and completing the portion of the plurality of computing tasks offloaded to the selected cloud server, and includes an uplink access latency, a data transmission time for transmitting the portion of the plurality of computing tasks from one of the plurality of edge servers to the selected cloud server, a computing time for completing the portion of the plurality of computing tasks by the selected cloud server, and a downlink access latency; when the plurality of computing tasks are independent of each other, a total time for completing the plurality of computing tasks is the greater of the first time and the second time, and an optimal task partitioning is achieved when the first time is equal to the second time; when the plurality of computing tasks are required to be performed sequentially, the total time for completing the plurality of computing tasks includes the greater of the first time and a sum of the uplink access latency and the data transmission time, the computing time, and the downlink access latency, and the optimal task partitioning is achieved when the first time is equal to the sum of the uplink access latency and the data transmission time; each edge server includes an agent of multi-agent reinforcement learning in a Markov decision process (MDP), and the agent learns from an environment, and takes action to maximize a long-term accumulative reward; a state space includes the uplink access latency, the downlink access latency, and a data rate of a backhaul link between the edge server and the cloud server, observed at a current time slot; an action space includes the plurality of cloud servers as candidates for task offloading, and a task partitioning ratio having a number of discrete values; and a reward includes minus the total time for completing the plurality of computing tasks. 7. The meth

Assignees

Inventors

Classifications

  • Task transfer initiation or dispatching · CPC title

  • Offload · CPC title

  • Combinations of networks · CPC title

  • Probabilistic graphical models, e.g. probabilistic networks · CPC title

  • Reinforcement learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12223336B2 cover?
An edge network computing system includes: a plurality of terminal devices; a plurality of edge servers connected to the terminal device through an access network; and a plurality of cloud servers connected to the plurality of edge servers through a core network. Each edge server is configured to: receive a plurality of computing tasks originated from one of the plurality of terminal devices; u…
Who is the assignee on this patent?
Intelligent Fusion Tech Inc
What technology area does this patent fall under?
Primary CPC classification G06F9/44594. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 11 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).