Dynamic split computing framework in serverless edge computing

US12155535B2 · US · B2

Patent metadata
Field: Value
Publication number: US-12155535-B2
Application number: US-202318110083-A
Country: US
Kind code: B2
Filing date: Feb 15, 2023
Priority date: Oct 17, 2022
Publication date: Nov 26, 2024
Grant date: Nov 26, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Disclosed is a split computing device operating in a serverless edge computing environment. The split computing device includes a transceiver configured to receive resource information of a terminal from the terminal and to measure a data transmission rate between the terminal and the split computing device in a process of receiving the resource information of the terminal; and a splitting point deriver configured to determine a splitting point of a deep neural network (DNN) model for split computing and an activation status of a container instance for each of tail models of a DNN corresponding to the respective splitting points using the resource information of the terminal, the data transmission rate, and resource information of the split computing device.

First claim

Opening claim text (preview).

What is claimed is:

1. A split computing device operating in a serverless edge computing environment, the split computing device comprising: a transceiver configured to receive resource information of a terminal from the terminal and to measure a data transmission rate between the terminal and the split computing device in a process of receiving the resource information from the terminal; a splitting point deriver configured to determine a splitting point of a deep neural network (DNN) model and an activation status of a container instance for each of tail models of the DNN for split computing using the resource information of the terminal, the data transmission rate, and resource information of the split computing device, wherein the transceiver is configured to receive, from the terminal, intermediate data that is inference results for a head model of the DNN model for the determined splitting point; and an inference unit configured to derive final results of the DNN model by performing inference on the intermediate data to derive inference results for a tail model of the DNN model for the determined splitting point, wherein the inference results derived by the inference unit are transmitted to the terminal via the transceiver.

2. The split computing device of claim 1, wherein the splitting point of the DNN model determined by the splitting point deriver is transmitted to the terminal via the transceiver.

3. The split computing device of claim 1, wherein the received resource information of the terminal and the intermediate data are stored in a storage.

4. The split computing device of claim 1, wherein the resource information of the terminal includes available computing power of the terminal and the resource information of the split computing device includes available computing power of the split computing device, and wherein the splitting point deriver is configured to determine the splitting point capable of minimizing an inference latency while maintaining a computing power of the terminal below a first threshold and a computing power of the split computing device below a second threshold.

5. The split computing device of claim 4, wherein the inference latency is a sum of an inference latency of a head model for each splitting point, a transmission latency due to transmission of intermediate data corresponding to results of the head model, and an inference latency of a tail model.
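The selection rule in claims 4 and 5 can be illustrated with a minimal Python sketch. It is not the patented implementation: the `SplitCandidate` fields, the per-split profile numbers, and the brute-force search are all assumptions made for illustration, and the activation status of container instances is not modeled. It only shows latency computed as head latency plus transmission latency plus tail latency, minimized over the splitting points whose load stays below both thresholds.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class SplitCandidate:
    """Hypothetical profile of one candidate splitting point."""
    split_point: int
    head_latency_s: float    # head-model inference time on the terminal
    intermediate_bytes: int  # size of the head model's output
    tail_latency_s: float    # tail-model inference time on the split computing device
    terminal_load: float     # computing power used on the terminal (0..1, assumed scale)
    device_load: float       # computing power used on the split computing device (0..1)


def total_latency(c: SplitCandidate, rate_bytes_per_s: float) -> float:
    """Claim 5: head latency + transmission latency + tail latency."""
    return c.head_latency_s + c.intermediate_bytes / rate_bytes_per_s + c.tail_latency_s


def choose_split(
    candidates: Sequence[SplitCandidate],
    rate_bytes_per_s: float,
    first_threshold: float,
    second_threshold: float,
) -> Optional[SplitCandidate]:
    """Claim 4: minimize latency while both loads stay below their thresholds."""
    feasible = [
        c for c in candidates
        if c.terminal_load < first_threshold and c.device_load < second_threshold
    ]
    if not feasible:
        return None  # no splitting point satisfies both constraints
    return min(feasible, key=lambda c: total_latency(c, rate_bytes_per_s))


# Made-up profile numbers, for illustration only.
candidates = [
    SplitCandidate(0, 0.000, 600_000, 0.020, 0.00, 0.90),
    SplitCandidate(1, 0.010, 100_000, 0.015, 0.30, 0.50),
    SplitCandidate(2, 0.030, 20_000, 0.005, 0.70, 0.20),
    SplitCandidate(3, 0.080, 1_000, 0.000, 0.95, 0.00),
]

best = choose_split(candidates, rate_bytes_per_s=1_000_000,
                    first_threshold=0.8, second_threshold=0.8)
print(best.split_point)  # prints 2
```

With these made-up numbers, raising the measured rate to 10,000,000 bytes/s moves the choice to splitting point 1, because the transmission term shrinks and the earlier split no longer pays a large transfer cost. This is why the claims feed the measured data transmission rate into the splitting point deriver.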

Assignees

  • Univ Korea Res & Bus Found

Inventors

Classifications

  • Packet rate · CPC title

  • Inference or reasoning models · CPC title

  • Learning methods · CPC title

  • G06N3/045 · Primary

    Combinations of networks · CPC title

  • Energy efficient computing, e.g. low power processors, power management or thermal management · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12155535B2 cover?
Disclosed is a split computing device operating in a serverless edge computing environment. The split computing device includes a transceiver configured to receive resource information of a terminal from the terminal and to measure a data transmission rate between the terminal and the split computing device in a process of receiving the resource information of the terminal; and a splitting poin…
Who is the assignee on this patent?
Univ Korea Res & Bus Found
What technology area does this patent fall under?
Primary CPC classification H04L43/0894. Mapped technology areas include Electricity.
When was this patent published?
Publication date Nov 26, 2024 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).